diff options
| author | Christine Dodrill <me@christine.website> | 2021-01-14 22:36:34 -0500 |
|---|---|---|
| committer | GitHub <noreply@github.com> | 2021-01-14 22:36:34 -0500 |
| commit | d2455aa1c1bfc599a07966a7d717c1380d41bbc0 (patch) | |
| tree | c2b206aa41cd6f0e13d61b5455861f09ab5d1304 /blog | |
| parent | a359f54a91f4aeb914c69f59a02afabccd72450e (diff) | |
| download | xesite-d2455aa1c1bfc599a07966a7d717c1380d41bbc0.tar.xz xesite-d2455aa1c1bfc599a07966a7d717c1380d41bbc0.zip | |
Cache better (#296)
* Many improvements around bandwidth use
- Use ETags for RSS/Atom feeds
- Use cache-control headers
- Update to rust nightly (for rust-analyzer and faster builds)
- Limit feeds to the last 20 posts:
https://twitter.com/theprincessxena/status/1349891678857998339
- Use if-none-match to limit bandwidth further
Also does this:
- bump go_vanity to 0.3.0 and lets users customize the branch name
- fix formatting on jsonfeed
- remove last vestige of kubernetes/docker support
Signed-off-by: Christine Dodrill <me@christine.website>
* expire cache quicker for dynamic pages
Signed-off-by: Christine Dodrill <me@christine.website>
* add rss ttl
Signed-off-by: Christine Dodrill <me@christine.website>
* add blogpost
Signed-off-by: Christine Dodrill <me@christine.website>
Diffstat (limited to 'blog')
| -rw-r--r-- | blog/site-update-rss-bandwidth-2021-01-14.markdown | 69 |
1 files changed, 69 insertions, 0 deletions
diff --git a/blog/site-update-rss-bandwidth-2021-01-14.markdown b/blog/site-update-rss-bandwidth-2021-01-14.markdown new file mode 100644 index 0000000..ce68c48 --- /dev/null +++ b/blog/site-update-rss-bandwidth-2021-01-14.markdown @@ -0,0 +1,69 @@ +--- +title: "Site Update: RSS Bandwidth Fixes" +date: 2021-01-14 +tags: + - devops + - optimization +--- + +# Site Update: RSS Bandwidth Fixes + +Well, so I think I found out where my Kubernetes cluster cost came from. For +context, this blog gets a lot of traffic. Since the last deploy, my blog has +served its RSS feed over 19,000 times. I have some pretty naiive code powering +the RSS feed. It basically looked something like this: + +- Write RSS feed content-type and beginning of feed +- For every post I have ever made, include its metadata and content +- Write end of RSS feed + +This code was _fantastically simple_ to develop, however it was very expensive +in terms of bandwidth. When you add all this up, my RSS feed used to be more +than a _one megabyte_ response. It was also only getting larger as I posted more +content. + +This is unsustainable, so I have taken multiple actions to try and fix this from +several angles. + +<blockquote class="twitter-tweet"><p lang="en" dir="ltr">Rationale: this is my +most commonly hit and largest endpoint. I want to try and cut down its size. +<br><br>current feed (everything): 1356706 bytes<br>20 posts: 177931 bytes<br>10 +posts: 53004 bytes<br>5 posts: 29318 bytes <a +href="https://t.co/snjnn8RFh8">pic.twitter.com/snjnn8RFh8</a></p>— Cadey +A. Ratio (@theprincessxena) <a +href="https://twitter.com/theprincessxena/status/1349892662871150594?ref_src=twsrc%5Etfw">January +15, 2021</a></blockquote> <script async +src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> + +[Yes, that graph is showing in _gigabytes_. We're so lucky that bandwidth is +free on Hetzner.](conversation://Mara/hacker) + +First I finally set up the site to run behind Cloudflare. The Cloudflare +settings are set very permissively, so your RSS feed reading bots or whatever +should NOT be affected by this change. If you run into any side effects as a +result of this change, [contact me](/contact) and I can fix it. + +Second, I also now set cache control headers on every response. By default the +"static" pages are cached for a day and the "dynamic" pages are cached for 5 +minutes. This should allow new posts to show up quickly as they have previously. + +Thirdly, I set up +[ETags](https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/ETag) for the +feeds. Each of my feeds will send an ETag in a response header. Please use this +tag in future requests to ensure that you don't ask for content you already +have. From what I recall most RSS readers should already support this, however +I'll monitor the situation as reality demands. + +Lastly, I adjusted the +[ttl](https://cyber.harvard.edu/rss/rss.html#ltttlgtSubelementOfLtchannelgt) of +the RSS feed so that compliant feed readers should only check once per day. I've +seen some feed readers request the feed up to every 5 minutes, which is very +excessive. Hopefully this setting will gently nudge them into behaving. + +As a nice side effect I should have slightly lower ram usage on the blog server +too! Right now it's sitting at about 58 and a half MB of ram, however with fewer +copies of my posts sitting in memory this should fall by a significant amount. + +If you have any feedback about this, please [contact me](/contact) or mention me +on Twitter. I read my email frequently and am notified about Twitter mentions +very quickly. |
