From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp10.migadu.com ([2001:41d0:8:6d80::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms5.migadu.com with LMTPS id OM/xJJwOwmJr7gAAbAwnHQ (envelope-from ) for ; Sun, 03 Jul 2022 23:48:12 +0200 Received: from aspmx1.migadu.com ([2001:41d0:8:6d80::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp10.migadu.com with LMTPS id qPcJJJwOwmLwwAAAG6o9tA (envelope-from ) for ; Sun, 03 Jul 2022 23:48:12 +0200 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 2605625525 for ; Sun, 3 Jul 2022 23:48:12 +0200 (CEST) Received: from localhost ([::1]:40126 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1o87Rq-0006Oc-EE for larch@yhetil.org; Sun, 03 Jul 2022 17:48:10 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:56204) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1o87RE-0006OG-Ey for emacs-orgmode@gnu.org; Sun, 03 Jul 2022 17:47:32 -0400 Received: from mail-pl1-x62e.google.com ([2607:f8b0:4864:20::62e]:38862) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1o87RC-0000YL-QI for emacs-orgmode@gnu.org; Sun, 03 Jul 2022 17:47:32 -0400 Received: by mail-pl1-x62e.google.com with SMTP id m14so7006133plg.5 for ; Sun, 03 Jul 2022 14:47:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=references:user-agent:from:to:cc:subject:date:in-reply-to :message-id:mime-version; bh=vQGKM2Z4qGBMbff0SEvv2t56Vy7AIHZV1HjPzp22oBU=; b=PTMKEHXyY+hlKhIEm1J0zwUTI3cAkoZ3hWLETZpCmGO0CYUxIsDvCpiOLS6zWO2kQH ZlGRnYTmXC1OGlOLHdigcYPdCt9lZgdwz2dbY/SuTLas5Tqv8RMnr0Lcxhhu2gBxjWBb R2xKC5xL8fDHfFYbGVSznqo2g0wN+/6tMuNRZNw9oTj0R7BePdSi6y/NU9shkGEIfuiy g9YCmZuLBLLKnYYRNKcAmnpB8LYprQm5wZKSVdZkady5PDx/22hx8UMLMeubTCzC3jFq M+cHta2AGyhWn3s2ruHfXpfux7Ww0+nKuNr765iGGCi52j0d4e1pSnfqIwsLKBzvu6bU 8hxg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:references:user-agent:from:to:cc:subject:date :in-reply-to:message-id:mime-version; bh=vQGKM2Z4qGBMbff0SEvv2t56Vy7AIHZV1HjPzp22oBU=; b=XWl7hT3Bs8ZfvNAkmFc81QSjmwFkEg21gUPVkG2TftzXUR/v4cVyI+MKfuXGDJxgu+ kDkw3QNEcTDfXlcY2OKK0ObeQNToWGPG2i+4ljQScUI5NZsYnS7PA7W/exUsCkm2Yavc sdrqjAv1v6GKsZqlWwdQU4u1ToVwmrotbAQWwpyI/+BsAp5UTS+ygLI7Px9m9Tuh1Ei3 NMX5PBNPIrKF7rPq3123S89AvJ4x6m/QIyK9+HP049uA4g9lKP9k/hCQQ5qVyw56/d6w Od7QNUfGFgQeb+gAHyxJIYPnycQ8YPNEtQnlvFeOUp/S9UrJP2FlAS99R95ne20CMYmb gBug== X-Gm-Message-State: AJIora9JloxPOTs+ltlDmDpDZ/rgqxfqOac6U0cCAT08V7QEj2gU/cWG kajJc4C7EpD3CZHvp3ZujubBqlKotXBwig== X-Google-Smtp-Source: AGRyM1tJqH/hRc+LT4q3wl3S2w8L0yNxlAZeveK4Vyxx0S+4liodhpYEOlj+oFAWYyaL1iv+QEKDYg== X-Received: by 2002:a17:902:c14a:b0:16b:d07c:ad60 with SMTP id 10-20020a170902c14a00b0016bd07cad60mr9379044plj.141.1656884849181; Sun, 03 Jul 2022 14:47:29 -0700 (PDT) Received: from dingbat (2001-44b8-31f2-bb00-1877-6b53-d829-d2af.static.ipv6.internode.on.net. [2001:44b8:31f2:bb00:1877:6b53:d829:d2af]) by smtp.gmail.com with ESMTPSA id mt7-20020a17090b230700b001ef7bd5b711sm2266788pjb.0.2022.07.03.14.47.27 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 03 Jul 2022 14:47:28 -0700 (PDT) References: <878rpah0tx.fsf@localhost> <87wncu8iuw.fsf@gmail.com> <87wncufg6l.fsf@localhost> User-agent: mu4e 1.8.3; emacs 29.0.50 From: Tim Cross To: Max Nikulin Cc: emacs-orgmode@gnu.org Subject: Re: [BUG] The documentation webserver gives 404s [9.5.4 (release_9.5.4-3-g6dc785 @ )] Date: Mon, 04 Jul 2022 07:30:23 +1000 In-reply-to: Message-ID: <87o7y598cz.fsf@gmail.com> MIME-Version: 1.0 Content-Type: text/plain Received-SPF: pass client-ip=2607:f8b0:4864:20::62e; envelope-from=theophilusx@gmail.com; helo=mail-pl1-x62e.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-orgmode@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+larch=yhetil.org@gnu.org Sender: "Emacs-orgmode" X-Migadu-Flow: FLOW_IN X-Migadu-To: larch@yhetil.org X-Migadu-Country: US ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1656884892; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:in-reply-to:in-reply-to: references:references:list-id:list-help:list-unsubscribe: list-subscribe:list-post:dkim-signature; bh=vQGKM2Z4qGBMbff0SEvv2t56Vy7AIHZV1HjPzp22oBU=; b=L04ZRAsOLK8ilSgpf1k2Q51nVV4SEhGYIndhSmzWPqf8CHMu0o1bqkdz7n/Q+GfY5TGD4b d/5WRay/iSwaxfxGqpWjSY9P3HQInekdufzTDftHj0ycZXfKKXpDF9Im40KXJqfVzBizDg 7t0AsWAg/hqlTVr9A9AF/6+n+dw5rxfCIHx42hIQYJEfimqvTbAHAbLg1eEy26o8NnGGCs SY9UGWUT7wtOQjanwKReJlk1is0LbNR1K/cAK44QmaOSxFVc/Ymj/Q8sRk7GFZg1uFNMMi 0u+iji997FZQGDZp/q3Y3UqLb3JHTsvFdfijsSnBXzvWLABgGW+9qS7JorXg5w== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1656884892; a=rsa-sha256; cv=none; b=rqqfLzwUbIJjxWErUCNmK2hh2QNIBdsF3oUyNVjnBZc/Ty4teYnW+U9Bc8eFOydQur5Xkj C6XypS3mHTMU2jGpAHMjG3ctYq2SYZAVKXXy5zEjUv5q9ChVUR2Cchi9eFVa+bAS26493Y /y16f7/ZBu3xW+MYCW8hXPUmbh2A1LBBWUsZe3HJc4Msm6hD4XvQFCOWRaHBLPZ7+Nlf+C N/hvXNXeqlmHZpQ+r4dm5grYWz7UzAbla5lmwPVry/LTFV9EXkAbTgtiZbsIq6l/WM/yQy D8BVn8ojw0DIE8Gt1IEeQFGWZ8/w+utjSukXgf1xBl4kQ7BdnNUxDh8lhfLeBQ== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=pass header.d=gmail.com header.s=20210112 header.b=PTMKEHXy; dmarc=pass (policy=none) header.from=gmail.com; spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org" X-Migadu-Spam-Score: -5.44 Authentication-Results: aspmx1.migadu.com; dkim=pass header.d=gmail.com header.s=20210112 header.b=PTMKEHXy; dmarc=pass (policy=none) header.from=gmail.com; spf=pass (aspmx1.migadu.com: domain of "emacs-orgmode-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="emacs-orgmode-bounces+larch=yhetil.org@gnu.org" X-Migadu-Queue-Id: 2605625525 X-Spam-Score: -5.44 X-Migadu-Scanner: scn0.migadu.com X-TUID: 09ps0PyFBnHE Max Nikulin writes: >> >>>>> I noticed that the Org documentation server gives 404 Not Found for a >>>>> large number of links published all over the internet because (1) it >>>>> parses URLs case-sensitively and (2) the case has changed at some point. >>>>> I stumble upon such 404s errors daily. See, for instance, the link to >>>>> Column Groups documentation in this Stack Overflow answer: >>>>> >>>>> https://stackoverflow.com/a/8570307/1306956. > > On 03/07/2022 21:01, Ihor Radchenko wrote: >> Then, I am also CCing Bastien. >> As an idea, the "fix" can be creating symlinks to alternative file >> names. But we first need to figure out the difference between old/new >> naming schemes. > > Previous discussion of the issue: > > Ori. Re: Possible bug report: URL capitalization in online manual. Sun, 9 Feb 2020 > 13:49:04 -0500 > https://list.orgmode.org/CACyCJqX3xrJsctFEmTNUp6HwzGE1M83KGnc9atYt_LUpgk69Bw@mail.gmail.com/ Thanks Max, that helped fill in some of the blanks and confirms some of what I assumed. Seems that the change to letter case was a deliberate change, as was removal of the old pages without upper case letters following the initial uppercase letter in the filename. Making the server case insensitive is not straight-forward. Essentially, we would need to add a regular expression based rewrite rule OR we would need to add copies or perhaps setup symbolic links. While all possible, it will add to maintenance overhead and will likely be a source for future bugs. I do have an alternative suggestion which may help. Given that the 'broken' URLs are actually from external links to old documentation which has been removed, what we could do is create a more informative 404 page. Once users are on the 'real' site, the case issue does not exist. It is only a problem due to outdated URLs on external sites like stack overflow. Instead of just saying 404 Not Found, the page could say the requested URL was not found and is likely a link to old outdated doc documentation. The page could include a link to the main orgmode page. This would be a fairly simple 'fix' that would improve user experience to some degree.