tag:blogger.com,1999:blog-16081779.post3827922801919339603..comments2023-10-28T09:24:38.420+01:00Comments on iPhylo: OpenURL and spidersRoderic Pagehttp://www.blogger.com/profile/00269598293846172649noreply@blogger.comBlogger4125tag:blogger.com,1999:blog-16081779.post-62530427096199621392007-05-29T08:31:00.000+01:002007-05-29T08:31:00.000+01:00OK, I've added issue handling, it should parse the...OK, I've added issue handling, it should parse these OK now (fingers crossed). Thanks for the feedback.Roderic Pagehttps://www.blogger.com/profile/00269598293846172649noreply@blogger.comtag:blogger.com,1999:blog-16081779.post-82978581473065049882007-05-29T08:19:00.000+01:002007-05-29T08:19:00.000+01:00Thanks! It was a bit of an unfair test, as I omitt...Thanks! It was a bit of an unfair test, as I omitted the refs that I were listed as having dois when I grabbed them from the article I used, so I'm not surprised so few had them.<BR/><BR/>Issue numbers are still confusing this, e.g. in<BR/><BR/>Stolarsky, Kenneth (1976). "Beatty sequences, continued fractions, and certain shift operators". Canadian Mathematical Bulletin 19 (4): 473–482.<BR/><BR/>but that's easily worked around. I can see your little script being quite useful...Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-16081779.post-62439587279841480112007-05-29T08:10:00.000+01:002007-05-29T08:10:00.000+01:00David, sorry about this, there were two problems. ...David, sorry about this, there were two problems. The first is that the regular expressions I use didn't match the Wikipedia style. I've fixed this. The original code I use (ParaTools) comes with loads of templates, but I've commented these out with the aim of being able to easily debug just the ones I need.<BR/><BR/>The second issue is the presence of a "en dash" (–, Unicode symbol 2013, UTF8 E2 80 93) separating the pages, rather than a plain old hyphen (-). The Perl tools I use don't handle anything other than a hyphen. After some hair pulling I found a way to replace the en dash using a reguar expression ( <B>s/\xe2\x80\x93/\-/g</B>, courtesy of <A HREF="http://www.last.fm/forum/21716/_/84721/1#f1583506" REL="nofollow">shell-monkey</A>).<BR/><BR/>Now things should work better. The Kuhn paper is the only one with a DOI (<A HREF="http://dx.doi.org/10.1007/BF01584648/" REL="nofollow">doi:10.1007/BF01584648</A>).Roderic Pagehttps://www.blogger.com/profile/00269598293846172649noreply@blogger.comtag:blogger.com,1999:blog-16081779.post-71082883429717153792007-05-28T20:36:00.000+01:002007-05-28T20:36:00.000+01:00I tried copying and pasting in some doi-less refer...I tried copying and pasting in some doi-less references from Wikipedia (all consistently formatted using WP's citation templates), but it couldn't parse them. Maybe it should? Here's the list I tried:<BR/><BR/>Chandrasekaran, R.; Tamir, A. (1989). "Open questions concerning Weiszfeld's algorithm for the Fermat-Weber location problem".<BR/><BR/>Cockayne, E. J.; Melzak, Z. A. (1969). "Euclidean constructability in graph minimization problems.". Mathematics Magazine 42: 206–208. <BR/><BR/>Kuhn, H. W. (1973). "A note on Fermat's problem". Mathematical Programming 4: 98–107. <BR/><BR/>Wesolowsky, G. (1993). "The Weber problem: History and perspective". Location Science 1: 5–23. <BR/><BR/>Weiszfeld, E. (1937). "Sur le point pour lequel la somme des distances de n points donnes est minimum". Tohoku Math. Journal 43: 355–386.Anonymousnoreply@blogger.com