In the not so good old time, when I hadn’t yet seen the green grass in the land of Python, I was building websites in PHP. And one of them needed a way to identify (i.e. guess) the language of arbitrary text.
One of the most well known open source tools for this seems to be Maciej Ceglowski’s languid, written in Perl. Three years ago, I had this ported to PHP through a RAC project (meaning: While I do own the copyright, I have not written the code). And today, while cleaning up my repositories, I stumbled over it again and decided I might just was well put it our there.
So here it is:
https://launchpad.net/php-languid
http://github.com/miracle2k/php-languid
I also briefly considered porting it to Python, but fortunately someone else has already done that:
Hi Michael,
I would really need this class for some text-processing I’m doing. Could you send it to me or point me to a page where I can download it, since I only found a page where you can download all files one-by-one, and that takes a lot of time ๐
LikeLike
Gustaf, currently you will have to download bzr und checkout the source (using bzr branch lp:php-languid)
LikeLike
Okay, thanks a lot! Didnt know about that versioning system.
Now I’m off to analyzing the internet ๐ We’re making a database of all webpages with their respective languages , so this comes really handy!
LikeLike