Recent Messages
-
Tagged data
http://www.grouplens.org/node/73 http://www.grouplens.org/system/files/README_10M100K.html flip -- http://www.infochimps.org
-
Convert Geogratis shapefile to lat/lons
Hi, I'm trying to convert the Population Ecumene Census Division Boundary File (http://geogratis.cgdi.gc.ca/geogratis/en/option/select.do?id=426) from its e00 format to a series of WGS84 lat/lon points. I've tried doing this by first converting the e00 file to a shapefile
-
Tagged data
I should have read the site more carefully. :-) http://theinfo.org/get/data links to the citeulike database, which is definitely enough to get me started. Still, if anyone can suggest other sources it would be appreciated.
-
Tagged data
Hi, I'm looking for some sort of user tagged dataset - e.g. URLs from delicious, photos from flickr, that sort of thing. I don't really care what's being tagged other than that there's some unique identifier for it.
-
Twitter Scrape (rough draft)
Hi Philip, Any updates on this? Has it been pulled indefinitely? Nitin [from the http://groups.google.com/group/get-theinfo mailing list]
-
GIS land information?
Hi all, I'm looking for some datasets related to the U.S. land information system, particularly related to average temperatures, sunlight, humidity, other weather data points, etc. There only seems to be a Google Maps mashup for global average temperature, and then there's
-
translating addresses to geopoint
http://download.geonames.org/export/zip/ Kickass. flip On Sun, Sep 28, 2008 at 7:03 PM, Philip (flip) Kromer
-
Announcing Datahub 0.7
http://lucasmanual.com/mywiki/DataHub Datahub is a tool for faster downloading/crawling, parsing, loading, and visualizing of data. It achieves this by letting you divide each step into its own work folder. In each work folder you get sample files that you can start coding from.
-
Twitter Scrape (rough draft)
Hey y'all, I've gathered a massive scrape of the Twitter friend graph: about 2.7M users (and slowing, meaning I'm starting to find the edge), 10M tweets, 58M edges, with pretty-near complete edge data for users with more than a dozen followers.
-
List of blog feeds?
Does anyone have a large list of blog feeds, preferably RSS feeds? I see that http://share.opml.org/, which was used for sharing OPML files, has been taken down. Thanks, Joseph
-
who owns the masterpieces locked up at the National Galleries?
Hi all - In May of 2007, Carl Malamud uploaded 6,288 Smithsonian photographs that are in the public domain to Flickr. They are all still there today. (Including some classic Edward Muybridge cyanotypes -- see http://www.flickr.com/photos/publicresourceorg/999397141/in/set-72157601200446023/).
-
The NYT annotated corpus
http://ckan.net/package/read/nyt-corpus It's a shame they can't just put it on archive.org :-) J.
-
Change.gov CC License
Hi Ian, Most .gov sites are run by the Federal Government and copyright law says the Federal Government can't get copyrights -- everything they do is in the public domain. change.gov is an exception because it's not actually produced by the Federal Government; it's run by The Obama-Biden Transition Project,
-
Change.gov CC License
Hi, Hoping someone can jump in here, I am writing a post for beyondchron.org about Gavin Newsom's use of YouTube for his "State of the City Address." Are government websites (.gov) presumed to be public record? I interpret much of the activity done by OGOSH people as riding on this
-
aws public data sets
http://aws.amazon.com/publicdatasets/ Amazon is hosting several large data sets on EC2 so that people can use EC2 to do computations more easily.
-
The NYT annotated corpus
Aaron - Thanks for the info, but unless I'm missing something, actually getting one's hands on the corpus data looks like a non-trivial task. The link you provided (http://corpus.nytimes.com/) redirects to a Google group that provides an informational overview, but no data.
-
The NYT annotated corpus
http://corpus.nytimes.com/ The New York Times Annotated Corpus is a collection of over 1.8 million articles annotated with rich metadata published by The New York Times between January 1, 1987 and July 19, 2007. With over 650,000 individually written summaries and 1.5 million
-
bulk.altlaw.org
Hi folks, http://bulk.altlaw.org/ This is an experiment to see if anyone's interested. Right now, it's got the collections of PDF/HTML/WordPerfect files we downloaded from the appeals courts. In the coming weeks, I'll add data dumps from our
-
HarvestMan crawler 2.0 beta released!
How are you crawling and downloading websites, files, images? Do you need something better? It's time for a change! Download the beta version of the HarvestMan crawler today! HarvestMan is a modular, extensible and flexible web crawler program
-
Open knowledge events in London on 1st, 6th, 8th November
Hi all, We're hosting three open knowledge events in London this November that * Workshop on Finding and Re-using Public Information - Saturday 1st November 2008, London Knowledge Lab - http://okfn.org/wiki/PublicInformation
-
us government sponsored research?
Hello, 1. Would anybody know where I can get a list of U.S. government sponsored research (a list of research, contact info, etc.)? 2. The findings of that research? If the research was sponsored by the U.S. government, shouldn't it be in the public domain?
-
Freebase as Linked Data
I'm forwarding/cross-posting the below from the Freebase Developer email list. This is the first I've heard of the "Linked Open Data" community. Appears to be RDF focused, but also represents another take on the data commons. Josh Tauberer's release of the SEC ownership data is mentioned on their wiki
-
translating addresses to geopoint
IP Address Geolocation http://www.ip2location.com ZIP Code Geolocation http://www.zipcodeworld.com GIS Geolocation
-
zipcodes as a geo-database
You might want to get the Premium Edition database from http://www.zipcodeworld.com
-
open search implementations?
Hello, Has anybody used or implemented the following open search format? http://www.opensearch.org Thanks, Lucas