Skip to main content  



We will keep fighting for all libraries - stand with us!





 A line drawing of the Internet Archive headquarters building façade.  

Donate icon  An illustration of a heart shape   "Donate to the archive"  

Upload icon  An illustration of a horizontal line over an up pointing arrow.   Upload  

 erson's head and chest.    |    




Web icon  An illustration of a computer application window   Wayback Machine   Texts icon  An illustration of an open book.   Books   Video icon  An illustration of two cells of a film strip.   Video   Audio icon  An illustration of an audio speaker.   Audio   Software icon  An illustration of a 3.5" floppy disk.   Software   Images icon  An illustration of two photographs.   Images   Ellipses icon  An illustration of text ellipses.   More  





Internet Archive Audio


Live Music Archive   Librivox Free Audio  

Featured



All Audio
 
This Just In
 
Grateful Dead
 
Netlabels
 
Old Time Radio
 
78 RPMs and Cylinder Recordings
 


Top



Audio Books & Poetry
 
Computers, Technology and Science
 
Music, Arts & Culture
 
News & Public Affairs
 
Spirituality & Religion
 
Podcasts
 
Radio News Archive
 

Images


Metropolitan Museum   Cleveland Museum of Art  

Featured



All Images
 
This Just In
 
Flickr Commons
 
Occupy Wall Street Flickr
 
Cover Art
 
USGS Maps
 


Top



NASA Images
 
Solar System Collection
 
Ames Research Center
 

Software


Internet Arcade   Console Living Room  

Featured



All Software
 
This Just In
 
Old School Emulation
 
MS-DOS Games
 
Historical Software
 
Classic PC Games
 
Software Library
 


Top



Kodi Archive and Support File
 
Vintage Software
 
APK
 
MS-DOS
 
CD-ROM Software
 
CD-ROM Software Library
 
Software Sites
 
Tucows Software Library
 
Shareware CD-ROMs
 
Software Capsules Compilation
 
CD-ROM Images
 
ZX Spectrum
 
DOOM Level CD
 

Books


Books to Borrow   Open Library  

Featured



All Books
 
All Texts
 
This Just In
 
Smithsonian Libraries
 
FEDLINK (US)
 
Genealogy
 
Lincoln Collection
 


Top



American Libraries
 
Canadian Libraries
 
Universal Library
 
Project Gutenberg
 
Children's Library
 
Biodiversity Heritage Library
 
Books by Language
 
Additional Collections
 

Video


TV News   Understanding 9/11  

Featured



All Video
 
This Just In
 
Prelinger Archives
 
Democracy Now!
 
Occupy Wall Street
 
TV NSA Clip Library
 


Top



Animation & Cartoons
 
Arts & Music
 
Computers & Technology
 
Cultural & Academic Films
 
Ephemeral Films
 
Movies
 
News & Public Affairs
 
Spirituality & Religion
 
Sports Videos
 
Television
 
Videogame Videos
 
Vlogs
 
Youth Media
 




Mobile Apps



Wayback Machine (iOS)  

Wayback Machine (Android)  

Browser Extensions



Chrome  

Firefox  

Safari  

Edge  


Archive-It Subscription



Explore the Collections  

Learn More  

Build Collections  


Save Page Now

Capture a web page as it appears now for use as a trusted citation in the future.

Please enter a valid web address



About

Blog

Projects

Help

Donate

Contact

Jobs

Volunteer

People
 






Sign up for free  

Log in  


Advanced Search  



About  

Blog  

Projects  

Help  



Contact  

Jobs  

Volunteer  

People  


Advanced Search

 This form allows you to perform an advanced search.  You only need to fill in one field below.  This can be any field.  If you select "not" as your match criteria, you must select one other field.
Any field:
AND
Title:
AND
Creator:
AND
Description:
AND
Collection:
AND
Mediatype:
AND
AND
AND
AND
Date:
AND
Date range:
TO


Advanced Search returning JSON, XML, and more

This will return results in the format of your choice.
Query:

Fields to return (pick one or more):

(optional) Sort results by:





Number of results:

Page:


JSON format:
XML format:
save to file:
HTML table:
CSV format:
(show/hide help)
RSS format:
("Fields to return" ignored)




Help with CSV and Excel (show/hide help)

Instructions for using the advanced search engine to create reports for items pertaining to books... go to archive.org
click on Go!
click on "advanced search"
look half way down the page to the "advanced XML Search" window.
Type into the "Query" field- "contributor:(library of congress) AND publicdate:[2008-08-01 TO 2008-08-30]"

Important:

  • No " " required. Must have "()" around contributor name.
  • No spaces after ":"
By holding down the "Shift" Key, highlight the fields you want to eventually have exported to excel. For example- "date, identifier, imagecount and title"
Click the radio button for "CSV"
Hit "search". There might be a small delay as the search is executed. In this query, approx. 2143 results will be returned.
When prompted, save the ".csv" file to your hard drive and note the location saved.
Open Excel and open the previously saved ".csv" file. It will parse the data into columns and you now will be able to sort, sum, or otherwise manipulate the data.

Notes

  • You can try constructing a search query and hitting "Search" in the top "Advanced Search" section and then coming "back" to this section to copy that query into the "Query:" box.
  • "downloads" is the number of downloads for content in an item
  • "month" is the number of downloads in the last 30 days
  • "week" is the number of downloads in the last 7 days
  • We have a known bug in our site with using " TO『 in things like titles/descriptions. If you run into a problem and need to use 』TO ", use it lowercase and you will avoid this bug.

Dates and ranges

The following dates can be used for range queries:
  • addeddate
  • createdate
  • date
  • indexdate
  • publicdate
  • reviewdate
  • ribbon_state_modify_date
  • updatedate
The special date field, "oai_updatedate", can be used to mean all of the following dates combined:
  • addeddate
  • indexdate
  • publicdate
  • reviewdate
  • updatedate
The rationale for "oai_updatedate" date merging is to allow something like the Open Archives Initiative protocol get time-sortable lists of updated or added items.
  • addededate -- should be time the item was initially added to archive
  • indexdate -- should be last time item had a change that updated our search engine
  • publicdate -- should be, for *most* items, the time after its first derive is done (ie: has "public formats" for a /details/ page)
  • reviewdate -- updated to be the most recent time of all reviews (updated on new reviews and review changes)
  • updatedate -- mostly the time of the last /editxml/ page submit for an item


Example date ranges: updatedate:[2007 TO 2008] createdate:[2007-02-01 TO 2007-02-11]

For dates like "sponsordate" that are not able to do ranges, they can still use * wildcards, for example: sponsordate:200802*

Example queries

publicdate:[2008-02-01 TO 2008-03-01] AND contributor:smithsonian sponsordate:200802* AND mediatype:texts indexdate:[2008-02-01T00:12:00Z TO 2008-03-01T00:23:59Z]

Explanation of Terms, Operators, Queries, and Grouping

Here, you will find an explanation on how the syntax to the search engine works.

Our site uses a query syntax like the lucene query syntax from the Apache Lucene opensource search engine. A briefer explanation of the syntax follows.

Terms:
The search engine supports two types of terms, single terms and phrases.
A single term is a single word such as "test" or "hello".
A phrase is a group of words such as "duck and cover".

Field Specific queries:
If you want to restrict your search to a specific part of the metadata, you can append in front the term a Field Shortcut, such as:
  
will look for "Duck and Cover" only in the title of the items. For more information on Field Shortcuts, see section about Field Shortcuts Expansion
NOTE: searching in only a restricted set is better than using the default set for two reasons:

(一)its much faster

(二)the results are much more relevant


Boolean Operators:
By default, the operator is AND, which means that the query:
  
is exactly similar to test AND hello, which means that documents containing the term "test" AND the term "hello" will be returned.
The query:
  
will return documents either containing the term "test" OR the term "hello". The query:
  
will return documents that contain the term "test" but not the term "hello" NOTE: using the NOT operator, be sure to supply at least one term without the NOT, or search will fail (the query NOT test is not valid).

Range Queries:
range queries allow you to search for documents whose field match values in between two bounds, for example:
  
will return items whose download count is between 1000 and 2000. using [] will include the bounds in the search, and () will exclude the bounds from the search If you dont want to specify a bound, you can use the special keyword "null":
  
will search for items with more that 10000 downloads.

Grouping:
The search engine supports using parentheses to group clauses to form subqueries, for example:
  
you can also use this feature within a field:
  

Fuzzy Queries:
If you are not sure how to spell a word you can ask the search engine to try to figure it out for you. To do this, append a ~ after the term to apply it on, for example:
  
will find the words buttonwood, as well as cottonwood and buttonware. NOTE: this query involves a lot of computing, so use it wisely and expect to wait a little for results to come in!