2018-02-27
by Jerome Choo- URL Report downloads are now sorted in newest-first order
- Crawlbot now indexes the seed URL of each extracted object in the
fromSeedUrl
field.
fromSeedUrl
field.Crawlbot and Bulk Service data retrieval no longer requires access to port :18100. Data downloads are also now HTTPS-only.
url
value would retain HTML escaping if present within the original page source.<video>
elements could be returned in the Article API.Fixed an issue in the Global Index in which complicated Boolean (OR) queries would return no results.
brand
detection in the Product API.humanLanguage
could be mis-identified on some Spanish-language pages.