August 2020

by Jerome Choo

We've launched the Diffbot Event API BETA designed to facilitate the extraction of online and in-person event data for individual events. Support for multi-day, multi-track events, i.e. full conferences and festivals, is planned for a future release.

To try out the API directly, insert your token into this API call:
http://api.diffbot.com/v3/event?token=YOURTOKEN&url=https://www.meetup.com/BAS-G1Outdoors/events/271921536/
or use the Analyze API.

The BETA program enables you to extract data for

July 2020

by Jerome Choo

A new Excel add-in version (1.5.0.0) is available in production (available at office.com & for your desktop ). What's new?

  • Match score is part of Enhance output (data enrichment)
  • Search articles by sentiment (Article-level filter)
  • Pull query parameter values from a designated spreadsheet
  • Drag and drop in the outputs tab to reorder columns returned

Support for Deep Pagination: streaming KG search results. You are now able to remove the 1000 result limit on KG queries. For Example:

curl -s "https://kg.diffbot.com/kg/v2/dql_endpoint/stream?token=TOKEN&size=1500&query=type:Organization%20nbEmployeesMax%3E2%20nbEmployeesMax%3C10%20location.country.name:%22Venezuela%22" > stream.es.jsonl

or

curl -s "https://kg.diffbot.com/kg/v2/dql_endpoint/stream?token=TOKEN&size=1500&query=type:Article%20tags.uri:%22https://diffbot.com/entity/BEAlC2Xn3M16WkrunMNhqvw%22%20publisherRegion:'North%20America'" > stream.gb.jsonl

April 2020

by Jerome Choo

Now you can track the sentiment for a company, brand, or industry over time. For example, the following query will show only articles that are negative towards Apple Inc, sorted by date:
type:Article tags.{uri:"http://diffbot.com/entity/CHb0_0NEcMwyY8b083taTTw" sentiment<0.0} sortBy:date.timestamp

Or, count the number of negative/positive articles by day/week/month for the last 365 days/52 weeks/12 months. For example, the following query shows the number of articles that are positive towards Apple Inc grouped by week:
type:Article tags.{uri:"http://diffbot.com/entity/CHb0_0NEcMwyY8b083taTTw" sentiment>0.0} facet[week]:date

Article Search has also been added to the Diffbot Excel Add-In. If you haven't installed the Add-In yet, try it out in Office365 or on your desktop. Download it here.

March 2020

by Jerome Choo

Announcing the Diffbot Google Sheets integration for the Diffbot Knowledge Graph! Now you can search the graph and enrich your data from within a GSheet (you may need to coordinate with your IT team if you are using a corporate/office-sponsored Google Suite account vs. a personal one).

We also made an update to the AccountAPI to ease management and tracking of the usage of your Diffbot services including entities downloaded and credits consumed.

February 2020

by Jerome Choo
  • We added support for a quarterly revenues attribute sourced from SEC filings for US corporate entities: type:Organization quarterlyRevenues.quarter:"Q1-2020"
  • You can now facet on NAICs code names: type:Organization nbEmployeesMax>100000 facet:naicsClassification.name
  • We added over 15k open-source academic journals to the list of Diffbot Knowledge Graph article sources.
  • You can now search the Knowledge Graph and enhance firmographic data profiles from the Diffbot Excel Add-In.
  • We added Diffbot Knowledge Graph ontology reference documentation.
  • We expanded Diffbot Query Language (DQL) docs.

May 2020

by Jerome Choo
  • Natural Language API (BETA) & Demo: https://demo.nl.diffbot.com/
    • The demo combines graph visualization and table visualization (see the tabs 'Entities' and 'Facts') to show how the natural language API can be used to build knowledge graphs from text and the web. Double-click on a node to expand it with facts from the Diffbot KG.
    • NL API Documentation: https://docs.diffbot.com/docs/en/nl-index
  • Support for querying Discussions by sentiment (e.g. type:Article title:"Coronavirus" discussion.posts.sentiment>0.5 sortBy:date)

January 2020

by Jerome Choo
  • Improved handling of tables and lists in Article data to better support Entity tagging and linking.
  • Optimized entity tagging in the Article Title. It now occurs when the same entity is mentioned in the title and text of the Article.
  • Improved location data extraction in the AnalyzeAPI for events.
  • Added a Diffbot Excel Plug-in to enable our clients to TestDrive Diffbot's data enrichment API. The beta service currently supports organization firmographic profile data enrichment.

December 2019

by Jerome Choo
  • Launched new Renderer architecture in support of Crawlbot and DiffbotAPI services.
  • Added descriptions to 90+ million Organization entities in the KG.
  • Added 300+ local US news sources to Article data.

November 2019

by Jerome Choo
  • Improved date/time handling
  • Improved linking of Board Members to Organizations
  • Added revisit/update frequency signals based on whether or not a profile was accessed in the last 30 days.

October 2019

by Jerome Choo
  • Added Longitude and/or Latitude data to 53M Organizations
  • Added sicClassification attribute to Organizations
  • Created a more robust employment category taxonomy and ML model in support of employment data
  • Improved coverage of parentCompany attribute for subsidiary organization entities
  • Normalized stock exchange labels to improve filtering and discoverability.
  • Deployed bug fixes to developer Dashboards