OpenRefine extensions

You can install the following extensions to add functionality to OpenRefine. We do not vouch for the function of any of these extensions. Some only work with earlier versions of OpenRefine, or with third-party distributions. See how to install an extension.

Extension Name | Description | Authors | Latest compatible version
RDF Transform | Transform data into RDF formats. Based on the RDF extension. However, it has been thoroughly rewritten to incorporate the newer Java and JavaScript technologies, techniques, and process enhancements. | AtesComp | 3.6.0, 3.7-SNAPSHOT
OSM Extractor | Adds a new import option that allows you to import OpenStreetMap data into OpenRefine using the Overpass API. Also adds a new GREL function called getInteriorPoint() which extracts the interior point of any Geometry (represented using Well-Known Text). | Labian Gashi | 3.5.0
GeoJSON Export | Adds a Graphical User Interface (GUI) that allows you to export OpenRefine data to the GeoJSON format. Supports latitude/longitude coordinates and Well-Known Text (WKT). | Labian Gashi | 3.5.0
FAIR metadata | Supports FAIR metadata by integrating with FAIR Data Point to store your data and export to FAIR. It is a successor of the legacy FAIRifier OpenRefine 2.6 fork. Manual. | FAIRDataTeam | 3.4.1
VIB-Bits | Adds history tools, pivot tool, cross() graphical user interface (GUI), and scatterplot tool. Manual PDF. | VIB-Bits | 3.4.1
Named-Entity Recognition | Extracts identifiable or reconcilable entities from text fields. It supports AlchemyAPI, DBpedia Lookup, and Zemanta. Manual. | Ruben Verborgh and Stuart Kenny | 3.4.1
String-Transformers | Adds a collection of Java string transformers to Jython functions. Some generic, some aimed at botany and taxonomy. Download this and this and follow their installation instructions. | RBGKew | 3.4.1
RDF extension | RDF Refine - an improved fork for exporting RDF. Adds a graphical user interface (GUI) for exporting data in RDF format. | stkenny & DRI | 3.4.1
GOKb Utilities | Adds three new features: prepend rows (add new blank rows); trim all data (remove preceding/trailing whitespace from across all cells); extractHost() (GREL function that extracts a host name from a URL). | GOKb & Owen Stephens | 3.1
Stats extension for Google Refine 2.5+ | Computes elementary statistics on column data. | sparkica | 3.1
CKAN Storage Extension | Exports data to CKAN Storage 2.2+ using CKAN storage API. | Ontodia | 3.1
GeoRefine | Adds geospatial processing GREL functions. | Ryan Baumann | 2.8
D2Refine | A clinical study metadata harmonization and validation workbench to 1) convert restrictions (specified in a spreadsheet-like interface) into a clinical model; and 2) provide reconciliation services leveraging the Common Terminology Services 2 (CTS2). Manual. | Deepak K. Sharma | 2.7
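
To make two of the GOKb Utilities features listed above more concrete, here is a rough Python analogue of "trim all data" and extractHost(). This is only a sketch of the behaviour as described in the table, not the extension's own code; the function names extract_host and trim_cells are hypothetical, and the extension's handling of edge cases (ports, malformed URLs, non-string cells) may differ.

```python
from urllib.parse import urlparse


def extract_host(url):
    """Return the host name of a URL, or None when the text has no host part.

    Hypothetical stand-in for the extractHost() GREL function.
    """
    return urlparse(url).hostname


def trim_cells(rows):
    """Strip leading/trailing whitespace from every string cell.

    Hypothetical stand-in for "trim all data"; non-string cells are kept as-is.
    """
    return [
        [cell.strip() if isinstance(cell, str) else cell for cell in row]
        for row in rows
    ]


if __name__ == "__main__":
    print(extract_host("https://openrefine.org/docs"))
    print(trim_cells([["  a ", 1], [None, "b\t"]]))
```

The standard-library urlparse is used here so the sketch stays self-contained; inside OpenRefine you would call the GREL function itself.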

Legacy Extensions

These extensions were produced in the past and are good examples of desired uses of OpenRefine. They are maintained here for historical reference, and in case a contributor would like to renew them.

Extension Name | Description | Authors | Suggested compatible version
LODRefine | LODRefine is actually OpenRefine with integrated extensions that make the transition from tabular data to Linked Data a bit easier. Integrated extensions are: RDF extension, DBpedia extension, CrowdFlower crowdsourcing extension, Stats extension. | Sparkica |
VIB-BITS Diff plugin | Adds a "diff" tool which compares two text values. Manual PDF. | VIB-Bits | 3.4.1. The VIB website states "We are investigating how to provide [the diff plugin] again with version 3.4 of OpenRefine."
VIB-BITS Save facets plugin | Adds support for saving facet settings and re-applying the same facets at a later time. Manual PDF. | VIB-Bits | Unknown. The VIB website states "We are investigating how to provide [the diff plugin] again with version 3.4 of OpenRefine."
geoXtension | geoXtension is based on GDAL, Proj, and GEOS libraries. Adding this extension takes a lot of time; usage of the ready-made docker is recommended. | giTorto | (circa 2.6)
extraCTU-plugin | Extracts e-mails, telephone numbers, URLs and identification numbers from text. | giTorto | (circa 2.7)
Opentree | Displays phylogenetic trees from the Open Tree of Life. Manual. | nickynicolson | 2.5
BioVeL | Adds functions for cleaning & clustering biodiversity data. The entire package can be installed and run locally as well as on a dedicated server. | Botanic Garden and Botanical Museum Berlin-Dahlem | 2.5
DBpedia extension | Adds columns from DBpedia to reconciled data. Data has to be reconciled with DBpedia or a SPARQL endpoint which returns DBpedia resource types. | Zemanta | 2.5
Stats extension for Google Refine 2.1 | Computes elementary statistics on column data. | The Chicago Tribune | 2.1

Reconciliation services

OpenRefine can connect to several reconciliation services. You can find information about them at:

Client libraries

You can automate some OpenRefine operations using one of the existing libraries. These libraries use the OpenRefine API.
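
To give a sense of what such a library does under the hood, here is a minimal Python sketch that talks to the OpenRefine HTTP API directly. It assumes OpenRefine is running locally at its default address (http://127.0.0.1:3333/) and that the get-all-project-metadata command returns a JSON object with a "projects" map, as we understand the API; the helper names command_url, project_names and list_projects are hypothetical. Check the API documentation for your OpenRefine version before relying on it.

```python
import json
from urllib.parse import urljoin
from urllib.request import urlopen

# Assumption: default address of a locally running OpenRefine instance.
DEFAULT_BASE = "http://127.0.0.1:3333/"


def command_url(command, base=DEFAULT_BASE):
    """Build the URL of a core API command, e.g. 'get-version'."""
    return urljoin(base, "command/core/" + command)


def project_names(metadata):
    """Map project id -> project name from a project-metadata payload.

    Assumes the payload looks like {"projects": {"<id>": {"name": ...}}}.
    """
    projects = metadata.get("projects", {})
    return {pid: info.get("name") for pid, info in projects.items()}


def list_projects(base=DEFAULT_BASE):
    """Fetch project metadata from a running instance and return id -> name."""
    with urlopen(command_url("get-all-project-metadata", base)) as resp:
        return project_names(json.load(resp))


if __name__ == "__main__":
    # Requires a running OpenRefine instance at DEFAULT_BASE.
    for project_id, name in list_projects().items():
        print(project_id, name)
```

The existing client libraries wrap calls like these, and typically also handle details such as CSRF tokens for commands that modify data, so they are usually a better starting point than raw HTTP requests.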