https://hal.archives-ouvertes.fr/hal-02404932
Raw File
Tip revision: 85e5e2771cb1f80dd6813f378aed7b3efa460597 authored by Software Heritage on 11 December 2019, 14:42:19 UTC
hal: Deposit 353 in collection hal
Tip revision: 85e5e27
README.md
**Presentation**

Extractify is a free extension for Chrome, developed in JavaScript under Bracket, whose purpose is to scrape structured data on the web. It is particularly designed for collecting comments or online conversations such as forums.

It allows you to: 
1) Select structured information on a web page (like tables with rows and columns), by direct selection on the web page, or manual selection by entering HTML tags and related CSS code
2) Select the pagination of pages with the same structure and level
3) Repeat the process as many times as desired for lower levels
4) Scrape the whole selection
5) Finally, obtain a file in json format that can be easily imported in other software, [in L@ME for example](https://github.com/fredericvergnaud/lame).

What it does not allow: everything else!

**Manual installation for Chrome**

1.	Go to [Releases](https://github.com/fredericvergnaud/extractify/releases) to download the latest version and unzip it
2.	In Chrome adress bar, go to extensions page by typing **chrome://extensions/** and load the folder **extractify** as an **unpacked extension**

**Usage**

[Go to the wiki](https://github.com/fredericvergnaud/extractify/wiki) to see how to use Extractify.

**Love it ?** [Tell me](mailto:frederic.vergnaud@mines-paristech.fr) !

**Found a bug** ? Don’t be afraid to [open an issue](https://github.com/fredericvergnaud/extractify/issues/new).
back to top