How to use the Textractor API
Simply by doing a request to https://www.textractor.net/api?u={Aricle url}. It will return a bunch of JSon for you to further process.
The JSon output will return something like this:
{
"PageTitle":"......",
"MetaDescription":"......",
"MetaKeywords":"......",
"DocumentLanguage":"......",
"Paragraphs":[
{"Text":"......"},
{"Text":"......"}
],
"Images":......,
"Title":"......"
}
What can go wrong?
If either you or we mess up, one of the following errors will be returned:
- 100: Not a valid url
- 101: The requested page is not available
- 102: Page not found
- 103: Website returned an internal server error
- 104: The page html structure of the requested url is invalid
- 105: No content was found
- 106: Unknown error occured
Some examples