DigitalPebble's Blog
Digital Pebble Blog Spot. DigitalPebble Ltd is a consulting company specialised in linguistic engineering, document management, information retrieval and...
Read the Digitalpebble.blogspot.com news digest here: view the latest Digital Pebble Blog Spot articles and content updates, or go to its most visited pages. Digitalpebble.blogspot.com is not yet rated by Alexa and its traffic estimate is unavailable. Digital Pebble Blog Spot content appears to be most popular in the USA. We haven't detected security issues or inappropriate content on Digitalpebble.blogspot.com, so it should be safe to use. Digitalpebble.blogspot.com is hosted by Google LLC (United States) and its primary language is English.
- Content verdict: Safe
- Website availability: Live
- Language: English
- Visitors daily: N/A
- Pageviews daily: N/A
- Google PR: 5
- Alexa rank: N/A
Best pages on Digitalpebble.blogspot.com
- DigitalPebble's Blog: Nutch training course
  We are planning to run a 2-day training course on Apache Nutch on 24/25 October 2013. It will take place in Bristol, UK (the exact v...
- DigitalPebble's Blog: Parsing the Enron email dataset using Tika and Hadoop
  Friday, 27 May 2011. In order to parse a large collection of emails, such as the Enron Email Dataset, we might choose to use Apache Hadoop, a scalable computing framework, and Apache Tika, a content an...
- DigitalPebble Ltd is a consulting company specialised in linguistic engineering, document management, information retrieval and extraction. Our expertise is based on open source solutions, such as Luc...
Digitalpebble.blogspot.com news digest
- What's new in StormCrawler 1.12 (5 years ago)
  The previous release was only last month, but I decided to ship this one now as it contains several bugfixes and improvements that many users would benefit from.
  As you can see below, the main changes are around protocols and sitemaps. We have used Selenium and OkHttp a lot recently to deal with dynamic websites, and the changes below definitely help with these. There is also an important bugfix for jsoup (#65...
- What's new in StormCrawler 1.11 (6 years ago)
  I've just released StormCrawler 1.11. Here are the main changes, some of which require modifications to your configuration.
  Users should upgrade to this version as it fixes several bugs and adds a lot of functionality.
  Dependency upgrades
- What's new in StormCrawler 1.10 (6 years ago)
  StormCrawler 1.9 is only a couple of weeks old, but the functionality added since then justifies a new release.
  Dependency upgrades
  Apache Storm 1.2.2 (#583)
Domain history
- Web host: Google LLC
- Registrar: MarkMonitor Inc.
- Registrant: Google LLC
- Updated: August 02, 2024
- Expires: July 31, 2025
- Created: July 31, 2000
Safety scores
- Trustworthiness: N/A
- Child safety: N/A