SHMsoft blog: Using FreeEed for social media discovery

Friday, November 4, 2016

Using FreeEed for social media discovery

One of the areas that the Memex/DARPA teams excel in is crawling. FreeEed and the people behind it are part of the Memex, so it was quite natural to integrate discovery of crawl results into FreeEed processing and review.

Here is a recent Forbes article about the team.

Searching the websites and social media has been added to FreeEed starting from version 7. The common format to store crawl results is JSON. Each JSON description corresponds to a website page, user post, or a similar item.

Each JSON search entry is represented by a one-line in the archive file. The archive is given the extension *.jl, which stands for "JSON line".

FreeEed understands the *.jl extension, parses the JSON content of every line in the *.jl file, and finds indexes such fields as text, authors, etc., and makes them searchable in the FreeEed Review tool.

Below is a screenshot of FreeEeedUI review, illustrating searches in a collection from an escort services website.

How to create your crawler? You can use the crawler from Scraping Hub, also a member of the Memex team. Or you can use the trusted friend, Apache Nutch. Nutch has been around for more than ten years, and it is the beginning of Hadoop.

By the way, we provide training in all these technologies.

6 comments:

Blog Comment Backlinks said...: This is actually the kind of information I have been trying to find. Thank you for writing this information. Mobile Game; July 21, 2020 at 12:00 AM
nency said...: Im no expert, but I believe you just made an excellent point. You certainly fully understand what youre speaking about, and I can truly get behind that. Game Mobile Online; July 24, 2020 at 11:52 PM
Football Jersey said...: Buy IG Followers,buy instagram followers cheap,buy instagram
followers cheap,buy real instagram followers,buy facebook likes,buy instagram
followers and likes
if you are looking for the
buy youtube subscribersbest SMM panel online? Want to buy Instagram followers? Check
out our website and learn more about how you can increase your followers today; October 6, 2021 at 10:17 PM
Unknown said...: We add new presentation designs bi-weekly! This month, we added a variety of widescreen
PowerPoint templates focused on 2021, business concepts and abstract animated designs.
powerpoint backgrounds; December 1, 2021 at 8:56 AM
socialpanel said...: Social media panel expert with more than seven years of experience in creating, tracking, and analyzing social media campaigns.; February 5, 2022 at 9:46 PM
socialpanel said...: Crushed it with my social panel and learned how to utilize social media effectively as a platform to broadcast my brand and secure new clients.; February 17, 2022 at 3:46 AM

SHMsoft blog

Friday, November 4, 2016

Using FreeEed for social media discovery

6 comments:

Blog Archive

About Me