From 5013cd6fb9997b1d3e0cffb45ec66a2f4344eb8c Mon Sep 17 00:00:00 2001 From: FernandoCelmer Date: Mon, 27 Feb 2023 01:07:39 -0300 Subject: [PATCH] =?UTF-8?q?=F0=9F=93=98=20DOCS=20-=20Update?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- README.md | 6 +++--- docs/index.md | 6 +++--- docs/nav/graphic.md | 10 ++++++++++ docs/nav/tor-network.md | 39 --------------------------------------- mkdocs.yml | 8 ++++---- 5 files changed, 20 insertions(+), 49 deletions(-) create mode 100644 docs/nav/graphic.md delete mode 100644 docs/nav/tor-network.md diff --git a/README.md b/README.md index d9a492a..b50583f 100644 --- a/README.md +++ b/README.md @@ -9,11 +9,11 @@ Database of onion network sites and graphs related to connections. ## Project objectives: -- Answer the question of how many sites are possible to discover through scraping. +- Build a database through tor network link data scraping. - Graph relationships between onion network sites. - Provide a complete list of discovered sites. -- Apply machine learning techniques to categorize websites based on content. -- Apply ranking techniques to provide a search engine. +- Apply machine learning techniques to categorize websites based on your data. +- Apply ranking techniques to provide a search engine for onion sites. ### Initial dataset diff --git a/docs/index.md b/docs/index.md index 146b29a..800ae39 100644 --- a/docs/index.md +++ b/docs/index.md @@ -7,11 +7,11 @@ Database of onion network sites and graphs related to connections. ## Project objectives: -- Answer the question of how many sites are possible to discover through scraping. +- Build a database through tor network link data scraping. - Graph relationships between onion network sites. - Provide a complete list of discovered sites. -- Apply machine learning techniques to categorize websites based on content. -- Apply ranking techniques to provide a search engine. +- Apply machine learning techniques to categorize websites based on your data. +- Apply ranking techniques to provide a search engine for onion sites. ### Initial dataset diff --git a/docs/nav/graphic.md b/docs/nav/graphic.md new file mode 100644 index 0000000..42ebfec --- /dev/null +++ b/docs/nav/graphic.md @@ -0,0 +1,10 @@ +![Graphic](https://raw.githubusercontent.com/IndexOffy/tor-network-dataset/master/docs/assets/tor_connections.png) +Last graph - [2022-12-03] + +--- + +**Previous models** + +- [tor_connections_03.png](https://raw.githubusercontent.com/IndexOffy/tor-network-dataset/master/docs/assets/graphic/_tor_connections_03.png) - [2022-12-30] +- [tor_connections_02.png](https://raw.githubusercontent.com/IndexOffy/tor-network-dataset/master/docs/assets/graphic/_tor_connections_02.png) - [2022-11-29] +- [tor_connections_01.png](https://raw.githubusercontent.com/IndexOffy/tor-network-dataset/master/docs/assets/graphic/_tor_connections_01.png) - [2022-11-29] diff --git a/docs/nav/tor-network.md b/docs/nav/tor-network.md deleted file mode 100644 index 7014da3..0000000 --- a/docs/nav/tor-network.md +++ /dev/null @@ -1,39 +0,0 @@ -## Is it legal to enter dark web? - -Using Tor is legal in most countries. It’s illegal to perform certain activities, depending on your residency these may include: buying or selling drugs, weapons, counterfeit money, abusive materials etc. - -## What is darknet? - -The Web consists of three large areas: - -- **surface web** (often called clearnet, cleannet) — publicly accessible resources, e.g. search engines, news, social media that can be indexed by search engines -- **deep web** — resources not indexed by search engines, because they are protected by password or stored behind public services, e.g. companies internal platforms, medical records, research papers, legal documents -- **dark web** — resources that can be accessed only with specific software, they are not accessible from standard web browser, e.g. whistleblowers secure drops, secret communication channels for activist, journalists, human rights activists but also many illegal marketplaces and shops - -## What is Tor? - -Tor (The Onion Router) is an open-source software that bounces Internet traffic through a worldwide network consisting of almost million relays in order to hide user’s location and protect him against surveillance or traffic analysis. Tor makes more difficult to trace Internet activity: websites visits, online posts, instant messages and other communication forms. - -## How Tor works? - -Your traffic passes through 3 intermediate nodes before reaching destination. Each of the 3 nodes has separate layer of encryption and nobody who watches your connection can read what you send and where. - -**Tor layers** - -- Guard node — knows your IP address but doesn’t know where you connect to and what you send to destination -- Middle node — immediate layer between guard node and exit node -- Exit node — knows destination but doesn’t know who you are - -## What are hidden services? - -Hidden services are accessible only within Tor network. Their domain names end with .onion. They are not indexed by any public search engine. The only way to enter .onion sites is to know equal address. - -## Who created Tor? - -The idea of onion routing was created in 1995 at the U.S. Naval Research Lab by David Goldschlag, Mike Reed and Paul Syverson in effect of a research to find a way to create Internet connections that don’t reveal who is talking to whom. The reason was to protect US intelligence communications online. - -In early 2000s, Roger Dingledine (MIT graduate) with Paul Syverson began working on the onion routing project created at Naval Research Lab. To distinguish their work from other efforts, they named the project Tor (The Onion Routing). - -Tor was oficially deployed in October 2002 and its source code was released under a free and open software license. In December 2006 computer scientists Roger Dingledine, Nick Mathewson and five others founded The Tor Project research-education nonprofit organization that is responsible for maintaining the software. - -Tor is supported by US government, many NGOs, private foundations, research institutions, private companies and over 20,000 personal donations from people from around the World. \ No newline at end of file diff --git a/mkdocs.yml b/mkdocs.yml index cc1e9e6..3cad0b6 100644 --- a/mkdocs.yml +++ b/mkdocs.yml @@ -1,14 +1,14 @@ site_name: Tor Network Dataset -repo_url: https://github.com/IndexOffy/onion-network-dataset -copyright: IndexOffy +site_url: http://www.indexoffy.com/ +repo_name: IndexOffy/tor-network-dataset +repo_url: https://github.com/IndexOffy/tor-network-dataset nav: - Home: index.md - - Tor Network: nav/tor-network.md + - Graphic: nav/graphic.md - Development: - nav/development/development-guide.md - nav/development/release-notes.md - theme: name: simple-blog