
Go-Crawl

A Simple, Concurrency-Safe Web Crawler Implementation in Go!


Table of Contents

  • Overview
  • Features
  • Requirements
  • Installation
  • Usage

Overview

Go-Crawl is a simple, concurrency-safe web crawler written in Go. It crawls websites concurrently and extracts their internal links.
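
The link-extraction step can be pictured as in the sketch below, which uses the golang.org/x/net/html parser listed under Requirements. This is a minimal illustration, not the project's actual code; the function name internalLinks and its signature are assumptions.

    package main

    import (
        "fmt"
        "net/url"
        "strings"

        "golang.org/x/net/html"
    )

    // internalLinks parses an HTML document and returns absolute URLs that
    // share a host with base, i.e. the internal links a crawler would follow.
    func internalLinks(body string, base *url.URL) ([]string, error) {
        doc, err := html.Parse(strings.NewReader(body))
        if err != nil {
            return nil, err
        }
        var links []string
        var walk func(*html.Node)
        walk = func(n *html.Node) {
            if n.Type == html.ElementNode && n.Data == "a" {
                for _, attr := range n.Attr {
                    if attr.Key != "href" {
                        continue
                    }
                    if href, err := url.Parse(attr.Val); err == nil {
                        // Resolve relative links against the base URL and keep
                        // only those on the same host.
                        if abs := base.ResolveReference(href); abs.Host == base.Host {
                            links = append(links, abs.String())
                        }
                    }
                }
            }
            for c := n.FirstChild; c != nil; c = c.NextSibling {
                walk(c)
            }
        }
        walk(doc)
        return links, nil
    }

    func main() {
        base, _ := url.Parse("https://example.com")
        page := `<a href="/about">About</a> <a href="https://other.site/">Other</a>`
        links, _ := internalLinks(page, base)
        fmt.Println(links) // [https://example.com/about]
    }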

Features

Go-Crawl includes the following features:

  • Concurrency Support: Go-Crawl uses goroutines to crawl multiple URLs concurrently (a sketch of the pattern follows this list), making it well suited to large-scale crawling tasks.
  • Robust Error Handling: errors such as failed requests or malformed URLs are logged and handled gracefully rather than aborting the crawl.
  • Customizable Configuration: the crawler's behavior, such as the worker-pool size, can be adjusted through configuration options.
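
The sketch below shows one way the concurrency pattern can work, assuming a mutex-guarded visited set and a buffered channel as the worker pool. The names here (crawler, firstVisit, fetchAndExtract) are illustrative assumptions, not the project's actual API.

    package main

    import (
        "fmt"
        "sync"
    )

    // crawler bundles the shared state that must be concurrency-safe: a
    // mutex-guarded set of visited URLs and a buffered channel that caps
    // how many goroutines fetch at once.
    type crawler struct {
        mu      sync.Mutex
        visited map[string]struct{}
        workers chan struct{} // capacity = worker-pool size
        wg      sync.WaitGroup
    }

    // firstVisit records u and reports whether this call saw it first, so
    // each page is fetched exactly once even when goroutines race.
    func (c *crawler) firstVisit(u string) bool {
        c.mu.Lock()
        defer c.mu.Unlock()
        if _, seen := c.visited[u]; seen {
            return false
        }
        c.visited[u] = struct{}{}
        return true
    }

    func (c *crawler) crawl(u string) {
        defer c.wg.Done()
        if !c.firstVisit(u) {
            return
        }
        c.workers <- struct{}{}     // acquire a worker slot
        links := fetchAndExtract(u) // fetch the page, parse internal links
        <-c.workers                 // release the slot
        for _, link := range links {
            c.wg.Add(1)
            go c.crawl(link)
        }
    }

    // fetchAndExtract stands in for the HTTP fetch plus the link parsing
    // shown in the Overview sketch; stubbed to keep this example self-contained.
    func fetchAndExtract(u string) []string {
        fmt.Println("crawling", u)
        return nil
    }

    func main() {
        c := &crawler{
            visited: make(map[string]struct{}),
            workers: make(chan struct{}, 3), // e.g. three concurrent workers
        }
        c.wg.Add(1)
        go c.crawl("https://example.com")
        c.wg.Wait()
    }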

Requirements

To use Go-Crawl, you need:

  • Go 1.22 or later
  • the net/url package (part of the Go standard library)
  • the golang.org/x/net/html package (an external dependency; see the note after this list)
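
If the golang.org/x/net/html dependency is not already present in your module, it can be fetched with the standard go get command:

> go get golang.org/x/net/html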

Installation

To install Go-Crawl, run the following commands from your terminal:

  • create a project directory and change into it
> mkdir project && cd project
  • clone the repo and change into the cloned directory
> git clone https://github.com/etrinque/go-crawl && cd go-crawl
  • build the project
> go build -o crawl main.go

Usage

From the console, run the resulting binary with the following arguments:

// enter a target URL to start from and the number of goroutine workers (optional)

> crawl {url} {worker-pool-size}
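
For example, to crawl a site with three workers (the URL and worker count here are illustrative):

> ./crawl https://example.com 3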
