drjobs Java Developer with Web Crawler Experience

Java Developer with Web Crawler Experience

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Austin, TX - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Role: Java Developer with Web Crawler Experience
Location: Austin TX(Hybrid)
Responsibilities:
1. Web Crawler Development: Design and implement efficient and scalable web crawlers in Java to collect data from various online sources.
2. Data Extraction: Develop and maintain systems for structured data extraction handling various data formats (HTML JSON XML etc.).
3. Data Storage and Processing: Design data storage and processing pipelines ensuring extracted data is clean structured and easily accessible.
4. Performance Optimization: Optimize web crawling processes for speed efficiency and accuracy while ensuring minimal impact on source websites.
5. Error Handling and Logging: Implement errorhandling mechanisms and logging systems to detect and resolve issues during crawling operations.
6. Data Integrity and Compliance: Ensure data collection practices are ethical legal and compliant with relevant regulations (e.g. robots.txt copyright laws).
Requirements:
Proficiency in Java and experience with Javabased web sing libraries (e.g. Jsoup Apache HttpClient).
Knowledge of web crawling frameworks and tools such as Sy Selenium or Puppeteer.
Strong understanding of HTML CSS JavaScript and web data structures.
Familiarity with data parsing and handling techniques for JSON XML and other common formats.
Experience with database technologies (SQL NoSQL) to store and manage sed data.
Knowledge of HTTP protocols headers proxies and load handling.

Employment Type

Full Time

Company Industry

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.