
fix cookie handling by refreshing page after loading cookies #4

Open
davidesner wants to merge 5 commits into main from de-fix-cookie-handling

Conversation

@davidesner
Contributor

This pull request introduces improvements to the web crawler component, focusing on more robust cookie handling and minor code cleanups. The main changes ensure that cookies are properly applied by refreshing the browser after loading cookies, and clarify some configuration parameter defaults.

Web crawler improvements:

  • Added a refresh_page method to the SeleniumCrawler class, which refreshes the browser to ensure cookies are applied after loading.
  • Updated the run method in component.py to call refresh_page and wait for 2 seconds after loading cookies, ensuring proper session restoration.
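The load, refresh, wait sequence described above can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: only `SeleniumCrawler` and `refresh_page` are named in the description, while `load_cookies`, `restore_session`, and the injectable `sleep` argument are hypothetical names added for the example. The driver is assumed to behave like a Selenium WebDriver, which provides `add_cookie()` and `refresh()`.

```python
import time


class SeleniumCrawler:
    """Illustrative stand-in; the real class in the PR is not shown here."""

    def __init__(self, driver):
        # `driver` is assumed to behave like a selenium WebDriver.
        self.driver = driver

    def load_cookies(self, cookies):
        # Hypothetical helper: add each cookie dict to the current session.
        for cookie in cookies:
            self.driver.add_cookie(cookie)

    def refresh_page(self):
        # Reload the current page so the newly added cookies are sent
        # with the next request.
        self.driver.refresh()


def restore_session(crawler, cookies, wait_seconds=2.0, sleep=time.sleep):
    """Load cookies, refresh the page, then pause briefly.

    The 2-second default mirrors the wait mentioned in the PR description;
    `sleep` is injectable only so the sketch can be exercised without waiting.
    """
    crawler.load_cookies(cookies)
    crawler.refresh_page()
    sleep(wait_seconds)
```

The order matters here: a refresh issued before the cookies are added would reload the page without the restored session.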

Configuration and code cleanup:

  • Changed the import section in component.py to include time for the new sleep logic.
  • Fixed the retrieval of the docker_mode configuration parameter by passing a default value directly in the get call. Previously, docker mode always evaluated to true whenever the parameter was provided.
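To illustrate the kind of problem the docker_mode fix addresses, here is a small sketch. The original faulty code is not shown in this description, so the presence-check pattern below is only a guess at one way such a bug can arise. The function names and the `False` default are assumptions made for the example.

```python
def docker_mode_presence_check(parameters):
    # Hypothetical faulty pattern: tests only whether the key is present,
    # so an explicit False in the configuration still yields True.
    return "docker_mode" in parameters


def docker_mode_with_default(parameters, default=False):
    # Pattern named in the PR description: pass the default directly to get().
    # The configured value is returned as-is; the default (assumed to be
    # False here) applies only when the key is missing.
    return parameters.get("docker_mode", default)


config = {"docker_mode": False}
print(docker_mode_presence_check(config))  # True, although False was configured
print(docker_mode_with_default(config))    # False
```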

@davidesner davidesner force-pushed the de-fix-cookie-handling branch from ae8a921 to fd3ab2b Compare December 8, 2025 07:49
@davidesner davidesner requested a review from soustruh December 8, 2025 07:49
@davidesner davidesner force-pushed the de-fix-cookie-handling branch 2 times, most recently from b31acc7 to a96345f Compare December 8, 2025 08:54
@davidesner davidesner force-pushed the de-fix-cookie-handling branch from a96345f to 6508aaa Compare December 8, 2025 09:06