Scanning through the correct sequence of WhatsApp web chat list using the Selenium WebDriver

Question

Scanning through the correct sequence of WhatsApp web chat list using the Selenium WebDriver

Is there a way to programmatically retrieve all chat divs in WhatsApp Web in the order they are displayed? Currently, using

driver.find_elements_by_class_name('_210SC')

seems to only fetch the first 20 or so chats in no particular sequence. It appears that the chats are generated dynamically.

When attempting to select specific chats by index, such as chats[0].click() for the 1st chat and chat[1].click() for the 46th chat, the results are inconsistent as the order changes with scrolling and re-executing the query.

Is there a method to retrieve the chats exactly as they appear on the screen, ensuring that chats[0] corresponds to Mike and chats[1] to George, for instance? What is the underlying reason for this behavior?

python selenium selenium-webdriver webdriver

Answer 1

Answer №1

Whatsapp Web uses a lazyloaded react app structure. It currently displays 21 elements on the screen, however, this number may vary based on the screen size. The order in which the elements are displayed is from top to bottom - starting with the most recent entry at the top, followed by 20 entries in reverse order, meaning

chat[0] > chat[20] > chat[19] ... chat[1]

To efficiently navigate through these elements, I would recommend fetching the first 21 elements, scrolling down to the last element (which should be at chats[1]), fetching again, and repeating this process until no new divs remain. It would also be beneficial to keep track of the chatters you have already fetched, possibly by evaluating their XPath using

//*[@id="pane-side"]//div[@class='_210SC']//div[@class='_3dtfX']//span[@class='_3ko75 _5h6Y_ _3Whw5']

to retrieve their names.

Answer 2

Whatsapp Web uses a lazyloaded react app structure. It currently displays 21 elements on the screen, however, this number may vary based on the screen size. The order in which the elements are displayed is from top to bottom - starting with the most recent entry at the top, followed by 20 entries in reverse order, meaning

chat[0] > chat[20] > chat[19] ... chat[1]

To efficiently navigate through these elements, I would recommend fetching the first 21 elements, scrolling down to the last element (which should be at chats[1]), fetching again, and repeating this process until no new divs remain. It would also be beneficial to keep track of the chatters you have already fetched, possibly by evaluating their XPath using

//*[@id="pane-side"]//div[@class='_210SC']//div[@class='_3dtfX']//span[@class='_3ko75 _5h6Y_ _3Whw5']

to retrieve their names.

Answer 3

Answer №2

I managed to figure out a technique for saving the entire contact list. While I acknowledge that there may be more efficient methods available, this approach seems to get the job done:

from selenium import webdriver
from webdriver_manager.chrome import ChromeDriverManager
import time 
from selenium.webdriver.common.keys import Keys 

#navigate to WhatsApp Web and scan QR code
driver = webdriver.Chrome(ChromeDriverManager().install())
driver.get('https://web.whatsapp.com/')
time.sleep(15)

#click on search bar
search_field = driver.find_element_by_xpath('//div[contains(@class,"copyable-text selectable-text")]')
search_field.click()
time.sleep(3)

#scroll down to access contact list
search_field.send_keys(Keys.ARROW_DOWN)
time.sleep(3)

#retrieve elements by class + continue scrolling down 
while True:
    contacts = []
    contact_title = driver.find_elements_by_class_name('_3Dr46')
    selected_contact = driver.find_element_by_xpath('//div[@aria-selected="true" and @role="row"]')
    for i in contact_title:
        contacts.append(i.text)
    selected_contact.send_keys(Keys.ARROW_DOWN)
    time.sleep(1)
    selected_contact.send_keys(Keys.ARROW_DOWN)
    time.sleep(1)
    selected_contact.send_keys(Keys.ARROW_DOWN)

I utilized send_keys 20 times due to difficulties with using ActionChains

(from selenium.webdriver.common.action_chains import ActionChains)

, as it was operating too quickly without allowing sufficient loading time for the desired data.

Subsequently, I employed print(len(contacts)) and print(contacts), yielding the following results:

16
['num1', 'num2, 'num3','num4'...]
16
['num1', 'num2, 'num4','num5'...]
16
['num2', 'num3, 'num4','num5'...]

This pattern continues until reaching the end of the scroll bar. I will share further updates, as my next objective is to compile this information into a list containing approximately 200 unique contacts.

I trust that this information proves beneficial and welcome any suggestions for optimizing this process.

Answer 4

I managed to figure out a technique for saving the entire contact list. While I acknowledge that there may be more efficient methods available, this approach seems to get the job done:

from selenium import webdriver
from webdriver_manager.chrome import ChromeDriverManager
import time 
from selenium.webdriver.common.keys import Keys 

#navigate to WhatsApp Web and scan QR code
driver = webdriver.Chrome(ChromeDriverManager().install())
driver.get('https://web.whatsapp.com/')
time.sleep(15)

#click on search bar
search_field = driver.find_element_by_xpath('//div[contains(@class,"copyable-text selectable-text")]')
search_field.click()
time.sleep(3)

#scroll down to access contact list
search_field.send_keys(Keys.ARROW_DOWN)
time.sleep(3)

#retrieve elements by class + continue scrolling down 
while True:
    contacts = []
    contact_title = driver.find_elements_by_class_name('_3Dr46')
    selected_contact = driver.find_element_by_xpath('//div[@aria-selected="true" and @role="row"]')
    for i in contact_title:
        contacts.append(i.text)
    selected_contact.send_keys(Keys.ARROW_DOWN)
    time.sleep(1)
    selected_contact.send_keys(Keys.ARROW_DOWN)
    time.sleep(1)
    selected_contact.send_keys(Keys.ARROW_DOWN)

I utilized send_keys 20 times due to difficulties with using ActionChains

(from selenium.webdriver.common.action_chains import ActionChains)

, as it was operating too quickly without allowing sufficient loading time for the desired data.

Subsequently, I employed print(len(contacts)) and print(contacts), yielding the following results:

16
['num1', 'num2, 'num3','num4'...]
16
['num1', 'num2, 'num4','num5'...]
16
['num2', 'num3, 'num4','num5'...]

This pattern continues until reaching the end of the scroll bar. I will share further updates, as my next objective is to compile this information into a list containing approximately 200 unique contacts.

I trust that this information proves beneficial and welcome any suggestions for optimizing this process.

Scanning through the correct sequence of WhatsApp web chat list using the Selenium WebDriver

Answer №1

Answer №2

Similar questions

Unexpected closure occurred with status 1 using Firefox webdrivers in the context of Watir automation (Ruby on Rails)

Developing a Python serverless function on Vercel using Next.js is a streamlined

User timeout led to connection failure in the scraping process

Utilize Beautiful Soup, Selenium, and Pandas to extract price information by scraping the web for values stored within specified div class

Python's function file.truncate() does not behave as expected and does not actually truncate the file

building an administrator profile with django

Tips for implementing Bitwise Exclusive OR on a nested list:

Encountering WebDriver Firefox and Selenium issues - requires the use of Gecko driver

Begin the execution of JMeter using a JUnit test

The Rise and Fall of Python: A Study of Ascendance and

Employing Python with Selenium to programmatically click on a button with a ng-click attribute and automatically upload

Guide to installing torch through python

Discovering the xpath with a certain condition based on text length using Python Selenium

Can I choose multiple rows at once in a treeview widget?

Issue with mismatched dynamic values in Selenium IDE

What is the best way to align text to the center below an image?

Attempting to scan through each Reddit headline in order to make a decision on which one to click

Discover the XPATH for selenium in VBA programming

Transferring live video feed from NodeJS to Python instantaneously

Looking for assistance with parsing out four numerical values from an HTML scrape using Python