J'ai quelques données html ci-dessous, je veux extraire tous les temps de la page Web, puis stocker toutes les données dans une variable de liste. Comment puis-je faire cela… Aide S'il vous plaît ..
<div class=panchang-box-secondary-header> <div class="list-wrapper pl-2"> <div class="list-style-thumbnail list-layout-horizontal"> <div class="list-item-outer py-2"> <div class="d-flex w-100 align-items-center"> <span class="icon-sprite icon-sprite-sunrise"></span> <div class=flex-grow-1> <span class="d-block t-sm">सà¥à¤°à¥à¤¯à¥à¤¦à¤¯</span> <span class="d-block b">5:31 AM</span> </div> </div> </div> <div class="list-item-outer py-2"> <div class="d-flex w-100 align-items-center"> <span class="icon-sprite icon-sprite-sunset"></span> <div class=flex-grow-1> <span class="d-block t-sm">सà¥à¤°à¥à¤¯à¤¾à¤¸à¥à¤¤</span> <span class="d-block b">7:24 PM</span> </div> </div> </div> <div class="list-item-outer py-2"> <div class="d-flex w-100 align-items-center"> <span class="icon-sprite icon-sprite-moonrise"></span> <div class=flex-grow-1> <span class="d-block t-sm">à¤à¤¨à¥à¤¦à¥à¤°à¥à¤¦à¤¯</span> <span class="d-block b">10:05 PM</span> </div> </div> </div> <div class="list-item-outer py-2"> <div class="d-flex w-100 align-items-center"> <span class="icon-sprite icon-sprite-moonset"></span> <div class=flex-grow-1> <span class="d-block t-sm">à¤à¤¨à¥à¤¦à¥à¤°à¤¾à¤¸à¥à¤¤</span> <span class="d-block b">9:12 AM</span> </div> </div> </div>
3 Réponses :
Extrayez simplement "d-block b" et insérez-le où vous voulez.
Pouvez-vous s'il vous plaît me répondre avec le code complet s'il vous plaît ... Je suis débutant dans le scraping Web ...
time = soup.find_all(class_ = "d-block b").text This will make a list that gets all the time in the webpage source and store it in the variable time
Essayez d'utiliser ceci:
['5:31 AM', '7:24 PM', '10:05 PM', '9:12 AM']
Résultat:
from bs4 import BeautifulSoup a = '''<div class=panchang-box-secondary-header> <div class="list-wrapper pl-2"> <div class="list-style-thumbnail list-layout-horizontal"> <div class="list-item-outer py-2"> <div class="d-flex w-100 align-items-center"> <span class="icon-sprite icon-sprite-sunrise"></span> <div class=flex-grow-1> <span class="d-block t-sm">सà¥à¤°à¥à¤¯à¥à¤¦à¤¯</span> <span class="d-block b">5:31 AM</span> </div> </div> </div> <div class="list-item-outer py-2"> <div class="d-flex w-100 align-items-center"> <span class="icon-sprite icon-sprite-sunset"></span> <div class=flex-grow-1> <span class="d-block t-sm">सà¥à¤°à¥à¤¯à¤¾à¤¸à¥à¤¤</span> <span class="d-block b">7:24 PM</span> </div> </div> </div> <div class="list-item-outer py-2"> <div class="d-flex w-100 align-items-center"> <span class="icon-sprite icon-sprite-moonrise"></span> <div class=flex-grow-1> <span class="d-block t-sm">à¤à¤¨à¥à¤¦à¥à¤°à¥à¤¦à¤¯</span> <span class="d-block b">10:05 PM</span> </div> </div> </div> <div class="list-item-outer py-2"> <div class="d-flex w-100 align-items-center"> <span class="icon-sprite icon-sprite-moonset"></span> <div class=flex-grow-1> <span class="d-block t-sm">à¤à¤¨à¥à¤¦à¥à¤°à¤¾à¤¸à¥à¤¤</span> <span class="d-block b">9:12 AM</span> </div> </div> </div>''' soup = BeautifulSoup(a,'html.parser') time = soup.select('.d-block.b') times = [times.text for times in time] print(times)