There is a list with lists inside [['/world/'], ['/latest/'], ['/?updated=top'], ['/politics/36188461-s-marta-zhizn-rossiyan-suschestvenno-izmenitsya-iz-za-novyh-zakonov/'] ['/world/36007585-tramp-pridumal-kak-reshit-ukrainskiy-vopros/'], ['/science/36157853-nasa-sobiraet-ekstrennuyu-press-konferentsiyu-na-temu-vnezemnoy-zhizni/'], ['/video/36001498-poyavilis-pervye-podrobnosti-gibeli-natali-melamed/'], ['/world/36007585-tramp-pridumal-kak-reshit-ukrainskiy-vopros/?smi2=1'] ['/science/'], ['/sport/'], ['/middleeast/36131117-divizion-s-400-ne-zametil-ataki-f-35-pod-damaskom/'], ['/economics/36065674-rossiyane-vozmutilis-minimalnymi-zarplatami-v-stranah-es/']] 1) Modify the list to the Pandas dataframe 2) Filter out and leave only the url's with the news sctructure (containing 8 digits and heading) in it, using the str.contains method
import pandas as pd list = [['/world/'], ['/latest/'], ['/?updated=top'], ['/politics/36188461-s-marta-zhizn-rossiyan-suschestvenno-izmenitsya-iz-za-novyh-zakonov/'] ['/world/36007585-tramp-pridumal-kak-reshit-ukrainskiy-vopros/'], ['/science/36157853-nasa-sobiraet-ekstrennuyu-press-konferentsiyu-na-temu-vnezemnoy-zhizni/'], ['/video/36001498-poyavilis-pervye-podrobnosti-gibeli-natali-melamed/'], ['/world/36007585-tramp-pridumal-kak-reshit-ukrainskiy-vopros/?smi2=1'] ['/science/