-
What is a web crawler primarily used for?
- A: Playing games
- B: Downloading music
- C: Indexing website content for search engines
- D: Editing videos
-
Which Python library is commonly used for making HTTP requests in web crawling?
- A: NumPy
- B: Pandas
- C: Requests
- D: Matplotlib
-
What is ‘rate limiting’ in the context of web crawling?
- A: Limiting the number of websites a crawler can access
- B: Limiting the speed at which a crawler sends requests
- C: Limiting the size of the data a crawler can download
- D: Limiting the types of data a crawler can request
-
Which Python library is widely used for web crawling and scraping?
- A: Flask
- B: Django
- C: Scrapy
- D: TensorFlow
-
What function in the Requests library is used to send a GET request?
- A: requests.post()
- B: requests.get()
- C: requests.send()
- D: requests.receive()
-
How can you parse a URL and extract its components in Python?
- A: Using the urllib.parse module
- B: Using the os.path module
- C: Using the json module
- D: Using the csv module
-
In Scrapy, what is the purpose of the parse method in a spider class?
- A: To start the crawler
- B: To process the response and extract data
- C: To handle exceptions
- D: To store the crawled data
-
Which of the following is not a part of the Scrapy architecture?
- A: Scheduler
- B: Downloader
- C: Executor
- D: Item Pipeline
-
Which function is used to create an array in NumPy?
- A: array()
- B: list()
- C: dict()
- D: set()
-
Which method is used to compute the mean of elements in a NumPy array?
- A: .sum()
- B: .avg()
- C: .mean()
- D: .total()
-
How do you change the shape of an existing NumPy array?
- A: reshape()
- B: resize()
- C: reshape_array()
- D: array_shape()
-
Which of the following is not a valid attribute of a NumPy array?
- A: shape
- B: dtype
- C: size
- D: typecode
-
What is a core element of web crawlers?
- A: Iteration
- B: Recursion
- C: Conditional statements
- D: Variables
-
What Python package is used to parse HTML and XML documents?
- A: requests
- B: BeautifulSoup
- C: lxml
- D: pandas
-
Which of the following is not a type of object discussed in BeautifulSoup?
- A: Tag
- B: NavigableString
- C: BeautifulSoup
- D: LinkObject
Crawler选填判断题
于 2024-06-16 23:04:40 首次发布