2026-05-19 10:48:15.013 [main] INFO com.example.datacollect.Main - Starting CLI Crawler application
2026-05-19 10:48:15.022 [main] INFO c.e.d.strategy.StrategyFactory - StrategyFactory initialized with 3 strategies
2026-05-19 10:48:15.025 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: help
2026-05-19 10:48:15.027 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: list
2026-05-19 10:48:15.027 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: crawl
2026-05-19 10:48:15.028 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: exit
2026-05-19 10:48:15.029 [main] INFO c.e.d.controller.CrawlerController - CrawlerController initialized with 4 commands
2026-05-19 10:48:15.031 [main] INFO c.e.datacollect.view.ConsoleView - Success: Welcome to CLI Crawler (w10_3)! Type help for commands.
2026-05-19 10:48:15.032 [main] INFO com.example.datacollect.Main - Application ready, waiting for input
2026-05-19 10:48:15.033 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 10:49:49.548 [main] WARN c.e.d.controller.CrawlerController - Unknown command: crawl
2026-05-19 10:49:49.548 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Unknown command: crawl
2026-05-19 10:49:49.559 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:04:21.863 [main] INFO com.example.datacollect.Main - Starting CLI Crawler application
2026-05-19 11:04:21.878 [main] INFO c.e.d.strategy.StrategyFactory - StrategyFactory initialized with 6 strategies
2026-05-19 11:04:21.883 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: help
2026-05-19 11:04:21.884 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: list
2026-05-19 11:04:21.885 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: crawl
2026-05-19 11:04:21.886 [main] DEBUG c.e.d.controller.CrawlerController - Registered command: exit
2026-05-19 11:04:21.886 [main] INFO c.e.d.controller.CrawlerController - CrawlerController initialized with 4 commands
2026-05-19 11:04:21.886 [main] INFO c.e.datacollect.view.ConsoleView - Success: Welcome to CLI Crawler (w10_3)! Type help for commands.
2026-05-19 11:04:21.887 [main] INFO com.example.datacollect.Main - Application ready, waiting for input
2026-05-19 11:04:21.887 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:05:05.592 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
2026-05-19 11:05:05.594 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for:
2026-05-19 11:05:05.596 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL : false
2026-05-19 11:05:05.596 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL : false
2026-05-19 11:05:05.597 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL : false
2026-05-19 11:05:05.603 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL : true
2026-05-19 11:05:05.604 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy WeiboHotStrategy for URL:
2026-05-19 11:05:05.606 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL:
2026-05-19 11:05:05.621 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling:
2026-05-19 11:05:05.626 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL:
2026-05-19 11:05:05.667 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 1 failed for URL : The supplied URL, '', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
2026-05-19 11:05:05.669 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
2026-05-19 11:05:06.672 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 2 to fetch URL:
2026-05-19 11:05:06.675 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 2 failed for URL : The supplied URL, '', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
2026-05-19 11:05:06.677 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
2026-05-19 11:05:07.689 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 3 to fetch URL:
2026-05-19 11:05:07.690 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 3 failed for URL : The supplied URL, '', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
2026-05-19 11:05:07.692 [main] ERROR c.e.datacollect.command.CrawlCommand - Failed to fetch URL after 3 attempts:
2026-05-19 11:05:07.692 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Failed to fetch URL after 3 attempts:
2026-05-19 11:05:07.694 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:06:22.788 [main] WARN c.e.d.controller.CrawlerController - Unknown command: ceawl
2026-05-19 11:06:22.788 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Unknown command: ceawl
2026-05-19 11:06:22.791 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:06:50.556 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
2026-05-19 11:06:50.557 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: <
2026-05-19 11:06:50.557 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL <: false
2026-05-19 11:06:50.558 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL <: false
2026-05-19 11:06:50.558 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL <: false
2026-05-19 11:06:50.558 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL <: false
2026-05-19 11:06:50.562 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL <: false
2026-05-19 11:06:50.563 [main] DEBUG c.e.d.strategy.WeatherStrategy - WeatherStrategy supports URL <: false
2026-05-19 11:06:50.567 [main] WARN c.e.d.strategy.StrategyFactory - No strategy found for URL: <
2026-05-19 11:06:50.574 [main] WARN c.e.datacollect.command.CrawlCommand - No strategy found for: <
2026-05-19 11:06:50.576 [main] ERROR c.e.datacollect.view.ConsoleView - Error: No strategy found for: <
2026-05-19 11:06:50.580 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:07:24.657 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
2026-05-19 11:07:24.659 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for:
2026-05-19 11:07:24.659 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL : false
2026-05-19 11:07:24.659 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL : false
2026-05-19 11:07:24.661 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL : false
2026-05-19 11:07:24.663 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL : false
2026-05-19 11:07:24.666 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL : true
2026-05-19 11:07:24.667 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy CctvNewsStrategy for URL:
2026-05-19 11:07:24.668 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL:
2026-05-19 11:07:24.669 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling:
2026-05-19 11:07:24.671 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL:
2026-05-19 11:07:24.675 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 1 failed for URL : The supplied URL, '', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
2026-05-19 11:07:24.676 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
2026-05-19 11:07:25.678 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 2 to fetch URL:
2026-05-19 11:07:25.681 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 2 failed for URL : The supplied URL, '', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
2026-05-19 11:07:25.682 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
2026-05-19 11:07:26.696 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 3 to fetch URL:
2026-05-19 11:07:26.698 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 3 failed for URL : The supplied URL, '', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
2026-05-19 11:07:26.700 [main] ERROR c.e.datacollect.command.CrawlCommand - Failed to fetch URL after 3 attempts:
2026-05-19 11:07:26.701 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Failed to fetch URL after 3 attempts:
2026-05-19 11:07:26.702 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:14:42.973 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
2026-05-19 11:14:42.975 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for:
2026-05-19 11:14:42.975 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL : false
2026-05-19 11:14:42.988 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL : false
2026-05-19 11:14:43.005 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL : false
2026-05-19 11:14:43.011 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL : false
2026-05-19 11:14:43.015 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL : false
2026-05-19 11:14:43.016 [main] DEBUG c.e.d.strategy.WeatherStrategy - WeatherStrategy supports URL : true
2026-05-19 11:14:43.017 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy WeatherStrategy for URL:
2026-05-19 11:14:43.019 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL:
2026-05-19 11:14:43.030 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling:
2026-05-19 11:14:43.038 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL:
2026-05-19 11:14:43.039 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 1 failed for URL : The supplied URL, '', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
2026-05-19 11:14:43.048 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
2026-05-19 11:14:44.062 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 2 to fetch URL:
2026-05-19 11:14:44.066 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 2 failed for URL : The supplied URL, '', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
2026-05-19 11:14:44.067 [main] INFO c.e.datacollect.command.CrawlCommand - Retrying in 1000ms...
2026-05-19 11:14:45.078 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 3 to fetch URL:
2026-05-19 11:14:45.079 [main] WARN c.e.datacollect.command.CrawlCommand - Attempt 3 failed for URL : The supplied URL, '', is malformed. Make sure it is an absolute URL, and starts with 'http://' or 'https://'. See https://jsoup.org/cookbook/extracting-data/working-with-urls
2026-05-19 11:14:45.082 [main] ERROR c.e.datacollect.command.CrawlCommand - Failed to fetch URL after 3 attempts:
2026-05-19 11:14:45.083 [main] ERROR c.e.datacollect.view.ConsoleView - Error: Failed to fetch URL after 3 attempts:
2026-05-19 11:14:45.090 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:17:17.250 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
2026-05-19 11:17:17.251 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: https://www.tianqi.com/changsha
2026-05-19 11:17:17.263 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:17:17.266 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.WeatherStrategy - WeatherStrategy supports URL https://www.tianqi.com/changsha: true
2026-05-19 11:17:17.267 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy WeatherStrategy for URL: https://www.tianqi.com/changsha
2026-05-19 11:17:17.269 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL: https://www.tianqi.com/changsha
2026-05-19 11:17:17.269 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling: https://www.tianqi.com/changsha
2026-05-19 11:17:17.269 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL: https://www.tianqi.com/changsha
2026-05-19 11:17:18.361 [main] DEBUG c.e.d.strategy.WeatherStrategy - Parsing weather page: https://www.tianqi.com/changsha
2026-05-19 11:17:18.388 [main] INFO c.e.d.strategy.WeatherStrategy - Parsed 1 weather items
2026-05-19 11:17:18.391 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 2026年05月19日 长沙天气
2026-05-19 11:17:18.395 [main] INFO c.e.datacollect.command.CrawlCommand - Successfully crawled 1 articles from https://www.tianqi.com/changsha
2026-05-19 11:17:18.395 [main] INFO c.e.datacollect.view.ConsoleView - Success: Crawled 1 articles.
2026-05-19 11:17:18.395 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:17:25.633 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: list
2026-05-19 11:17:25.634 [main] DEBUG c.e.datacollect.command.ListCommand - Listing articles
2026-05-19 11:17:25.635 [main] DEBUG c.e.d.repository.ArticleRepository - Returning 1 articles (unmodifiable)
2026-05-19 11:17:25.636 [main] DEBUG c.e.datacollect.view.ConsoleView - Displaying 1 articles
2026-05-19 11:17:25.660 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:18:00.938 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
2026-05-19 11:18:00.939 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: https://tv.cctv.com
2026-05-19 11:18:00.939 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL https://tv.cctv.com: false
2026-05-19 11:18:00.942 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL https://tv.cctv.com: false
2026-05-19 11:18:00.942 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL https://tv.cctv.com: false
2026-05-19 11:18:00.948 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL https://tv.cctv.com: false
2026-05-19 11:18:00.950 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL https://tv.cctv.com: true
2026-05-19 11:18:00.950 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy CctvNewsStrategy for URL: https://tv.cctv.com
2026-05-19 11:18:00.951 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL: https://tv.cctv.com
2026-05-19 11:18:00.951 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling: https://tv.cctv.com
2026-05-19 11:18:00.952 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL: https://tv.cctv.com
2026-05-19 11:18:01.315 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - Parsing CCTV news page: https://tv.cctv.com
2026-05-19 11:18:01.318 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - Found 0 news items
2026-05-19 11:18:01.402 [main] INFO c.e.d.strategy.CctvNewsStrategy - Parsed 189 news from CCTV
2026-05-19 11:18:01.403 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 直 播
2026-05-19 11:18:01.403 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 节目单
2026-05-19 11:18:01.404 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 频道大全
2026-05-19 11:18:01.404 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 栏目大全
2026-05-19 11:18:01.404 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 主 持 人
2026-05-19 11:18:01.405 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 听音
2026-05-19 11:18:01.406 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 消费主张
2026-05-19 11:18:01.406 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 文化十分
2026-05-19 11:18:01.407 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军事科技
2026-05-19 11:18:01.407 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日关注
2026-05-19 11:18:01.407 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 天下足球
2026-05-19 11:18:01.408 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 综合
2026-05-19 11:18:01.408 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻
2026-05-19 11:18:01.409 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 财经
2026-05-19 11:18:01.409 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 综艺
2026-05-19 11:18:01.409 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 亚洲
2026-05-19 11:18:01.409 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体育
2026-05-19 11:18:01.411 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电影
2026-05-19 11:18:01.411 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国防军事
2026-05-19 11:18:01.411 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电视剧
2026-05-19 11:18:01.412 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 纪录
2026-05-19 11:18:01.412 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 科教
2026-05-19 11:18:01.412 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 戏曲
2026-05-19 11:18:01.413 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 社会与法
2026-05-19 11:18:01.413 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 少儿
2026-05-19 11:18:01.413 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 音乐
2026-05-19 11:18:01.414 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 奥林匹克
2026-05-19 11:18:01.420 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 农业农村
2026-05-19 11:18:01.437 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 欧洲
2026-05-19 11:18:01.437 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 美洲
2026-05-19 11:18:01.439 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体育赛事
2026-05-19 11:18:01.439 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 微视频
2026-05-19 11:18:01.440 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 搜片库
2026-05-19 11:18:01.440 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 找栏目
2026-05-19 11:18:01.440 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国经济大讲堂
2026-05-19 11:18:01.441 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 健康中国
2026-05-19 11:18:01.441 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 听音
2026-05-19 11:18:01.441 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 科幻地带
2026-05-19 11:18:01.441 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 如何解读中美关系新定位?
2026-05-19 11:18:01.442 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 垃圾围村 曝多地违法倾倒乱象
2026-05-19 11:18:01.442 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 四川“大摆荡”坠亡事故调查
2026-05-19 11:18:01.443 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 孤寡老人留百万遗产 由谁继承
2026-05-19 11:18:01.443 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 带你破解肠道健康的隐秘真相
2026-05-19 11:18:01.444 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 坐着高铁看中国
2026-05-19 11:18:01.445 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: AI基建新潮涌
2026-05-19 11:18:01.451 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 一帘光影三代人
2026-05-19 11:18:01.451 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 如何把阳光“存进”大海里
2026-05-19 11:18:01.452 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 驾驭“蓝鲸” 潜航深海
2026-05-19 11:18:01.452 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 广告
2026-05-19 11:18:01.454 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 广告
2026-05-19 11:18:01.454 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 广告
2026-05-19 11:18:01.456 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 唐都生活指南(第二部)
2026-05-19 11:18:01.456 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 金石探文明
2026-05-19 11:18:01.457 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 赤壁之战
2026-05-19 11:18:01.457 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 先秦智慧
2026-05-19 11:18:01.458 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国史通鉴·两晋南北朝篇
2026-05-19 11:18:01.458 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《叶问》郑嘉颖乱世之中寻求武学真谛
2026-05-19 11:18:01.459 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《曾少年》张一山关晓彤爱情事业两不误
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《炊事班的故事Ⅱ》密集承包你的笑点
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《火蓝刀锋》海军陆战队亮刀锋展军魂
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 《问天》致敬中国航天数十年的峥嵘岁月
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 直播
2026-05-19 11:18:01.461 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 节目单
2026-05-19 11:18:01.462 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 频道大全
2026-05-19 11:18:01.462 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 栏目大全
2026-05-19 11:18:01.462 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 综合
2026-05-19 11:18:01.463 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 财经
2026-05-19 11:18:01.463 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 综艺
2026-05-19 11:18:01.463 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中文国际
2026-05-19 11:18:01.463 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体育
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电影
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国防军事
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电视剧
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 纪录
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 科教
2026-05-19 11:18:01.464 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 戏曲
2026-05-19 11:18:01.466 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 社会与法
2026-05-19 11:18:01.466 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻
2026-05-19 11:18:01.466 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 少儿
2026-05-19 11:18:01.467 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 音乐
2026-05-19 11:18:01.467 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体育赛事
2026-05-19 11:18:01.467 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 农业农村
2026-05-19 11:18:01.467 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻联播
2026-05-19 11:18:01.468 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 挑战不可能
2026-05-19 11:18:01.468 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 开讲啦
2026-05-19 11:18:01.470 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 人与自然
2026-05-19 11:18:01.471 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 生活提示
2026-05-19 11:18:01.471 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国诗词大会
2026-05-19 11:18:01.472 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 对话
2026-05-19 11:18:01.472 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 第一时间
2026-05-19 11:18:01.473 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 一槌定音
2026-05-19 11:18:01.474 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 回家吃饭
2026-05-19 11:18:01.474 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 生活家
2026-05-19 11:18:01.475 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 金牌喜剧班
2026-05-19 11:18:01.475 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 舞蹈世界
2026-05-19 11:18:01.475 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 星光大道
2026-05-19 11:18:01.476 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 向幸福出发
2026-05-19 11:18:01.476 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 回声嘹亮
2026-05-19 11:18:01.476 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 健康中国
2026-05-19 11:18:01.477 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 深度国际
2026-05-19 11:18:01.478 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国文艺
2026-05-19 11:18:01.478 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国家记忆
2026-05-19 11:18:01.478 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 华人故事
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 运动一起赢
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 体坛快讯
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 足球之夜
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日影评
2026-05-19 11:18:01.479 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 时光军史馆
2026-05-19 11:18:01.481 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军迷行天下
2026-05-19 11:18:01.482 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军事报道
2026-05-19 11:18:01.483 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军事纪实
2026-05-19 11:18:01.483 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 军事纪录
2026-05-19 11:18:01.484 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 国防故事
2026-05-19 11:18:01.484 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 剧说很好看
2026-05-19 11:18:01.484 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 寰宇视野
2026-05-19 11:18:01.485 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 特别呈现
2026-05-19 11:18:01.485 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 9视频
2026-05-19 11:18:01.485 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 百家讲坛
2026-05-19 11:18:01.486 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 健康之路
2026-05-19 11:18:01.486 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 科幻地带
2026-05-19 11:18:01.486 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 考古公开课
2026-05-19 11:18:01.486 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 解码科技史
2026-05-19 11:18:01.487 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 戏曲青年说
2026-05-19 11:18:01.487 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国京剧音配像精粹
2026-05-19 11:18:01.487 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 过把瘾
2026-05-19 11:18:01.488 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 宝贝亮相吧
2026-05-19 11:18:01.488 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 剧懂法
2026-05-19 11:18:01.489 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 一线
2026-05-19 11:18:01.489 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 普法剧场
2026-05-19 11:18:01.489 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 天网
2026-05-19 11:18:01.489 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 心理访谈
2026-05-19 11:18:01.490 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 律师来了
2026-05-19 11:18:01.490 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 焦点访谈
2026-05-19 11:18:01.490 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 每周质量报告
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 午夜新闻
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻调查
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻周刊
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 周末动画片
2026-05-19 11:18:01.491 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 音乐快递
2026-05-19 11:18:01.493 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻袋袋裤
2026-05-19 11:18:01.493 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 智慧树
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 快乐童行
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 乐享汇
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: CCTV音乐厅
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 中国音乐电视
2026-05-19 11:18:01.494 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 聆听时刻
2026-05-19 11:18:01.496 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 童声唱
2026-05-19 11:18:01.496 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 冰球冰球
2026-05-19 11:18:01.496 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 田野欢歌
2026-05-19 11:18:01.497 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 三农长短说
2026-05-19 11:18:01.498 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 三农群英汇
2026-05-19 11:18:01.498 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 大地讲堂
2026-05-19 11:18:01.498 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 片库
2026-05-19 11:18:01.499 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 热榜
2026-05-19 11:18:01.499 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 看点
2026-05-19 11:18:01.499 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 微视频
2026-05-19 11:18:01.499 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: ai美食
2026-05-19 11:18:01.500 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 主持人
2026-05-19 11:18:01.500 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 全部
2026-05-19 11:18:01.500 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 直播
2026-05-19 11:18:01.500 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 节目单
2026-05-19 11:18:01.501 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 热榜
2026-05-19 11:18:01.502 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 主持人
2026-05-19 11:18:01.502 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 听音
2026-05-19 11:18:01.502 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 新闻联播
2026-05-19 11:18:01.502 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 焦点访谈
2026-05-19 11:18:01.503 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 等着我
2026-05-19 11:18:01.503 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日说法
2026-05-19 11:18:01.503 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 海峡两岸
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日关注
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日亚洲
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 更多
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 电视剧
2026-05-19 11:18:01.504 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 动画片
2026-05-19 11:18:01.506 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 纪录片
2026-05-19 11:18:01.506 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 特别节目
2026-05-19 11:18:01.506 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 更多
2026-05-19 11:18:01.506 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 微视频
2026-05-19 11:18:01.507 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 视频百科
2026-05-19 11:18:01.508 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 微故事
2026-05-19 11:18:01.510 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: ai美食
2026-05-19 11:18:01.510 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 今日热门
2026-05-19 11:18:01.512 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 直播导视
2026-05-19 11:18:01.512 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 精品
2026-05-19 11:18:01.514 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 片库
2026-05-19 11:18:01.515 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 看点
2026-05-19 11:18:01.516 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 央视大全
2026-05-19 11:18:01.518 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 手机访问 扫描下载央 视影音客户端 扫一扫 手机继续看
2026-05-19 11:18:01.521 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 返回顶部
2026-05-19 11:18:01.521 [main] INFO c.e.datacollect.command.CrawlCommand - Successfully crawled 189 articles from https://tv.cctv.com
2026-05-19 11:18:01.521 [main] INFO c.e.datacollect.view.ConsoleView - Success: Crawled 189 articles.
2026-05-19 11:18:01.522 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:18:12.244 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: list
2026-05-19 11:18:12.244 [main] DEBUG c.e.datacollect.command.ListCommand - Listing articles
2026-05-19 11:18:12.245 [main] DEBUG c.e.d.repository.ArticleRepository - Returning 190 articles (unmodifiable)
2026-05-19 11:18:12.246 [main] DEBUG c.e.datacollect.view.ConsoleView - Displaying 190 articles
2026-05-19 11:18:12.317 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:18:49.649 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: crawl
2026-05-19 11:18:49.650 [main] INFO c.e.datacollect.command.CrawlCommand - Crawl started for: https://www.tianqi.com/changsha
2026-05-19 11:18:49.651 [main] DEBUG c.e.d.strategy.HnuNewsStrategy - HnuNewsStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:18:49.651 [main] DEBUG c.e.d.strategy.BlogStrategy - BlogStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:18:49.651 [main] DEBUG c.e.d.strategy.NewsStrategy - NewsStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:18:49.652 [main] DEBUG c.e.d.strategy.WeiboHotStrategy - WeiboHotStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:18:49.652 [main] DEBUG c.e.d.strategy.CctvNewsStrategy - CctvNewsStrategy supports URL https://www.tianqi.com/changsha: false
2026-05-19 11:18:49.652 [main] DEBUG c.e.d.strategy.WeatherStrategy - WeatherStrategy supports URL https://www.tianqi.com/changsha: true
2026-05-19 11:18:49.663 [main] DEBUG c.e.d.strategy.StrategyFactory - Found strategy WeatherStrategy for URL: https://www.tianqi.com/changsha
2026-05-19 11:18:49.666 [main] INFO c.e.datacollect.command.CrawlCommand - Starting crawl for URL: https://www.tianqi.com/changsha
2026-05-19 11:18:49.668 [main] DEBUG c.e.datacollect.view.ConsoleView - Info: Crawling: https://www.tianqi.com/changsha
2026-05-19 11:18:49.669 [main] DEBUG c.e.datacollect.command.CrawlCommand - Attempt 1 to fetch URL: https://www.tianqi.com/changsha
2026-05-19 11:18:49.912 [main] DEBUG c.e.d.strategy.WeatherStrategy - Parsing weather page: https://www.tianqi.com/changsha
2026-05-19 11:18:49.921 [main] INFO c.e.d.strategy.WeatherStrategy - Parsed 1 weather items
2026-05-19 11:18:49.923 [main] DEBUG c.e.d.repository.ArticleRepository - Added article: 2026年05月19日 长沙天气
2026-05-19 11:18:49.941 [main] INFO c.e.datacollect.command.CrawlCommand - Successfully crawled 1 articles from https://www.tianqi.com/changsha
2026-05-19 11:18:49.945 [main] INFO c.e.datacollect.view.ConsoleView - Success: Crawled 1 articles.
2026-05-19 11:18:49.948 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console
2026-05-19 11:18:54.406 [main] DEBUG c.e.d.controller.CrawlerController - Executing command: list
2026-05-19 11:18:54.406 [main] DEBUG c.e.datacollect.command.ListCommand - Listing articles
2026-05-19 11:18:54.407 [main] DEBUG c.e.d.repository.ArticleRepository - Returning 191 articles (unmodifiable)
2026-05-19 11:18:54.407 [main] DEBUG c.e.datacollect.view.ConsoleView - Displaying 191 articles
2026-05-19 11:18:54.473 [main] DEBUG c.e.datacollect.view.ConsoleView - Reading input from console