wuxifeng116 2022-01-09 21:32 采纳率: 75%
浏览 86
已结题

请问怎样用VB.net或C#获取以下网页截图中红框中的标题文字及其链接?

  • 写回答

2条回答 默认 最新

  • CSDN专家-showbo 2022-01-09 23:33
    关注

    需要POST请求:https://b2b.10086.cn/b2b/main/listVendorNoticeResult.html?noticeBean.noticeType=2
    参数:page.currentPage=1&page.perPageSize=20&noticeBean.sourceCH=&noticeBean.source=&noticeBean.title=&noticeBean.startDate=&noticeBean.endDate=&_qt=mY3E2NU2N5MmM0UWNygDM4ATY0EDZ2MjM3YjMjZDN3Q

    示例代码如下

    img

    using System;
    using System.Text;
    using System.Net;
    using System.Text.RegularExpressions;
    using System.IO;
    namespace ConsoleApp2
    {
        class Program
        {
            static void Main(string[] args)
            {
    
                var url = "https://b2b.10086.cn/b2b/main/listVendorNoticeResult.html?noticeBean.noticeType=2";
                
                var req = (HttpWebRequest)HttpWebRequest.Create(url);
               
                req.Method = "post";
                req.ContentType = "application/x-www-form-urlencoded; charset=UTF-8";
                //下面3个头必须
                req.Referer = "https://b2b.10086.cn/b2b/main/listVendorNotice.html?noticeType=2";
                req.UserAgent = "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/97.0.4692.71 Safari/537.36 Edg/97.0.1072.55";
                req.Headers.Add("cookie", "JSESSIONID=B3155698F7F1CF723C2F9E03CB29CDB7");
    
                var s = "page.currentPage=1&page.perPageSize=20&noticeBean.sourceCH=&noticeBean.source=&noticeBean.title=&noticeBean.startDate=&noticeBean.endDate=&_qt=mY3E2NU2N5MmM0UWNygDM4ATY0EDZ2MjM3YjMjZDN3Q";
    
                var data = Encoding.UTF8.GetBytes(s);
                req.ContentLength = data.Length;
    
                var io = req.GetRequestStream();
                io.Write(data,0,data.Length);
                io.Flush();
    
                var resp = (HttpWebResponse)req.GetResponse();
                io = resp.GetResponseStream();
                var sr = new StreamReader(io);
                s = sr.ReadToEnd();
    
                io.Close();
    
                var re = new Regex("selectResult\\('\\d+'\\)\">"
                    + "\\s+<td[^>]+>([\\s\\S]+?)</td>"
                    + "\\s+<td[^>]+>([\\s\\S]+?)</td>"
                    + "\\s+<td[^>]+>([\\s\\S]+?)</td>"
                    + "\\s+<td[^>]+>([\\s\\S]+?)</td>\\s*</tr>", RegexOptions.Compiled | RegexOptions.IgnoreCase);
    
                var reTitle = new Regex("title=\"([^\"]+)\"", RegexOptions.Compiled | RegexOptions.IgnoreCase);
                var reConent = new Regex("<a[^>]+>([\\s\\S]+?)</a>", RegexOptions.Compiled | RegexOptions.IgnoreCase);
    
                MatchCollection mc = re.Matches(s);
                foreach (Match m in mc)
                {
                    string title = m.Groups[3].Value;
                    title = (title.Contains("title=\"")?reTitle:reConent).Match(title).Groups[1].Value;
                    Console.WriteLine(m.Groups[1].Value.Trim() + "\t" + m.Groups[2].Value.Trim() + "\t" + title.Trim() + "\t" + m.Groups[4].Value.Trim());
                    Console.WriteLine();
                }
    
    
    
                Console.Read();
            }
        }
    }
    
    
    
    

    img


    有其他问题可以继续交流~

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

问题事件

  • 已结题 (查看结题原因) 1月16日
  • 已采纳回答 1月10日
  • 修改了问题 1月9日
  • 创建了问题 1月9日

悬赏问题

  • ¥15 用lstm来预测股票价格
  • ¥15 请问paddlehub能支持移动端开发吗?在Android studio上该如何部署?
  • ¥170 如图所示配置eNSP
  • ¥20 docker里部署springboot项目,访问不到扬声器
  • ¥15 netty整合springboot之后自动重连失效
  • ¥15 悬赏!微信开发者工具报错,求帮改
  • ¥20 wireshark抓不到vlan
  • ¥20 关于#stm32#的问题:需要指导自动酸碱滴定仪的原理图程序代码及仿真
  • ¥20 设计一款异域新娘的视频相亲软件需要哪些技术支持
  • ¥15 stata安慰剂检验作图但是真实值不出现在图上