Skip to content Skip to sidebar Skip to footer

How Can I Parse Html String

I have a html string and i want to parse this;

Solution 1:

HtmlAgilityPack.HtmlDocumentdoc=newHtmlAgilityPack.HtmlDocument();
doc.LoadHtml(html);

varteams= doc.DocumentNode.SelectNodes("//td[@width='313']")
                .Select(td => newTeamClass
                {
                    TeamName = td.Element("a").InnerText,
                    TeamId = HttpUtility.ParseQueryString(td.Element("a").Attributes["href"].Value)["ItemTypeID"]
                })
                .ToList();

Solution 2:

Have a look at this lib http:HTML Agility Pack. It helps you with HTML parsing.

Solution 3:

You can use Regular expression

String html; //your html stringStringpattern= @"action=ViewItemDetails&ItemType[I|i]D=(\d*)"">(.*)</a>";
MatchCollectionmatches= Regex.Matches(html, pattern);
varlist=newList<TeamClass>();
foreach (Match match in matches)
{
    TeamClassteam=newTeamClass();
    team.TeamName = match.Groups[2].Value;
    team.TeamId = Int32.Parse(match.Groups[1].Value);
    list.Add(team);
}

Solution 4:

Try Html Agility:

try something like (Untested Code):

var TeamList = from lnks in document.DocumentNode.Descendants()
               where lnks.Name == "a" && 
                    lnks.Attributes["href"] != null && 
                    lnks.InnerText.Trim().Length > 0selectnew
               {

                  TeamId= (lnks.Attributes["href"].Value).
                          Substring((lnks.Attributes["href"].Value).Length-1, 1),
                  TeamName= lnks.InnerText
               };

Post a Comment for "How Can I Parse Html String"