文字列内のURLをリンクするC＃コード

Question

文字列を解析し、文字列に含まれる可能性のあるURLを「リンク」する優れたc＃コード（および正規表現）を持っている人はいますか？

Konstantin Tarkus · Accepted Answer

これは非常に単純なタスクであり、 Regex と、次の正規表現を使用してすぐに実行できます。

http://regexlib.com/

何かのようなもの：

var html = Regex.Replace(html, @"^(http|https|ftp)\://[a-zA-Z0-9\-\.]+" + "\.[a-zA-Z]{2,3}(:[a-zA-Z0-9]*)?/?" + "([a-zA-Z0-9\-\._\?\,\'/\\+&amp;%\$#\=~])*$", "<a href=\"$1\">$1</a>");

リンクの作成だけでなく、URLの短縮にも興味があるかもしれません。これはこの主題に関する良い記事です：

C＃でURLを解決して短縮する

関連項目：

Vance Smith · Answer

protected string Linkify( string SearchText ) { // this will find links like: // http://www.mysite.com // as well as any links with other characters directly in front of it like: // href="http://www.mysite.com" // you can then use your own logic to determine which links to linkify Regex regx = new Regex( @"\b(((\S+)?)(@|mailto\:|(news|(ht|f)tp(s?))\://)\S+)\b", RegexOptions.IgnoreCase ); SearchText = SearchText.Replace( "&nbsp;", " " ); MatchCollection matches = regx.Matches( SearchText ); foreach ( Match match in matches ) { if ( match.Value.StartsWith( "http" ) ) { // if it starts with anything else then dont linkify -- may already be linked! SearchText = SearchText.Replace( match.Value, "<a href='" + match.Value + "'>" + match.Value + "</a>" ); } } return SearchText; }

M4N · Answer

これで読むことができるほど簡単ではありません Jeff Atwoodによるブログ投稿。 URLがどこで終わるかを検出するのは特に困難です。

たとえば、URLの末尾の括弧部分は次のとおりです。

http://en.wikipedia.org/wiki/PCTools(CentralPointSoftware）
括弧内のURL（http://en.wikipedia.org）その他のテキスト

最初のケースでは、括弧はURLの一部です。 2番目のケースではそうではありません！

Berezh · Answer

クラスがあります：

public class TextLink { #region Properties public const string BeginPattern = "((http|https)://)?(www.)?"; public const string MiddlePattern = @"([a-z0-9\-]*\.)+[a-z]+(:[0-9]+)?"; public const string EndPattern = @"(/\S*)?"; public static string Pattern { get { return BeginPattern + MiddlePattern + EndPattern; } } public static string ExactPattern { get { return string.Format("^{0}$", Pattern); } } public string OriginalInput { get; private set; } public bool Valid { get; private set; } private bool _isHttps; private string _readyLink; #endregion #region Constructor public TextLink(string input) { this.OriginalInput = input; var text = Regex.Replace(input, @"(^\s)|(\s$)", "", RegexOptions.IgnoreCase); Valid = Regex.IsMatch(text, ExactPattern); if (Valid) { _isHttps = Regex.IsMatch(text, "^https:", RegexOptions.IgnoreCase); // clear begin: _readyLink = Regex.Replace(text, BeginPattern, "", RegexOptions.IgnoreCase); // HTTPS if (_isHttps) { _readyLink = "https://www." + _readyLink; } // Default else { _readyLink = "http://www." + _readyLink; } } } #endregion #region Methods public override string ToString() { return _readyLink; } #endregion }

この方法で使用します。

public static string ReplaceUrls(string input) { var result = Regex.Replace(input.ToSafeString(), TextLink.Pattern, match => { var textLink = new TextLink(match.Value); return textLink.Valid ? string.Format("<a href=\"{0}\" target=\"_blank\">{1}</a>", textLink, textLink.OriginalInput) : textLink.OriginalInput; }); return result; }

テストケース：

[TestMethod] public void RegexUtil_TextLink_Parsing() { Assert.IsTrue(new TextLink("smthing.com").Valid); Assert.IsTrue(new TextLink("www.smthing.com/").Valid); Assert.IsTrue(new TextLink("http://smthing.com").Valid); Assert.IsTrue(new TextLink("http://www.smthing.com").Valid); Assert.IsTrue(new TextLink("http://www.smthing.com/").Valid); Assert.IsTrue(new TextLink("http://www.smthing.com/publisher").Valid); // port Assert.IsTrue(new TextLink("http://www.smthing.com:80").Valid); Assert.IsTrue(new TextLink("http://www.smthing.com:80/").Valid); // https Assert.IsTrue(new TextLink("https://smthing.com").Valid); Assert.IsFalse(new TextLink("").Valid); Assert.IsFalse(new TextLink("smthing.com.").Valid); Assert.IsFalse(new TextLink("smthing.com-").Valid); } [TestMethod] public void RegexUtil_TextLink_ToString() { // default Assert.AreEqual("http://www.smthing.com", new TextLink("smthing.com").ToString()); Assert.AreEqual("http://www.smthing.com", new TextLink("http://www.smthing.com").ToString()); Assert.AreEqual("http://www.smthing.com/", new TextLink("smthing.com/").ToString()); Assert.AreEqual("https://www.smthing.com", new TextLink("https://www.smthing.com").ToString()); }

Yauhen.F · Answer

次の正規表現が見つかりました http://daringfireball.net/2010/07/improved_regex_for_matching_urls

私にとってはとてもよさそうだ。 Jeff Atwoodソリューションは、多くのケースを処理しません。 josefresno 私にはすべてのケースを処理しているようです。しかし、私がそれを理解しようとしたとき（サポート要求の場合）、私の脳は沸騰しました。

Muhammad Awais · Answer

これは私のために働きます：

str = Regex.Replace(str, @"((http|ftp|https):\/\/[\w\-_]+(\.[\w\-_]+)+([\w\-\.,@?^=%&amp;:/~\+#]*[\w\-\@?^=%&amp;/~\+#])?)", "<a target='_blank' href='$1'>$1</a>");