我正在寻找一个正则表达式,它允许我验证字符串是否是对网站地址或该网站中特定页面的引用.
所以它会匹配:
http://google.com ftp://google.com http://google.com/ http://lots.of.subdomains.google.com
但不是:
http://google.com/search.whatever ftp://google.com/search.whatever http://lots.of.subdomains.google.com/search.whatever
有任何想法吗?我无法弄清楚如何处理允许/
在URL的末尾.
试试这个:
(http|ftp|https)://([a-zA-Z0-9\-\.]+)/?
这是我的完整URI验证模式的缩短版本,基于规范.我写这个是因为规范允许我在网上找到的任何验证模式中都没有包含许多字符.您将看到用户/传递(以及第二种模式,路径和查询字符串)比您想象的要宽松得多.
/^(https?|ftp):\/\/(?# protocol )(([a-z0-9$_\.\+!\*\'\(\),;\?&=-]|%[0-9a-f]{2})+(?# username )(:([a-z0-9$_\.\+!\*\'\(\),;\?&=-]|%[0-9a-f]{2})+)?(?# password )@)?(?# auth requires @ )((([a-z0-9][a-z0-9-]*[a-z0-9]\.)*(?# domain segments AND )[a-z]{2}[a-z0-9-]*[a-z0-9](?# top level domain OR )|(\d|[1-9]\d|1\d{2}|2[0-4][0-9]|25[0-5]\.){3}(?# )(\d|[1-9]\d|1\d{2}|2[0-4][0-9]|25[0-5])(?# IP address ))(:\d+)?(?# port ))\/?$/i
而且由于我花时间将其分解为更具可读性,所以这里是完整的模式:
/^(https?|ftp):\/\/(?# protocol )(([a-z0-9$_\.\+!\*\'\(\),;\?&=-]|%[0-9a-f]{2})+(?# username )(:([a-z0-9$_\.\+!\*\'\(\),;\?&=-]|%[0-9a-f]{2})+)?(?# password )@)?(?# auth requires @ )((([a-z0-9][a-z0-9-]*[a-z0-9]\.)*(?# domain segments AND )[a-z]{2}[a-z0-9-]*[a-z0-9](?# top level domain OR )|(\d|[1-9]\d|1\d{2}|2[0-4][0-9]|25[0-5]\.){3}(?# )(\d|[1-9]\d|1\d{2}|2[0-4][0-9]|25[0-5])(?# IP address ))(:\d+)?(?# port ))(((\/+([a-z0-9$_\.\+!\*\'\(\),;:@&=-]|%[0-9a-f]{2})*)*(?# path )(\?([a-z0-9$_\.\+!\*\'\(\),;:@&=-]|%[0-9a-f]{2})*)(?# query string )?)?)?(?# path and query string optional )(#([a-z0-9$_\.\+!\*\'\(\),;:@&=-]|%[0-9a-f]{2})*)?(?# fragment )$/i
请注意,某些(所有?)javascript实现不支持正则表达式中的注释.