1.html相关的标签
匹配 <video> 标签:<video.*?>.+?</video>
pdfFile = pdfFile.replace("<br.?+>", "<br/>");
String repContent = pdfFile.replaceAll("<img(.?+)>", "<img$1/>");
方法一(有缺陷)
String contents = repContent.replaceAll("<img src="/cds_filestorage/download-s", "<img src="**/cds_filestorage/download-s");
方法二(完美)
String contents = ss.replaceAll("src="/**/download-s", "src="**/download-s");
2.去掉特殊符号:
public static String FilterStringName(String str){ |