Parsing a JSON string embedded in HTML with Python
The JSON string appears in the HTML in the following format (shown as a screenshot in the original post):
The parsing code is as follows:
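import re
import json

# content is assumed to hold the page's HTML source as a str
# Non-greedy match of the object assigned to publish_page, up to the first "};"
JSON = re.compile(r'publish_page = (\{.*?\});', re.DOTALL)
matches = JSON.search(content)
if matches is None:
    raise ValueError('publish_page not found in content')
data = matches.group(1)
result = json.loads(data)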
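To try the snippet above without a real page, here is a minimal self-contained sketch. The HTML in `content` is a made-up sample, and the names `PUBLISH_PAGE_RE` and `extract_publish_page` are hypothetical helpers introduced only for illustration; the regex is the same pattern as above.

```python
import json
import re

# Hypothetical page source; a real `content` would come from a file or an HTTP response.
content = """
<html><body>
<script>
var publish_page = {"title": "demo", "tags": ["a", "b"], "count": 2};
</script>
</body></html>
"""

# Same pattern as in the post: capture the object literal up to the first "};"
PUBLISH_PAGE_RE = re.compile(r'publish_page = (\{.*?\});', re.DOTALL)


def extract_publish_page(html):
    """Return the object assigned to publish_page as a dict, or None if it is absent."""
    match = PUBLISH_PAGE_RE.search(html)
    if match is None:
        return None
    return json.loads(match.group(1))


parsed = extract_publish_page(content)
print(parsed["title"], parsed["count"])
```

Returning `None` when the pattern is missing is one possible design choice; raising an exception would work equally well if a missing variable should be treated as an error.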
The result is a Python dict parsed from the JSON (shown as a screenshot in the original post).
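One caveat with the non-greedy pattern: it stops at the first `};`, so it can cut the JSON short if a string value happens to contain that sequence. A possible alternative, sketched here under the assumption that the value after `publish_page = ` is strict JSON, is to locate the marker and let `json.JSONDecoder().raw_decode` find where the value ends. The helper name `extract_json_after` and the sample string are hypothetical.

```python
import json


def extract_json_after(html, marker="publish_page = "):
    """Decode the first JSON value that follows marker; return None if marker is missing."""
    start = html.find(marker)
    if start == -1:
        return None
    # raw_decode parses exactly one JSON value starting at the given index
    # and ignores the text after it, so a "};" inside a string is harmless.
    value, _end = json.JSONDecoder().raw_decode(html, start + len(marker))
    return value


# Made-up sample whose string value contains "};", which would truncate the regex match.
sample = 'var publish_page = {"snippet": "a = {};", "n": 1};'
print(extract_json_after(sample))
```

Like `json.loads`, this raises `json.JSONDecodeError` if the text after the marker is not valid JSON.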
References
[How to extract a JSON object that was defined in a HTML page javascript block using Python? – Stack Overflow](https://stackoverflow.com/questions/13323976/how-to-extract-a-json-object-that-was-defined-in-a-html-page-javascript-block-us)