I would like to ask how can we do better in the use of rules for this kind of topic?

topic description

paragraph 1 and paragraph 1, respectively, take out the following operating income, net profit attributable to the owner of the parent company, and net profit attributable to the owner of the parent company after deducting non-recurring profits and losses, in the format of tuple, (date, name, value, unit), formatted as 2017-9-30, and stored in list

II. Take out the registered capital of paragraph 2, that is, (2015, registered capital, 7900.00 US dollars), and deposit it into one list

topic sources and their own ideas

text content:
1. The operating income of the company from January to September 2017, the net profit attributable to the owner of the parent company and the net profit attributable to the owner of the parent company after deducting non-recurring profits and losses were 987.6899 million yuan, 140.7998 million yuan and 136.49 million yuan respectively, up 10.27%, 35.63% and 20.30% from January to September 2016, respectively, reflecting the good operation of the company.
2. The main reasons for the company"s higher management expenses in 2015 than in 2014 and lower management expenses in 2016 compared with 2015 are as follows: in March 2015, the company"s registered capital increased from US $79 million to US $91.61256 million. Ningbo Lingqi, Ningbo Lingxi, Ningbo Lingqi, Ningbo Lingxin and Ningbo Lingxi are the employee shareholding platforms of the company, according to the company"s December 2014

.

related codes

/ / Please paste the code text below (do not replace the code with pictures)

-sharp-*-coding:utf-8-*-
import re
import json

data = ["2017-9-30"]
list = []
file = open(u".txt")
key = file.read().decode("gbk")
print key
pattern = re.compile(u"(?<=1-9).*(?=)")
matcher = re.search(pattern,key)
pass
pattern1 = re.compile(u"")
matcher1 = re.search(pattern1,matcher.group())
data.append(matcher1.group())
pass
pattern2 = re.compile(u"[0-9].*?(?=)")
matcher2 = re.search(pattern2,matcher.group())
data.append(matcher2.group())
data.append("")
data = tuple(data)
list.append(data)
pass
data2 = ["2017-9-30"]
pattern3 = re.compile(u"")
matcher3 = re.search(pattern3,matcher.group())
data2.append(matcher3.group())
pass
pattern4 = re.compile(u"(?<=).*?(?=)")
matcher4 = re.search(pattern4,matcher.group())
data2.append(matcher4.group())
data2.append("")
data2 = tuple(data2)
list.append(data2)
pass
data3 = ["2017-9-30"]
pattern5 = re.compile(u"")
matcher5 = re.search(pattern5,matcher.group())
data3.append(matcher5.group())
pass
pattern6 = re.compile(u"(?<=).*?(?=)")
matcher6 = re.search(pattern6,matcher.group())
data3.append(matcher6.group())
data3.append("")
data3 = tuple(data3)
list.append(data3)
pass
pass
dataTwo = []
patternTwo = re.compile(u"(?<=).*")
matcherTwo = re.search(patternTwo,key)
pass
patternTwo1 = re.compile(u"2015")
matcherTwo1 = re.search(patternTwo1,matcherTwo.group())
dataTwo.append(matcherTwo1.group())
pass
patternTwo2 = re.compile(u"")
matcherTwo2 = re.search(patternTwo2,matcherTwo.group())
dataTwo.append(matcherTwo2.group())
pass
patternTwo3 = re.compile(u"(?<=).*?(?=)")
matcherTwo3 = re.search(patternTwo3,matcherTwo.group())
dataTwo.append(matcherTwo3.group())
dataTwo.append("")
dataTwo = tuple(dataTwo)
list.append(dataTwo)
for i in range(0,4):
    print "\n"
    for j in list[i]:
        print j,

what result do you expect? What is the error message actually seen?

although the data can be extracted according to the requirements now, I feel that this extraction method is too stupid. I hope all the bosses can give us some advice on how to improve it.

Apr.03,2021

I don't quite understand. A matching amount is written below for reference. In the second paragraph, matching capital is also available. You only need to use the first item, and the year is compared by analogy

.
var str1 = '20171-998,768.9914,079.9813,649.0020161-910.27%35.63%20.30%,';
str1.match(/([\d,.])+/g);
Menu