336 文字壓縮

中文english

給你乙個只含有可見字元（ascii 碼範圍 3232 至 126126）文字檔案，檔案中可能出現一些重複的單詞，你需要對它們進行壓縮。

壓縮規則如下：

如果原檔案中的字元不是英文本元，那麼保留這些字元。

我們將連續的英文本元視為乙個單詞，單詞的前後不應該還有其它的英文本元。

輸入： please, please do it--it would please mary very, very much.

thanks

輸出： please, please do it--4 would 2 mary very, 7 much.

thanks

樣例中，please是第 22 個出現的不同的單詞，it是第 44 個出現的不同的單詞，very是第 77 個出現的不同的單詞。

壓縮中應該大小寫敏感，"abc" 和 "abc" 不是同乙個單詞。

文字中總的字元數為 nn，1 \le n \le 10^41≤n≤104。

乙個單詞不會跨行顯示。

字串切割 + 是否字元判斷

class
solution:
"""@param lines: the text to compress.
@return: 
return
the text after compression.
"""def textcompression(self, lines):
# write your code here.
if not lines: return
''ss = '|'
.join(lines) 
s_dict, res = {}, ''
count, index = 0, 0
length =len(ss)
while index temp_s = ''
ifss[index].isalpha():
right =index
while right temp_s +=ss[right] 
right += 1
#判斷是否之前已存dict
if temp_s not in
s_dict.keys():
count += 1
s_dict[temp_s] =str(count)
else
: temp_s =s_dict[temp_s]
res +=temp_s 
#更新index =right
else
: res +=ss[index]
index += 1
#格式矯正
results = res.split('|'
) 
return results

5文字溢位

文字溢位 overflow visible 預設值 hidden 超出隱藏 scroll 顯示滾動條 auto 自適應檢視 inherit 繼承父元素overflow值文字換行 white space normal 預設值 pre 原格式輸出，空白會被瀏覽器保留 pre wrap 文字不會換行，文字...

The HTML Language 二文字標誌

後面的標誌對未經特別說明，均用在之間1.標誌對是用來建立乙個段落段與段之間空一行在此標誌對之間加入的文字將按照段落的格式顯示在瀏覽器上標誌還可以使用 align 屬性，它用來說明對齊方式格式 left 左對齊center 居中right 右對齊2.是乙個很簡單的標誌，它沒有結束標誌，它...

6 1 文字表示（one hot）

one hot原理 import jieba import numpy as np from collections import counter stopwords line.strip for line in open data stopwords.data r encoding utf 8 r...

336 文字壓縮

5文字溢位

The HTML Language 二 文字標誌

6 1 文字表示（one hot）

相關推薦

The HTML Language 二文字標誌