• No results found

MEGABLAST and BLASTn searches of ORFs

Sequence 1

Total ORFs Hits No hits

MEGABLAST 79 36 43

BLASTn 79 79 0

Hits on MEGABLAST

ORF name

Length (bp)

Organism Accession Pairwise identity

EU140752.1 100 100 WP_072756907:

hypothetical protein

771 Planctomycetes bacterium

990 Planctomycetes bacterium

CP036525 82.0 93.03 0 TWU40200: putative nucleotidyltransferase S1 -

ORF0010

414 Roseimaritima ulvae

609 Rhodopirellula baltica

S1 - ORF0021

354 Planctomycetes bacterium

384 Planctomycetes bacterium

1116 Planctomycetes sp.

CP011270 69.6 87.63 3.88

× 10 -101

WP_068259366:

slipin family protein

S1 - ORF0038

1038 Planctomycetes bacterium

477 Methylocystis sp.

396 Planctomycetes bacterium

396 Rhodopirellula baltica

270 Rhodopirellula baltica

108 Rhodopirellula baltica

357 Rhodopirellula baltica

726 Planctomycetes bacterium

S1 - ORF0090

636 Rhodopirellula baltica

BX294153 77.3 97.33 2.99

× 10 -131

WP_068264266: HD domain-containing protein

S1 - ORF0094

279 Rhodopirellula baltica

792 Planctomycetes bacterium

321 Planctomycetes bacterium

552 Planctomycetes bacterium

CP036262 100 100 0 WP_146459133:

CIA30 family protein S1 -

ORF00114

123 Planctomycetes bacterium

CP036262 100 100 0 WP_007332630:

CoA-binding protein S1 -

ORF00122

390 Planctomycetes bacterium

CP036262 100 100 0 WP_068264250:

CoA-binding protein S1 -

ORF00123

267 Planctomycetes bacterium

735 Rhodopirellula baltica

867 Rhodopirellula baltica

S1 - ORF00147

513 pCC2FOS fosmid vector

EU140752.1 100 100 WP_171831280:

tyrosine-type

EU140752.1 100 100 WP_001302176:

hypothetical protein

EU140752.1 100 100 EEU3264372:

replication initiation

EU140752.1 100 100 WP_001365321:

hypothetical protein S1 -

ORF00155

1176 pCC2FOS fosmid vector

EU140752.1 100 100 WP_001304218:

plasmid-partitioning

EU140752.1 100 100 WP_059242500:

ParB/RepB/Spo0J family plasmid partition protein

BLASTn search of «No hits» on MEGABLAST

ORF name Length (bp) Identical proteins Grade (%) Number of

results (out of 10)

S1 - ORF0013 99 WP_162273174:

hypothetical protein [Rubripirellula obstinata]

73.5 10

S1 - ORF0014 495 WP_146516087:

rhodanese-like domain-containing protein [Rubripirellula amarantea]

92.7 10

S1 - ORF0016 102 -

S1 - ORF0017 96 -

S1 - ORF0018 99 -

S1 - ORF0023 951 QDU26629: Group II

intron-encoded protein LtrA [Planctomycetes bacterium ETA_A8]

86.7 10

S1 - ORF0024 471 REJ87213: group II intron

reverse

transcriptase/maturase [Planctomycetes bacterium]

88.5 10

S1 - ORF0025 105 -

S1 - ORF0026 96 -

S1 - ORF0034 651 WP_068258815: RNA

2'-phosphotransferase [Rubripirellula obstinata]

87.5 10

S1 - ORF0037 660 WP_146405649:

tyrosine-protein phosphatase [Planctomycetes bacterium Poly21]

95.4 10

S1 - ORF0041 264 WP_146462253: M28

S1 - ORF0046 225 TWT97173: hypothetical

protein Pla100_23220

S1 - ORF0060 1548 MBA3583771: transposase

[Gemmatimonadetes RelE/ParE family toxin [Planctomycetes bacterium EC9]

74.7 10

S1 - ORF0066 141 TWU39980: hypothetical

protein Q31b_32960 [Planctomycetes bacterium Q31b]

46.3 2

S1 - ORF0068 222 -

S1 - ORF0096 162 TWT51002: hypothetical

protein Pla22_37780

[Planctomycetes bacterium Pla52n]

S1 - ORF00129 474 WP_068264232:

hypothetical protein [Rubripirellula obstinata]

83.8 10

S1 - ORF00130 105 RLS74515: ISAs1 family

transposase, partial [Planctomycetes bacterium]

57.4 6

S1 - ORF00133 666 OYV96015: hypothetical

protein B7Z68_06230 [Acidobacteria bacterium 21-70-11]

45.0 10

S1 - ORF00135 117 -

S1 - ORF00136 456 AJY36415: putative lipo domain protein

[Burkholderia mallei]

26.6 4

S1 - ORF00138 1413 WP_008674207: HAMP

domain-containing histidine kinase

[Rhodopirellula sallentina]

88.9 10

S1 - ORF00142 429 WP_009095090:

aminoacyl-tRNA hydrolase [Rhodopirellula sp. SWK7]

92.3 10

No hits on MEGABLAST or BLASTn: 0

Sequence 2

Total ORFs Hits No hits

MEGABLAST 41 34 7

BLAStn 41 41 0

Hits on MEGABLAST

ORF name Length (bp)

Organism Accession Pairwise identity

100 100 EEV9030702hypothetical

protein:

100 100 ADE34479: hypothetical

protein

CP018760 86.9 100 0 WP_073245624:

tRNA-

S2 - ORF0010

2068 Maribacte r sp.

CP011318 84.4 100 0 WP_133689069.1:

polyphosphate kinase 1 S2 -

bifunctional pyr operon transcriptional DNA-binding LytR/AlgR family response regulator

LT629754 99.60 80.9 0 WP_133689060.1

galactose

LT629754 93.6 100 0 WP_073245651.1:30S

ribosomal protein S1 S2 -

S2 - ORF0027

2457 Maribacte r sp.

LT629754 84.7 100 0 TDT37164.1:

ATP-dependent Lon protease S2 - factor, ECF subfamily S2 -

family MFS transporter S2 -

glycoside hydrolase family 31 protein

S2 -

BLASTn search of «No hits» on MEGABLAST

ORF name Length (bp) Identical proteins Grade (%) Number of

results (out of 10)

S2 - ORF007 852 WP_073245622: EboA

domain-containing protein [Maribacter aquivivus]

94.7 10

S2 - ORF0011 246 WP_133689068: histidine

phosphatase family protein [Maribacter spongiicola]

96.9 10

S2 - ORF0014 333 WP_133689066:

ribonuclease Z [Maribacter spongiicola]

98.6 10

S2 - ORF0019 342 WP_133689061: DUF4907

domain-containing protein [Maribacter spongiicola]

93.4 10

S2 - ORF0022 1251 TDT37168: long-subunit

fatty acid transport protein [Maribacter spongiicola]

97.8 10

S2 - ORF0028 120 -

S2 - ORF0033 735 WP_036158526: DUF2807

domain-containing protein [Maribacter forsetii]

93.6 10

No hits on Megablast or Blastn: 0

Sequence 3

Total ORFs Hits No hits

MEGABLAST 56 19 37

BLASTn 56 45 11

Hits on MEGABLAST

ORF name Length (bp)

Organism Accession Pairwise identity (%)

Query coverage (%)

E-value Protein

S3 -

S3 -

elongation factor 4 S3 -

ORF0037

894 Planctomycet es bacterium

CP036432 68.8 55.82****

*

1270 Crateriforma conspicua

acyl carrier protein S3 -

S3 -

BLASTn search of «No hits» on MEGABLAST

ORF name Length (bp) Identical proteins Grade

(%)

S3 - ORF0018 615 WP_052031530: class I SAM-dependent methyltransferase [Rhodopirellula maiorica]

79.8 10

S3 - ORF0020 618 EMI42599: putative membrane protein [Rhodopirellula sp. SWK7]

77.1 10

S3 - ORF0024 1086 WP_145098080: ROK family protein [Planctomycetes bacterium Poly24]

81.9 10

S3 - ORF0026 345 MAI72440: hypothetical protein [Rhodopirellula sp.]

65.3 10

S3 - ORF0027 102 -

S3 - ORF0029 2808 WP_154900441: circularly permuted type 2 ATP-grasp protein [Planctomycetes bacterium CA11]

70.8 10

S3 - ORF0030 885 WP_145262651: transglutaminase family protein [Planctomycetes bacterium Pan216]

76.6 10

S3 - ORF0031 792 WP_145419616: GIY-YIG nuclease family protein [Planctomycetes bacterium K23_9]

74.8 10

S3 - ORF0032 1101 QDV58998: hypothetical protein Mal33_50230 [Planctomycetes bacterium Mal33]

77.8 10

S3 - ORF0034 984 WP_164102765: D-2-hydroxyacid dehydrogenase [Roseimaritima sp. JC640]

85.0 10

S3 - ORF0035 1155 WP_164102793: protein kinase [Roseimaritima sp. JC640]

89.5 10

S3 - ORF0038 1227 WP_075084119:

endonuclease/exonuclease/phosphatase family protein [Mariniblastus fucicola]

74.6 10

S3 - ORF0039 456 WP_164101729: response regulator [Roseimaritima sp. JC640]

90.6 10

S3 - ORF0041 102 -

S3 - ORF0043 141 -

S3 - ORF0045 1428 WP_164104070: MFS transporter [Roseimaritima sp. JC640]

78.5 10

S3 - ORF0046 114 - 10

S3 - ORF0047 216 PHR99503: IS5/IS1182 family transposase [Blastopirellula sp.]

52.9 10

S3 - ORF0048 129 -

S3 - ORF0051 840 WP_164104021: purine-nucleoside phosphorylase [Roseimaritima sp. JC640]

89.1 10

S3 - ORF0052 126 -

S3 - ORF0053 1458 WP_146406530: sulfatase-like

hydrolase/transferase [Planctomycetes bacterium Poly21]

83.6 10

S3 - ORF0060 921 WP_164103642: ACP S-malonyltransferase [Roseimaritima sp. JC640]

84.9 10

S3 - ORF0062 96 GDX91350: 50S ribosomal protein L32 [Planctomycetia bacterium]

44.2 2

S3 - ORF0063 108 WP_153556032: 50S ribosomal protein L32 [Roseimaritima sp. JC651]

52.8 10

S3 - ORF0065 3069 WP_165225646: protein kinase [Aquisphaera sp.

JC669]

71.5 10

S3 - ORF0066 96 -

S3 - ORF0068 1203 WP_008662881: hypothetical protein [Rhodopirellula europaea]

74.1 10

S3 - ORF0069 1233 WP_145282472: PQQ-binding-like beta-propeller repeat protein [Planctomycetes bacterium Mal33]

75.9 10

S3 - ORF0072 102 -

S3 - ORF0073 1635 WP_145350034: von Willebrand factor type A domain-containing protein [Planctomycetes bacterium FF011L]

76.5 10

S3 - ORF0075 138 -

S3 - ORF0076 462 WP_145342310: low molecular weight phosphotyrosine protein phosphatase [Planctomycetes bacterium EC9]

78.5 10

S3 - ORF0078 276 WP_146592785: hypothetical protein [Planctomycetes bacterium Pla52o]

39.1 7

S3 - ORF0083 1173 WP_008693703: type II and III secretion system protein [Rhodopirellula maiorica]

74.2 10

No hits on MEGABLAST or BLASTn

-S3 - ORF0015 - S3 - ORF0016 - S3 - ORF0027 - S3 - ORF0041 - S3 - ORF0043 - S3 - ORF0046 - S3 - ORF0048

- S3 - ORF0066

Hits on MEGABLAST

ORF name

Length (bp)

Organism Accession Pairwise

identity

972 pCC2FOS fosmid vector

EU140752.1 100 100

S4 - ORF002

1167 pCC2FOS fosmid vector

EU140752.1 100 100

S4 - ORF005

756 pCC2FOS fosmid vector

EU140752.1 100 100

S4 - ORF007

126 pCC2FOS fosmid vector

EU140752.1 100 100

S4 - ORF009

120 pCC2FOS fosmid vector

EU140752.1 100 100

S4 - ORF0010

513 pCC2FOS fosmid vector

EU140752.1 100 100

S4 - ORF0012

660 pCC2FOS fosmid vector

EU140752.1 100 100

S4 - ORF0032

1824 Thermoleptolyngbya sp.

699 Geitlerinema sp. CP003591 71.6 81.26

5.98e-71

681 Oscillatoria acuminata

1920 Thermoleptolyngbya sp.

807 Thermoleptolyngbya sp.

732 Leptolyngbya sp. AP017367 77.3 31.28*** 1.58e-40

258 pCC2FOS fosmid vector

EU140752.1 100 100

S4 - ORF0080

159 pCC2FOS fosmid vector

EU140752.1 100 100

BLASTn search of «No hits» on MEGABLAST

ORF name Length (bp) Identical proteins Grade (%) Number of

results (out of 10)

S4 - ORF0015 1080 NJN29057: aromatic ring-hydroxylating dioxygenase subunit alpha [Synechococcales cyanobacterium RM1_1_8]

92.0 10

S4 - ORF0016 1959 NJM57392: FAD-dependent

oxidoreductase [Synechococcales cyanobacterium RU_4_20]

88.0 10

S4 - ORF0018 114 -

S4 - ORF0019 1827 NJN32773: alpha/beta hydrolase [Synechococcales

cyanobacterium RM1_1_8]

75.4 10

S4 - ORF0020 99 -

S4 - ORF0021 168 -

S4 - ORF0022 1347 BAZ30658: putative nicotinate phosphoribosyltransferase [Cylindrospermum sp. NIES-4074]

83.3 10

S4 - ORF0026 645 TVQ16582:

nicotinate-nucleotide adenylyltransferase [Leptolyngbya sp.

DLM2.Bin15]

73.0 10

S4 - ORF0030 756 NJM57395: NUDIX hydrolase

[Synechococcales cyanobacterium RU_4_20]

88.0 10

S4 - ORF0033 99 -

S4 - ORF0038 285 NJN32796: DUF427

domain-containing protein [Synechococcales

cyanobacterium RM1_1_8]

88.8 10

S4 - ORF0039 825 NJN32797: phytoene/squalene synthase family protein [Synechococcales

cyanobacterium RM1_1_8]

94.9 10

S4 - ORF0040 168 -

S4 - ORF0041 102 -

S4 - ORF0042 942 WP_009768167: M48 family

metalloprotease [Oscillatoriales

S4 - ORF0044 426 WP_058030484: hypothetical protein [Pseudoalteromonas

S4 - ORF0048 399 WP_168570017: DUF2605

domain-containing protein [Oxynema sp. AP17]

70.9 10

S4 - ORF0049 333 NJN31526: DUF2973

domain-containing protein [Synechococcales

cyanobacterium RM1_1_8]

65.1 10

S4 - ORF0051 147 -

S4 - ORF0052 1302 NJN30652: response regulator [Synechococcales

cyanobacterium RM1_1_8]

72.6 10

S4 - ORF0053 414 -

S4 - ORF0055 1233 WP_068514664: FIST

C-terminal domain-containing protein [Leptolyngbya sp. O-77]

80.0 10

S4 - ORF0056 228 WP_066349091: Calvin cycle protein CP12 [Geminocystis sp.

NIES-3708]

84.9 10

S4 - ORF0058 603 WP_068789457: DUF3177

family protein [Phormidium willei]

78.9 10

S4 - ORF0061 237 NJM48050: hypothetical protein [Alkalinema sp. RU_4_3]

80.1 10

S4 - ORF0064 822 WP_068510336:

FAD-dependent oxidoreductase [Leptolyngbya sp. O-77]

81.1 10

S4 - ORF0067 108 -

S4 - ORF0069 426 WP_162398825: hypothetical protein [Nostoc sp. B(2019)]

72.5 10

S4 - ORF0072 156 -

S4 - ORF0073 537 NJL85273: porin family protein [Leptolyngbyaceae

cyanobacterium SM1_1_3]

68.0 10

S4 - ORF0074 1218 WP_146133632:

tetratricopeptide repeat protein, partial [filamentous

cyanobacterium Phorm 46]

60.2 10

S4 - ORF0075 99 -

S4 - ORF0076 1101 NJR68231: signal peptidase I [Synechococcales

cyanobacterium CRU_2_2]

71.3 10

S4 - ORF0078 975 NJN32371: nuclear transport factor 2 family protein [Synechococcales

cyanobacterium RM1_1_8]

70.4 10

No hits on Megablast or Blastn

- S4 - ORF0018 - S4 - ORF0020 - S4 - ORF0021 - S4 - ORF0033 - S4 - ORF0040 - S4 - ORF0041 - S4 - ORF0051 - S4 - ORF0053 - S4 - ORF0067 - S4 - ORF0072 - S4 - ORF0075

Sequence 5

Total ORFs Hits No hits

MEGABLAST 62 59 3

BLASTn 62 56 6

Hits on MEGABLAST

ORF name

Length (bp)

Organism Accession Pairwise identity

S5 -

recognition particle

S5 -

S5 -

repair protein RecN

S5 - ORF0032

141 Klebsiella sp. CP056483 100 100

3.79e-64

WP_160886650:

hypothetical protein

S5 - family protein

S5 - ORF0037

477 Klebsiella sp. CP056483 100 100 0 WP_112217068: type

II toxin-antitoxin system RatA family toxin

S5 -

1821 Serratia marcescens

AP021873 92.0 100 0 WP_163525672:

DUF4365 domain-containing protein

S5 - ORF0047

1446 Cronobacter malonaticus

CP013940 78.2 96.89 0 WP_163525673:

SIR2 family protein

S5 - ORF0048

501 Cronobacter malonaticus

1479 Salmonella enterica

CP053332 82.4 100 0 WP_163525675:

relaxase/mobilization nuclease domain-containing protein S5 -

ORF0050

372 Klebsiella grimontii

270 Raoultella ornithinolytica

CP038281 93.7 100

1.73e-110

WP_163525677:

hypothetical protein

S5 - ORF0052

153 Raoultella ornithinolytica

CP038281 96.7 100

1.47e-63

WP_169050475:

hypothetical protein

S5 - ORF0053

303 Raoultella ornithinolytica

CP038281 97.4 100

7.85e-141

WP_163525678:

hypothetical protein

S5 - ORF0056

276 Raoultella ornithinolytica

CP038281 100 100

4.45e-137

WP_063407905:

hypothetical protein S5 -

hypothetical protein S5 -

hypothetical protein S5 -

AlpA family phage regulatory protein S5 -

S5 - transposase A S5 -

ORF0087

513 Klebsiella sp. CP056483 100 100 0 QLO35724: IS3

family transposase S5 -

ORF0088

249 Klebsiella sp. CP056483 100 100

1.79e-122

VGA88629: integrase catalytic subunit S5 -

171 Escherichia coli

CP056263 100 72.51

8.25e-55

BLASTn search of «No hits» on MEGABLAST

ORF NAME LENGTH (BP) IDENTICAL

PROTEINS

GRADE (%) NUMBER OF

RESULTS (OUT OF 10)

S5 - ORF0046 99 -

S5 - ORF0062 105 -

S5 - ORF0069 102 EAA4525289:

hypothetical protein

47.7 2

No hits on Megablast or Blastn

- S5 - ORF0021 - S5 - ORF0039 - S5 - ORF0046 - S5 - ORF0062 - S5 - ORF0067 - S5 - ORF0068

Sequence 6

Total ORFs Hits No hits

MEGABLAST 57 11 46

BLASTn 57 40 17

Hits on MEGABLAST

ORF name Length (bp)

Organism Accession Pairwise identity

855 pCC2FOS fosmid vector

EU140752.1 100 100

S6 - ORF002

1167 pCC2FOS fosmid vector

EU140752.1 100 100

S6 - ORF005

756 pCC2FOS fosmid vector

EU140752.1 100 100

S6 - ORF006

126 pCC2FOS fosmid vector

EU140752.1 100 100

S6 - ORF007

183 pCC2FOS fosmid vector

EU140752.1 100 100

S6 - ORF008

513 pCC2FOS fosmid vector

EU140752.1 100 100

S6 - ORF009

660 pCC2FOS fosmid vector

EU140752.1 100 100

S6 - ORF0016

1491 Halomicronema hongdechloris

585 Geobacter anodireducens

CP014963 67.10 74.36

6.90e-25

WP_153293456:

zeta toxin family protein

102 pCC2FOS fosmid vector

EU140752.1 100 100

BLASTn search of «No hits» on MEGABLAST

ORF name Length (bp) Identical proteins Grade (%) Number of

results (out of 10)

S6 - ORF0012 528 -

S6 - ORF0013 174 AFY93165: hypothetical

protein Cha6605_2070 [Chamaesiphon minutus PCC 6605]

44.0 7

S6 - ORF0014 3393 NEQ48243: hypothetical

protein [Leptolyngbya sp.

SIOISBB]

68.6 10

S6 - ORF0018 612 -

S6 - ORF0019 120 NJR70312: hypothetical

protein [Synechococcales cyanobacterium CRU_2_2]

60.3 1

S6 - ORF0020 540 NJN32002: hypothetical

protein [Synechococcales cyanobacterium RM1_1_8]

55.2 10

S6 - ORF0021 261 NJR67456: hypothetical

protein [Synechococcales cyanobacterium CRU_2_2]

83.1 1

S6 - ORF0023 399 NJN31714: hypothetical

protein [Synechococcales cyanobacterium RM1_1_8]

80.7 1

S6 - ORF0024 105 -

S6 - ORF0025 690 NJN31713: hypothetical

protein [Synechococcales cyanobacterium RM1_1_8]

70.6 2

S6 - ORF0027 216 WP_008317590: hypothetical protein [Leptolyngbya sp. PCC 6406]

49.7 10

S6 - ORF0030 1125 NJN31710: DGQHR [Synechococcus sp. PCC 7336]

49.6 10

S6 - ORF0036 534 NJN32748: hypothetical

protein [Synechococcales cyanobacterium RM1_1_8]

69.2 10

S6 - ORF0037 762 NEQ28405: ISKra4 family

transposase [Microcoleus sp.

SIO2G3]

85.6 10

S6 - ORF0038 210 NJR71155: hypothetical

protein [Synechococcales cyanobacterium CRU_2_2]

51.5 10

S6 - ORF0040 405 NJN32748: hypothetical

protein [Synechococcales cyanobacterium RM1_1_8]

81.7 3

S6 - ORF0042 864 NJN32749: hypothetical

protein [Synechococcales cyanobacterium RM1_1_8]

75.4 10

S6 - ORF0043 552 NJN32750: hypothetical

protein [Synechococcales

S6 - ORF0049 213 RPI65896:

protein-L-isoaspartate

O-33.1 3

methyltransferase, partial [Ignavibacteriae bacterium]

S6 - ORF0052 375 NJM58486: hypothetical

protein [Synechococcales cyanobacterium RU_4_20]

34.0 1

S6 - ORF0054 1089 NJM58485:

relaxase/mobilization nuclease domain-containing protein [Synechococcales cyanobacterium RU_4_20]

73.2 10

S6 - ORF0058 1533 NJN30180: hypothetical protein [Synechococcales cyanobacterium RM1_1_8]

87.7 10

S6 - ORF0060 258 -

S6 - ORF0062 390 NJR70879: hypothetical

protein [Synechococcales cyanobacterium CRU_2_2]

50.8 2

S6 - ORF0063 603 NJR70880: hypothetical

protein [Synechococcales cyanobacterium CRU_2_2]

87.1 10

S6 - ORF0064 96 -

S6 - ORF0065 96 -

S6 - ORF0066 186 NJN30184: hypothetical

protein [Synechococcales cyanobacterium RM1_1_8]

63.4 2

S6 - ORF0068 255 NJN30185: hypothetical

protein [Synechococcales cyanobacterium RM1_1_8]

83.3 2

S6 - ORF0069 171 -

S6 - ORF0070 228 -

S6 - ORF0071 234 -

S6 - ORF0072 321 NJN30946: hypothetical protein [Synechococcales cyanobacterium RM1_1_8]

83.5 3

S6 - ORF0073 126 -

S6 - ORF0074 129 -

S6 - ORF0075 1803 NJM58305: hypothetical

protein [Synechococcales cyanobacterium RU_4_20]

87.3 10

S6 - ORF0077 204 -

S6 - ORF0079 99 -

S6 - ORF0081 192 WP_144969979:

4a-hydroxytetrahydrobiopterin dehydratase [Bremerella volcania]

86.5 10

S6 - ORF0082 168 -

S6 - ORF0083 273 WP_166276346:

acetyltransferase [Aphanocapsa montana]

70.7 10

No hits on MEGABLAST or BLASTn

- S6 - ORF0012 - S6 - ORF0018 - S6 - ORF0024 - S6 - ORF0045 - S6 - ORF0047 - S6 - ORF0048 - S6 - ORF0060

- S6 - ORF0064 - S6 - ORF0065 - S6 - ORF0069 - S6 - ORF0070 - S6 - ORF0071 - S6 - ORF0073 - S6 - ORF0074 - S6 - ORF0077 - S6 - ORF0079 - S6 - ORF0082

Sequence 7 – contigs 1, 2, and 3

Total ORFs Hits No hits

MEGABLAST 44 23 21

BLASTn 44 39 5

Hits on MEGABLAST

ORF name

Length (bp)

Organism Accession Pairwise

identity

1023 Planctomycetes bacterium

1758 Rhodopirellula baltica

1089 Planctomycetes bacterium

261 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF009

120 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0010

99 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0011

102 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0012

975 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0014

1161 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0015

96 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0016

93 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0017

126 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0019

756 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0020

147 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0021

117 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0022

330 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0023

660 pCC2FOS fosmid vector

EU140752.1 100 100

CONTIG 2 S7 - ORF0028

1704 Rhodopirellula baltica

1323 Planctomycetes bacterium

WP_167546599 47.1 88.89 1.61e-110

558 Rhodopirellula europaea

WP_037251127 84.9 99.46 4.94e-112

666 Planctomycetaceae bacterium

5388 Verrucomicrobia bacterium

PYI87545 43.5 81.85 0 hypothetical protein DME26_05830, partial

BLASTn search of «No hits» on MEGABLAST

ORF name Length (bp) Identical proteins Grade (%) Number of

results (out of 10)

S7 - ORF001 660 WP_150074029: mucoidy

inhibitor MuiA family protein [Rhodopirellula sp.

JC645]

73.2 10

S7 - ORF002 102 -

S7 - ORF003 975 EMI22299: secreted protein

[Rhodopirellula maiorica SM1]

68.8 10

S7 - ORF005 2433 TWU45271: Periplasmic

beta-glucosidase precursor [Planctomycetes bacterium Q31b]

91.7 10

S7 - ORF007 1497 WP_009100429: TRAP

transporter large permease [Rhodopirellula sp. SWK7]

90.2 10

S7 - ORF009 462 WP_008673152: TRAP

transporter small permease [Rhodopirellula sallentina]

83.3 10

S7 - ORF0011 1071 WP_009100433: TRAP

transporter substrate-binding protein [Rhodopirellula sp.

SWK7]

[Planctomycetes bacterium Poly21]

S7 - ORF0023 1047 WP_146407289: biotin

synthase BioB

S7 - ORF0030 537 TWT74243: bifunctional

nicotinamide

S7 - ORF0032 1152 EMI42754: hypothetical

protein RRSWK_04943 [Rhodopirellula sp. SWK7]

79.4 10

1716 WP_081796966: tandem-95

repeat protein [Bacillus ndiopicus]

78.7 10

CONTIG 2 S7 - ORF0031

930 WP_146392717:

hypothetical protein [Rhodopirellula solitaria]

79.2 10

CONTIG 2 S7 - ORF0034

402 WP_146579500:

hypothetical protein

No hits on MEGABLAST or BLASTn

- S7 - ORF002 - S7 - ORF0015 - S7 - ORF0018 - S7 - ORF0033 - C3 S7- orf003

Sequence 8

Total ORFs Hits No hits

MEGABLAST 41 36 5

BLASTn 41 41 0

Hits on MEGABLAST

ORF name Length (bp)

Organism Accession Pairwise identity

2193 Sorangium cellulosum

2565 Rhodothermaceae bacterium

1419 Candidatus Snodgrassella

159 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0011

99 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0013

114 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0014

114 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0017

105 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0019

126 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0023

972 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0028

1167 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0029

102 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0031

99 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0032

129 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0034

105 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0035

93 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0040

756 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0041

96 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0042

96 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0043

108 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0044

108 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0047

120 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0048

144 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0052

513 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0053

105 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0054

105 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0055

135 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0056

195 pCC2FOS fosmid vector

EU140752.1 100 100

S8 - ORF0060

153 pCC2FOS fosmid vector beta-propeller fold lactonase family protein

S8 - ORF0068

639 Azotobacter chroococcum

1506 Pseudomonas oryzae

2895 Bradymonadales bacterium

1248 Gemmatirosa kalamazoonesis

2127 Gemmatirosa kalamazoonesis

CP007128 71.80 21.72 1.55e-52

HIG75890.1: sigma-70 family RNA polymerase sigma factor***

S8 - ORF0080

1623 Luteitalea pratensis

BLASTn search of «No hits» on MEGABLAST

ORF NAME LENGTH (BP) IDENTICAL

PROTEINS

GRADE (%) NUMBER OF

RESULTS (OUT OF 10)

S8 - ORF0064 1923 WP_143097359: S41

family peptidase [Myxococcus fulvus]

58.2 10

S8 - ORF0066 2973 WP_094548663:

TonB-dependent receptor [Rubricoccus marinus]

69.9 10

S8 - ORF0067 1455 NOS85174: RNA

polymerase sigma factor [Ignavibacteria

bacterium]

39.7 10

S8 - ORF0074 1587 WP_095514951:

serine/threonine protein kinase [Rubrivirga sp.

SAORIC476]

70.1 10

S8 - ORF0077 2310 MYF63479:

sulfatase-like

hydrolase/transferase [Rhodothermaceae bacterium]

43.3 10

No hits on MEGABLAST or BLASTn: 0

Sequence 9

Total ORFs Hits No hits

MEGABLAST 40 24 16

BLASTn 40 39 1

Hits on MEGABLAST

ORF name

Length (bp)

Organism Accession Pairwise identity

102 pCC2FOS fosmid vector

972 pCC2FOS fosmid vector

1176 pCC2FOS fosmid vector

756 pCC2FOS fosmid vector

96 pCC2FOS fosmid vector

126 pCC2FOS fosmid vector

120 pCC2FOS fosmid vector

513 pCC2FOS fosmid vector

660 pCC2FOS fosmid vector

EU140752.

1

100 100

S9 - ORF0017

2334 Granulosicoccus antarcticus

CP018632 68.30 45.59**

*

216 Roseobacter litoralis CP002623 80.60 99.54 6.4 6e-45

MPT24899.1:

acetamidase/formamida se family protein S9 - se family protein S9 - ATP-binding protein UrtD S9 -

ORF0031

1227 Marinobacter hydrocarbonoclastic permease subunit UrtC S9 -

ORF0032

927 Bradyrhizobium guangdongense permease subunit UrtB S9 -

ORF0035

1206 Confluentimicrobium sp.

CP010869 77.70 94.69 0 WP_153772203.1: urea ABC transporter substrate-binding protein

S9 - ORF0038

3435 Labrenzia sp. CP045380 67.20 37.73**

*

1803 Granulosicoccus antarcticus

CP018632 75.30 99.67 0 ASJ74785.1: putative oxidoreductase CzcO S9 -

ORF0049

549 Granulosicoccus antarcticus

1854 Granulosicoccus antarcticus

CP018632 74.80 97.09 0 NND91485.1:

excinuclease ABC subunit UvrC

S9 - ORF0052

972 Granulosicoccus antarcticus

1038 Granulosicoccus antarcticus

201 Granulosicoccus antarcticus

CP018632 75.80 96.52 1.1 4e-28

NND91481.1: 50S ribosomal protein L32

S9 -

terminase small subunit

BLASTn search of «No hits» on MEGABLAST

ORF name Length (bp) Identical proteins Grade (%) Number of

results (out of 10)

S9 - ORF0019 765 WP_088920696: sulfite

exporter TauE/SafE family protein [Granulosicoccus antarcticus]

80.4 10

S9 - ORF0020 597 WP_125926863:

MULTISPECIES: LEA type 2 family protein

[Pseudomonas]

65.5 10

S9 - ORF0023 186 WP_155846134: hypothetical

protein [Celeribacter ethanolicus]

46.8 10

S9 - ORF0024 1395 RZO33341: ammonium

transporter [SAR116 cluster bacterium]

86.1 10

S9 - ORF0025 1818 HIC47449: transporter substrate-binding

domain-72.4 10

containing protein [Methylophaga sp.]

S9 - ORF0026 225 WP_136385948: zinc ribbon

domain-containing protein [Azoarcus sp. CC-YHH848]

49.3 10

S9 - ORF0044 498 OED37980: hypothetical

protein AB833_21610 [Chromatiales bacterium (ex Bugula neritina AB1)]

78.3 10

S9 - ORF0045 153 -

S9 - ORF0046 144 WP_146366491: ABC

transporter permease [Litoreibacter sp. LN3S51]

47.7 10

S9 - ORF0047 282 WP_142905356:

helix-turn-helix domain-containing protein [Exilibacterium tricleocarpae]

87.1 10

S9 - ORF0048 1260 RLA50911: type II toxin-antitoxin system HipA family toxin [Gammaproteobacteria

transcription factor [Stenotrophobium rhamnosiphilum]

S9 - ORF0056 582 WP_157736169: DUF177

domain-containing protein [Granulosicoccus antarcticus]

76.6 10

No hits on MEGABLAST or BLASTn:

- S9 - ORF0045

Sequence 10

Total ORFs Hits No hits

MEGABLAST 56 24 32

BLASTn 56 46 10

Hits on MEGABLAST

ORF name Length (bp)

Organism Accession Pairwise identity (%)

Query coverage (%)

E-value

Protein

S10 - ORF001

972 pCC2FOS

fosmid vector

EU140752.1 100 100

S10 - ORF002

1158 pCC2FOS fosmid vector

EU140752.1 100 100

S10 - ORF003

102 pCC2FOS

fosmid vector

EU140752.1 100 100

S10 -

624 Uncultured bacterium

MG458673 93.7 17.63*** 3.58e-35

WP_007329889:

flagellar basal body P-ring formation protein FlgA S10 -

ORF0013

804 Rhodopirellula baltica

BX294154 77.5 99.75

1.42e-174

MAP08669:

flagellar basal-body rod protein FlgG [Rhodopirellula sp.]

S10 - ORF0026

2106 Rhodopirellula baltica

381 Rhodopirellula baltica

BX294151 81.0 86.88

5.25e-81

WP_009099718:

DUF3467 domain-containing protein S10 -

ORF0036

2238 Rhodopirellula baltica

BX294151 77.4 99.91 0 TWT56412:

putative peptide zinc metalloprotease protein YydH [Rhodopirellula solitaria]

S10 - ORF0040

2052 Rhodopirellula baltica

BX294151 73.8 98.98 0 WP_008679296:

HlyD family efflux transporter periplasmic adaptor subunit

S10 - ORF0047

1452 Rhodopirellula baltica

BX294144 74.1 99.04 0 WP_083904879:

RNA polymerase factor sigma-54 S10 -

ORF0054

1881 Rhodopirellula baltica

BX294144 77.1 58.37** 0 WP_146389752:

DNA polymerase III subunit gamma/tau S10 -

ORF0061

1032 Rhodopirellula baltica

3492 Rhodopirellula baltica

BX294143 74.5 72.37 0 EMI45813:

transcription-repair coupling factor S10 -

ORF0069

1041 Rhodopirellula baltica

1005 Rhodopirellula baltica

579 Rhodopirellula baltica

BX294141 75.3 71.50

5.59e-64

WP_008681921:

Flp family type IVb pilin

BLASTn search of «No hits» on MEGABLAST

ORF name Length (bp) Identical proteins Grade (%) Number of results (out of 10)

S10 - ORF0014 753 WP_009102573:

flagellar hook basal-body protein [Rhodopirellula sp.

SWK7]

92.6 10

S10 - ORF0017 465 WP_044256526:

hypothetical protein [Rhodopirellula sp.

SWK7]

92.5 10

S10 - ORF0019 96 -

S10 - ORF0020 159 -

S10 - ORF0021 120 -

S10 - ORF0023 522 TWT87892:

Inosine-5'-monophosphate dehydrogenase [Planctomycetes bacterium Pla100]

90.2 10

S10 - ORF0025 1701 WP_146407640:

hypothetical protein [Planctomycetes bacterium Poly21]

78.0 10

S10 - ORF0027 723 WP_044302358:

ATP-binding cassette domain-containing protein [Rhodopirellula sallentina]

93.1 10

S10 - ORF0028 93 -

S10 - ORF0031 93 -

S10 - ORF0032 729 EMI55677: protein

containing DUF556 [Rhodopirellula sallentina SM41]

72.0 10

S10 - ORF0038 945 WP_085981221: efflux

[Planctomycetes

pilus assembly protein CpaB [Rhodopirellula sp. SWK7]

86.3 10

S10 - ORF0094 1785 WP_009100158: pilus

assembly protein

S10 - ORF0097 561 WP_009100154:

MinD/ParA family protein [Rhodopirellula sp. SWK7]

90.2 10

No hits on MEGABLAST or BLASTn

- S10 - ORF0019 - S10 - ORF0020 - S10 - ORF0021 - S10 - ORF0028 - S10 - ORF0031 - S10 - ORF0041 - S10 - ORF0048 - S10 - ORF0075 - S10 - ORF0089 - S10 - ORF0096