CuGenDBv2

Chy2G044970 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy2G044970
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: Replication protein A 32 kDa subunit B-like
Genome location: chrH02:26367183..26377909
RNA-Seq Expression: Chy2G044970
Synteny: Chy2G044970
Gene Ontology terms:
GO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0006260 - DNA replication (biological process)
GO:0006289 - nucleotide-excision repair (biological process)
GO:0000781 - chromosome, telomeric region (cellular component)
GO:0005662 - DNA replication factor A complex (cellular component)
GO:0035861 - site of double-strand break (cellular component)
GO:0003697 - single-stranded DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domains:
IPR004365 - OB-fold nucleic acid binding domain, AA-tRNA synthetase-type
IPR012340 - Nucleic acid-binding, OB-fold
IPR014892 - Replication protein A, C-terminal
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily
IPR040260 - Replication factor A protein-like
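
The genome location given above can be split into a chromosome name and coordinates for downstream use. The sketch below is a minimal illustration and not part of the CuGenDBv2 record: the names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1


chrom, start, end = parse_location("chrH02:26367183..26377909")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.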


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6589705.1 Replication protein A 32 kDa subunit B, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.18e-248; identity: 79.66%)
Query:  LRLSEKPFSLLQANQLFVHKILSRNSSFGRSARLPACGLPGQVPFNWEAQPGLPKNQPSDNPPPAELPPPSSSALGLSKTPHVVPKQAPV-KIWFWNKQR
        L LSE PFSLLQAN+LFVHKILSRNSS GRSARLPAC LPGQ+PF WEAQPGLPK++PS+N    + PP  +S  G SKTPH VP+Q    KIWFW K R
Subjt:  LRLSEKPFSLLQANQLFVHKILSRNSSFGRSARLPACGLPGQVPFNWEAQPGLPKNQPSDNPPPAELPPPSSSALGLSKTPHVVPKQAPV-KIWFWNKQR

Query:  RKSRRAVKKSGALGSSSP-RRHVDRHQSEFCRKKSNDESSSSLSCISYSLSSNSSSASSSDRYNNRRRSKLAIRPYSMEIYGSHILHSYRYTGRTMYASQ
        +K R  VKK+GA GSSS  RRHV++ +SEF RK SNDESSSSLSCIS+SLSS   S+SSS+ Y  RRRSKL+             L      GRTMY SQ
Subjt:  RKSRRAVKKSGALGSSSP-RRHVDRHQSEFCRKKSNDESSSSLSCISYSLSSNSSSASSSDRYNNRRRSKLAIRPYSMEIYGSHILHSYRYTGRTMYASQ

Query:  FDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRIDCSKWV
        FDGNAAFSGGGFMPSQTT APDHS SPAKNRDVQALLPLTVKQINDAFLSSDDKSNFV+DGVDVNNVKLVGMVRNRA RITDVTFALDDGTGRIDCSKWV
Subjt:  FDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRIDCSKWV

Query:  NEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIANQYTGQ
        NEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEIT+HFIESIYVHFYNTRLRKQQSSS+T QPQM NLSNTP++ YQ PI+NQYTGQ
Subjt:  NEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIANQYTGQ

Query:  AGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        AGG+ WK+LEQMVLDFLQLPSCD+ERGAHRDVIAQ L VPLEKL+PAMKNLEEEGLIYSTTDD+HFKSTANG
Subjt:  AGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

XP_008453602.1 PREDICTED: replication protein A 32 kDa subunit B-like [Cucumis melo] (e value: 9.51e-191; identity: 97.47%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRA RITDVTFALDDGTGRID
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN
        CSKWVNEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSS+TTQPQMTNLSNTP+K YQAPIAN
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN

Query:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        QYTGQAGG+SWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDD+HFKSTANG
Subjt:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

XP_011656845.1 replication protein A 32 kDa subunit B [Cucumis sativus] (e value: 7.08e-193; identity: 98.56%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRA RITDVTFALDDGTGRID
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN
        CSKWVNEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSS+TTQPQMTNLSNTPMKVYQAPIAN
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN

Query:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        QYTGQAGG+SWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
Subjt:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

XP_022988280.1 replication protein A 32 kDa subunit B [Cucurbita maxima] (e value: 3.58e-181; identity: 91.7%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MY SQFDGNAAFSGGGFMPSQTT APDHS SPAKNRDVQALLPLTVKQINDAFLSSDDKSNFV+DGVDVNNVKLVGMVRNRA RITDVTFALDDGTGRID
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN
        CSKWVNEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEIT+HFIESIYVHFYNTRLRKQQSSS+T QPQM NLSNTP++ YQ P++N
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN

Query:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        QYTGQAGG+ WK+LEQMVLDFLQLPSCD+ERGAHRDVIAQ LKVPLEKL+PAMKNLEEEGLIYSTTDD+HFKSTANG
Subjt:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

XP_038880298.1 replication protein A 32 kDa subunit B [Benincasa hispida] (e value: 1.28e-183; identity: 94.58%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MY SQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRA RITDVTFALDDGTG ID
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN
        CSKWVNEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEIT+HFIESIYVHFYNTRLRKQQ SS+T QPQM NLSNTPMK YQAP++N
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN

Query:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        QYTGQAGG+ WKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDD+HFKSTANG
Subjt:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A1S4DZE3 replication protein A 32 kDa subunit B-like (e value: 1.9e-151; identity: 97.47%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRA RITDVTFALDDGTGRID
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN
        CSKWVNEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSS+TTQPQMTNLSNTP+K YQAPIAN
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN

Query:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        QYTGQAGG+SWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDD+HFKSTANG
Subjt:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

A0A5D3CIS3 Replication protein A 32 kDa subunit B-like (e value: 1.9e-151; identity: 97.47%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRA RITDVTFALDDGTGRID
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN
        CSKWVNEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSS+TTQPQMTNLSNTP+K YQAPIAN
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN

Query:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        QYTGQAGG+SWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDD+HFKSTANG
Subjt:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

A0A6J1E235 replication protein A 32 kDa subunit B-like isoform X2 (e value: 1.8e-141; identity: 90.97%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MY SQFDGNAAFSGGGFMPSQTT APDHS SPAKNRDVQALLPLTVKQINDAFLSSDDKSNFV+DGVDVNNVKLVGMVRNRA RITDVTFALDDGTG+ID
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN
        CSKWVNEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEIT+HFIESIYVHFYNTRLR QQSSS+T QPQM NLSNTP++ YQ PI+N
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN

Query:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        QYTGQAGG+ WK+LEQMVLDFLQLPSCD+ERGAHRDVIAQ L VPLEKL+PAMKNLEEEGLIYSTTDD+HFKSTANG
Subjt:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

A0A6J1E347 replication protein A 32 kDa subunit B-like isoform X1 (e value: 1.9e-143; identity: 91.34%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MY SQFDGNAAFSGGGFMPSQTT APDHS SPAKNRDVQALLPLTVKQINDAFLSSDDKSNFV+DGVDVNNVKLVGMVRNRA RITDVTFALDDGTG+ID
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN
        CSKWVNEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEIT+HFIESIYVHFYNTRLRKQQSSS+T QPQM NLSNTP++ YQ PI+N
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN

Query:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        QYTGQAGG+ WK+LEQMVLDFLQLPSCD+ERGAHRDVIAQ L VPLEKL+PAMKNLEEEGLIYSTTDD+HFKSTANG
Subjt:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

A0A6J1JJ51 replication protein A 32 kDa subunit B (e value: 3.8e-144; identity: 91.7%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MY SQFDGNAAFSGGGFMPSQTT APDHS SPAKNRDVQALLPLTVKQINDAFLSSDDKSNFV+DGVDVNNVKLVGMVRNRA RITDVTFALDDGTGRID
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN
        CSKWVNEAADSNEVEGILDG+YVRVHGHLKSFQGKRTLNVFSIRPVTDYNEIT+HFIESIYVHFYNTRLRKQQSSS+T QPQM NLSNTP++ YQ P++N
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIAN

Query:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
        QYTGQAGG+ WK+LEQMVLDFLQLPSCD+ERGAHRDVIAQ LKVPLEKL+PAMKNLEEEGLIYSTTDD+HFKSTANG
Subjt:  QYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q5RC43 Replication protein A 32 kDa subunit (e value: 1.9e-23; identity: 30.14%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNR-------DVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALD
        M+ S F+   + S GG      TQ+P    SPA ++         Q ++P T+ Q+  A L  +    F I  V+++ V +VG++R+  +  T++ + +D
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNR-------DVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALD

Query:  DGTGR-IDCSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMK
        D T   +D  +WV+    S+E   +    YV+V GHL+SFQ K++L  F I P+ D NE T H +E I  H   ++   Q S+                 
Subjt:  DGTGR-IDCSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMK

Query:  VYQAPIANQYTGQAGGESWKSL---------EQMVLDFLQLPSCDSERGAHRDVIAQQLK-VPLEKLIPAMKNLEEEGLIYSTTDDFHFKST
          +API+N    +AG     S          +  VL+ ++  +C    G +   +  QLK + +  +  AM  L  EG IYST DD HFKST
Subjt:  VYQAPIANQYTGQAGGESWKSL---------EQMVLDFLQLPSCDSERGAHRDVIAQQLK-VPLEKLIPAMKNLEEEGLIYSTTDDFHFKST

Q6H7J5 Replication protein A 32 kDa subunit B (e value: 2.8e-51; identity: 38.57%)
Query:  GNAA--FSGGGFMPSQTTQAPDHSFSPA----KNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRIDC
        GNA+  F GGGFMPSQ T A + +        K+R+ QALLPLTVKQI DA  ++DDKSNF ++G++V+ V+LVG + N+ +R+TDV+F LDDGTGR+  
Subjt:  GNAA--FSGGGFMPSQTTQAPDHSFSPA----KNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRIDC

Query:  SKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIANQ
        ++W N++ D+ E+  I +G YV V+G LK FQGKR +  +S+R +T++N++T+HF+  ++VH   TR + Q +++  T      +    M   Q+P+ NQ
Subjt:  SKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIANQ

Query:  ---YTGQAGGESWKSLEQMVLDFLQLPS-CDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG
           ++      +  ++  +VL+    P+  + + G   D ++++L +P E +   + +  + G +Y+T DD H+KST NG
Subjt:  ---YTGQAGGESWKSLEQMVLDFLQLPS-CDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANG

Q6K9U2 Replication protein A 32 kDa subunit A (e value: 9.6e-44; identity: 39.43%)
Query:  AFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDA---FLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRIDCSKWVNE
        AFS   F  SQ   A   S +P+K+R   + +PLTVKQI++A    ++ +  + FV+DGV+  NV+LVG+V  + ER TDV+F +DDGTGR+D  +WVN+
Subjt:  AFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDA---FLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRIDCSKWVNE

Query:  AADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSN---------TPMKVYQAPI
         ADS E   + +G+YV V G LK  Q ++    F+IRPVTDYNE+T HFI+ + +H  NT  + Q  S   T   M + S+         T +K   AP+
Subjt:  AADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSN---------TPMKVYQAPI

Query:  ANQYTGQAGGESWKSLEQMVLDFLQLP-SCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTAN
         +   G     S   L   VL+  + P + +SE G H D I ++ ++P  K+  A+  L + G IYST D+ H+KS  N
Subjt:  ANQYTGQAGGESWKSLEQMVLDFLQLP-SCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTAN

Q8LFJ8 Replication protein A 32 kDa subunit B (e value: 5.1e-69; identity: 49.64%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MY   FDGNAAF+GGGFMPSQ T     S S  KNRDV+ LLPLT+KQ++ A  S+  +SNF IDGVD+  V +VG +     RIT V F +DDGTG +D
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLR--KQQSSSVTTQPQMT-NLSNTPMKVYQAP
        C +W +   ++ E+E +  G+YVR+HGHLK FQGKR++NVFS+RPVTD+NEI +HF E +YVH YNT+LR       + T +PQM  +   TP K YQ  
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLR--KQQSSSVTTQPQMT-NLSNTPMKVYQAP

Query:  IANQYTGQAGGESWKSLEQMVLDFLQLP-SCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTAN
         +NQ+  Q   +S   ++Q VL++L  P    SE G H D+IA++L++PL ++  A++ L  +G IYST D+  FKSTAN
Subjt:  IANQYTGQAGGESWKSLEQMVLDFLQLP-SCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTAN

Q9ZQ19 Replication protein A 32 kDa subunit A (e value: 2.5e-68; identity: 48.91%)
Query:  ASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAE-RITDVTFALDDGTGRIDC
        +SQF+ N+ FSGGGFM SQ +QA + S S AKNRD Q L+P+TVKQI + F SS +KS  VI+G+ + NV LVG+V ++ E ++T+V F LDDGTGRIDC
Subjt:  ASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAE-RITDVTFALDDGTGRIDC

Query:  SKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMT--NLSNTPMKVYQAPIA
         +WV+E  D+ E+E + DG YVR+ GHLK+FQGK  L VFS+RP+ D+NE+T H+IE I+ +  N+  ++QQ   VT     T    SNT       P+ 
Subjt:  SKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMT--NLSNTPMKVYQAPIA

Query:  NQYTGQAGGESWKSLEQMVLDFLQLPSCDS-ERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFK
        +       G   K+L+ M+LD+L+ P+C + ++G H D IAQQLK+P  KL   +++LE +GLIYST D++HFK
Subjt:  NQYTGQAGGESWKSLEQMVLDFLQLPSCDS-ERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFK

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT2G24490.1 replicon protein A2 (e value: 1.8e-69; identity: 48.91%)
Query:  ASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAE-RITDVTFALDDGTGRIDC
        +SQF+ N+ FSGGGFM SQ +QA + S S AKNRD Q L+P+TVKQI + F SS +KS  VI+G+ + NV LVG+V ++ E ++T+V F LDDGTGRIDC
Subjt:  ASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAE-RITDVTFALDDGTGRIDC

Query:  SKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMT--NLSNTPMKVYQAPIA
         +WV+E  D+ E+E + DG YVR+ GHLK+FQGK  L VFS+RP+ D+NE+T H+IE I+ +  N+  ++QQ   VT     T    SNT       P+ 
Subjt:  SKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMT--NLSNTPMKVYQAPIA

Query:  NQYTGQAGGESWKSLEQMVLDFLQLPSCDS-ERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFK
        +       G   K+L+ M+LD+L+ P+C + ++G H D IAQQLK+P  KL   +++LE +GLIYST D++HFK
Subjt:  NQYTGQAGGESWKSLEQMVLDFLQLPSCDS-ERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFK

AT2G24490.2 replicon protein A2 (e value: 1.8e-69; identity: 48.91%)
Query:  ASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAE-RITDVTFALDDGTGRIDC
        +SQF+ N+ FSGGGFM SQ +QA + S S AKNRD Q L+P+TVKQI + F SS +KS  VI+G+ + NV LVG+V ++ E ++T+V F LDDGTGRIDC
Subjt:  ASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAE-RITDVTFALDDGTGRIDC

Query:  SKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMT--NLSNTPMKVYQAPIA
         +WV+E  D+ E+E + DG YVR+ GHLK+FQGK  L VFS+RP+ D+NE+T H+IE I+ +  N+  ++QQ   VT     T    SNT       P+ 
Subjt:  SKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMT--NLSNTPMKVYQAPIA

Query:  NQYTGQAGGESWKSLEQMVLDFLQLPSCDS-ERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFK
        +       G   K+L+ M+LD+L+ P+C + ++G H D IAQQLK+P  KL   +++LE +GLIYST D++HFK
Subjt:  NQYTGQAGGESWKSLEQMVLDFLQLPSCDS-ERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFK

AT3G02920.1 Replication protein A, subunit RPA32 (e value: 3.6e-70; identity: 49.64%)
Query:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID
        MY   FDGNAAF+GGGFMPSQ T     S S  KNRDV+ LLPLT+KQ++ A  S+  +SNF IDGVD+  V +VG +     RIT V F +DDGTG +D
Subjt:  MYASQFDGNAAFSGGGFMPSQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRID

Query:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLR--KQQSSSVTTQPQMT-NLSNTPMKVYQAP
        C +W +   ++ E+E +  G+YVR+HGHLK FQGKR++NVFS+RPVTD+NEI +HF E +YVH YNT+LR       + T +PQM  +   TP K YQ  
Subjt:  CSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLR--KQQSSSVTTQPQMT-NLSNTPMKVYQAP

Query:  IANQYTGQAGGESWKSLEQMVLDFLQLP-SCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTAN
         +NQ+  Q   +S   ++Q VL++L  P    SE G H D+IA++L++PL ++  A++ L  +G IYST D+  FKSTAN
Subjt:  IANQYTGQAGGESWKSLEQMVLDFLQLP-SCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTAN

AT3G02920.2 Replication protein A, subunit RPA32 (e value: 6.3e-67; identity: 47%)
Query:  MYASQFDGNAAFSGGGFMPSQ-TTQAPDHSF-------------------SPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRN
        MY   FDGNAAF+GGGFMPSQ TTQA + S                    S  +NRDV+ LLPLT+KQ++ A  S+  +SNF IDGVD+  V +VG +  
Subjt:  MYASQFDGNAAFSGGGFMPSQ-TTQAPDHSF-------------------SPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRN

Query:  RAERITDVTFALDDGTGRIDCSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLR--KQQSSSVT
           RIT V F +DDGTG +DC +W +   ++ E+E +  G+YVR+HGHLK FQGKR++NVFS+RPVTD+NEI +HF E +YVH YNT+LR       + T
Subjt:  RAERITDVTFALDDGTGRIDCSKWVNEAADSNEVEGILDGIYVRVHGHLKSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLR--KQQSSSVT

Query:  TQPQMT-NLSNTPMKVYQAPIANQYTGQAGGESWKSLEQMVLDFLQLP-SCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTAN
         +PQM  +   TP K YQ   +NQ+  Q   +S   ++Q VL++L  P    SE G H D+IA++L++PL ++  A++ L  +G IYST D+  FKSTAN
Subjt:  TQPQMT-NLSNTPMKVYQAPIANQYTGQAGGESWKSLEQMVLDFLQLP-SCDSERGAHRDVIAQQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTAN


Sequences
CDS sequence
ATGGCAGCAAAAGATCACACTCTTCTTCGTCTCTCTGAAAAACCCTTCTCATTACTCCAAGCAAATCAATTATTTGTCCACAAAATCCTTTCCAGAAACTCCTCATTTGG
CCGCTCTGCCCGTCTCCCTGCCTGTGGCCTCCCCGGTCAAGTCCCTTTCAATTGGGAAGCCCAGCCCGGCCTCCCTAAAAACCAACCTTCGGATAATCCCCCGCCAGCCG
AACTTCCTCCGCCCTCATCCTCCGCCTTGGGCCTCTCCAAAACTCCTCATGTAGTCCCTAAGCAAGCGCCGGTTAAGATTTGGTTTTGGAATAAGCAGAGGAGAAAATCG
AGGCGTGCAGTTAAGAAAAGCGGTGCGCTTGGATCGTCCTCACCGAGACGGCATGTCGATCGTCATCAGAGTGAGTTTTGTAGAAAAAAATCAAACGACGAGTCGTCGTC
GTCTTTGTCATGTATTTCTTATTCCTTGTCTTCGAATTCTTCTTCGGCTTCGTCATCAGATCGTTACAACAACCGACGGCGCTCCAAACTCGCTATACGTCCATATTCCA
TGGAAATATATGGTTCCCATATTCTTCACTCCTACCGTTACACAGGAAGAACGATGTATGCAAGTCAATTCGACGGCAACGCTGCCTTCTCCGGTGGTGGATTTATGCCA
TCTCAGACCACCCAGGCCCCTGATCATTCATTTTCTCCGGCCAAAAATCGGGATGTCCAGGCATTGCTTCCATTAACAGTGAAACAGATAAACGACGCATTTCTGTCTAG
TGATGATAAATCAAATTTTGTTATTGATGGTGTTGATGTTAACAATGTTAAGCTTGTAGGGATGGTGCGGAATAGAGCAGAAAGAATTACTGATGTGACTTTTGCACTGG
ACGATGGCACTGGACGGATTGACTGTAGCAAATGGGTCAATGAAGCTGCAGATTCAAACGAAGTTGAGGGAATATTGGATGGCATCTATGTCCGGGTCCATGGCCACTTA
AAGAGCTTCCAGGGTAAAAGAACTTTAAATGTCTTTTCTATCAGGCCAGTGACGGACTATAATGAGATCACAAATCACTTCATCGAATCCATATATGTTCATTTTTACAA
TACCAGATTGCGGAAACAACAAAGTAGTAGTGTGACTACTCAGCCACAGATGACAAATCTATCTAATACGCCCATGAAAGTATATCAAGCTCCCATTGCAAATCAATACA
CTGGTCAAGCAGGCGGTGAGAGTTGGAAGAGCCTTGAGCAGATGGTCTTAGATTTTCTACAACTTCCGTCATGCGATTCTGAAAGGGGCGCACACCGTGATGTCATTGCT
CAACAGCTCAAAGTTCCACTAGAGAAGCTCATTCCGGCGATGAAAAATCTAGAAGAGGAAGGCCTAATTTACTCAACAACTGATGATTTCCACTTCAAATCAACCGCCAA
CGGGTGCCAGCAGCCTGGAGAGGTGGATTTGGCTGACTCCAGGACTTCTGTCCAATCCGCAAACAATAACAATTCAAGCATGAACCCATAA
mRNA sequence
ATGGCAGCAAAAGATCACACTCTTCTTCGTCTCTCTGAAAAACCCTTCTCATTACTCCAAGCAAATCAATTATTTGTCCACAAAATCCTTTCCAGAAACTCCTCATTTGG
CCGCTCTGCCCGTCTCCCTGCCTGTGGCCTCCCCGGTCAAGTCCCTTTCAATTGGGAAGCCCAGCCCGGCCTCCCTAAAAACCAACCTTCGGATAATCCCCCGCCAGCCG
AACTTCCTCCGCCCTCATCCTCCGCCTTGGGCCTCTCCAAAACTCCTCATGTAGTCCCTAAGCAAGCGCCGGTTAAGATTTGGTTTTGGAATAAGCAGAGGAGAAAATCG
AGGCGTGCAGTTAAGAAAAGCGGTGCGCTTGGATCGTCCTCACCGAGACGGCATGTCGATCGTCATCAGAGTGAGTTTTGTAGAAAAAAATCAAACGACGAGTCGTCGTC
GTCTTTGTCATGTATTTCTTATTCCTTGTCTTCGAATTCTTCTTCGGCTTCGTCATCAGATCGTTACAACAACCGACGGCGCTCCAAACTCGCTATACGTCCATATTCCA
TGGAAATATATGGTTCCCATATTCTTCACTCCTACCGTTACACAGGAAGAACGATGTATGCAAGTCAATTCGACGGCAACGCTGCCTTCTCCGGTGGTGGATTTATGCCA
TCTCAGACCACCCAGGCCCCTGATCATTCATTTTCTCCGGCCAAAAATCGGGATGTCCAGGCATTGCTTCCATTAACAGTGAAACAGATAAACGACGCATTTCTGTCTAG
TGATGATAAATCAAATTTTGTTATTGATGGTGTTGATGTTAACAATGTTAAGCTTGTAGGGATGGTGCGGAATAGAGCAGAAAGAATTACTGATGTGACTTTTGCACTGG
ACGATGGCACTGGACGGATTGACTGTAGCAAATGGGTCAATGAAGCTGCAGATTCAAACGAAGTTGAGGGAATATTGGATGGCATCTATGTCCGGGTCCATGGCCACTTA
AAGAGCTTCCAGGGTAAAAGAACTTTAAATGTCTTTTCTATCAGGCCAGTGACGGACTATAATGAGATCACAAATCACTTCATCGAATCCATATATGTTCATTTTTACAA
TACCAGATTGCGGAAACAACAAAGTAGTAGTGTGACTACTCAGCCACAGATGACAAATCTATCTAATACGCCCATGAAAGTATATCAAGCTCCCATTGCAAATCAATACA
CTGGTCAAGCAGGCGGTGAGAGTTGGAAGAGCCTTGAGCAGATGGTCTTAGATTTTCTACAACTTCCGTCATGCGATTCTGAAAGGGGCGCACACCGTGATGTCATTGCT
CAACAGCTCAAAGTTCCACTAGAGAAGCTCATTCCGGCGATGAAAAATCTAGAAGAGGAAGGCCTAATTTACTCAACAACTGATGATTTCCACTTCAAATCAACCGCCAA
CGGGTGCCAGCAGCCTGGAGAGGTGGATTTGGCTGACTCCAGGACTTCTGTCCAATCCGCAAACAATAACAATTCAAGCATGAACCCATAA
Protein sequence
MAAKDHTLLRLSEKPFSLLQANQLFVHKILSRNSSFGRSARLPACGLPGQVPFNWEAQPGLPKNQPSDNPPPAELPPPSSSALGLSKTPHVVPKQAPVKIWFWNKQRRKS
RRAVKKSGALGSSSPRRHVDRHQSEFCRKKSNDESSSSLSCISYSLSSNSSSASSSDRYNNRRRSKLAIRPYSMEIYGSHILHSYRYTGRTMYASQFDGNAAFSGGGFMP
SQTTQAPDHSFSPAKNRDVQALLPLTVKQINDAFLSSDDKSNFVIDGVDVNNVKLVGMVRNRAERITDVTFALDDGTGRIDCSKWVNEAADSNEVEGILDGIYVRVHGHL
KSFQGKRTLNVFSIRPVTDYNEITNHFIESIYVHFYNTRLRKQQSSSVTTQPQMTNLSNTPMKVYQAPIANQYTGQAGGESWKSLEQMVLDFLQLPSCDSERGAHRDVIA
QQLKVPLEKLIPAMKNLEEEGLIYSTTDDFHFKSTANGCQQPGEVDLADSRTSVQSANNNNSSMNP
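
The protein sequence above is expected to correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration and is not a CuGenDBv2 tool. Only the first codons of the CDS are used in the example.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence listed above.
print(translate("ATGGCAGCAAAAGATCACACTCTTCTTCGTCTC"))
```

Passing the full CDS, with its line breaks left in, and comparing the result to the listed protein sequence would complete the check.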