; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017613 (gene) of Snake gourd v1 genome

Gene IDTan0017613
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein translocase subunit SecA
Genome locationLG09:48580650..48586306
RNA-Seq ExpressionTan0017613
SyntenyTan0017613
Gene Ontology termsNA
InterPro domainsIPR004027 - SEC-C motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588322.1 hypothetical protein SDJN03_16887, partial [Cucurbita argyrosperma subsp. sororia]3.9e-11492.7Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LINLS+ N+SN N FF FSSSD SPR+C STLLQSRSIFSTTQL+DSWMDKIKG FTGKK+P EGTEISSESFTLLRFA+ELKNARRVGAFKQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAK CNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKS+AEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
        RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT

XP_008450751.1 PREDICTED: protein translocase subunit SecA [Cucumis melo]1.4e-11190.17Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LI LSKK + NPN FF  SSSDH+P NCY TLLQSRSIFSTTQLH SWMDKIKGV TGKK   EGT+ISSESFTLLRFADELKNARRVGAFKQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFD TGENIQTSQKQEAAKNCNCTIADVEN L+KF+WAKEAQKKIEKLKEEGKP+P SIAEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQTV
        RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQTV
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQTV

XP_022932559.1 uncharacterized protein LOC111439055 [Cucurbita moschata]2.1e-11593.56Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LINLS+ N+SN N FF FSSSD SPRNC STLLQSRSIFSTTQL+DSWMDKIKG FTGKK+P EGTEISSESFTLLRFADELKNARRVGAFKQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAK CNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKS+AEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
        RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT

XP_022965948.1 uncharacterized protein LOC111465670 [Cucurbita maxima]1.4e-11191.85Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LINLS+ N+SN N  F FSS D SPRNC S LLQSRSIFSTTQL+DSWMDKIKG FTGKK+P EGTEISSESFTLLRFA+ELKNARRVGAFKQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAK CNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKS+AEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
        RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT

XP_023530731.1 uncharacterized protein LOC111793188 [Cucurbita pepo subsp. pepo]3.5e-11593.99Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LINLS+ N+SN N FF FSSSD SPRNC STLLQSRSIFSTTQLHDSWMDKIKG FTGKK+  EGTEISSESFTLLRFADELKNARRVGAFKQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAK CNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
        RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT

TrEMBL top hitse value%identityAlignment
A0A0A0LWL6 Uncharacterized protein2.6e-11188.89Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LIN SK+ + NPN FF FSSSDH+P NCY TLLQSRSIFSTTQLH SWMDKIKGV +GKKN  EGT+ISSESFTLLRFADELKNARRVGA KQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFD TGENIQTSQKQEAAKNCNCTIA+VEN L+KF+WAKEAQKKIEKLKEEGKP+P +IAEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQTV
        RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQTV
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQTV

A0A1S3BPC0 protein translocase subunit SecA6.7e-11290.17Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LI LSKK + NPN FF  SSSDH+P NCY TLLQSRSIFSTTQLH SWMDKIKGV TGKK   EGT+ISSESFTLLRFADELKNARRVGAFKQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFD TGENIQTSQKQEAAKNCNCTIADVEN L+KF+WAKEAQKKIEKLKEEGKP+P SIAEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQTV
        RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQTV
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQTV

A0A5D3CG47 Protein translocase subunit SecA3.2e-10689.82Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LI LSKK + NPN FF  SSSDH+P NCY TLLQSRSIFSTTQLH SWMDKIKGV TGKK   EGT+ISSESFTLLRFADELKNARRVGAFKQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFD TGENIQTSQKQEAAKNCNCTIADVEN L+KF+WAKEAQKKIEKLKEEGKP+P SIAEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKR
        RSNLAKSGQISRNALCPCGSKKRYKR
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKR

A0A6J1EXB8 uncharacterized protein LOC1114390551.0e-11593.56Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LINLS+ N+SN N FF FSSSD SPRNC STLLQSRSIFSTTQL+DSWMDKIKG FTGKK+P EGTEISSESFTLLRFADELKNARRVGAFKQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAK CNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKS+AEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
        RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT

A0A6J1HQG1 uncharacterized protein LOC1114656706.7e-11291.85Show/hide
Query:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ
        MRSSSR LINLS+ N+SN N  F FSS D SPRNC S LLQSRSIFSTTQL+DSWMDKIKG FTGKK+P EGTEISSESFTLLRFA+ELKNARRVGAFKQ
Subjt:  MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQ

Query:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA
        YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAK CNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKS+AEVQKLVGSNPLDLA
Subjt:  YIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLA

Query:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
        RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT
Subjt:  RSNLAKSGQISRNALCPCGSKKRYKRCCGKDQT

SwissProt top hitse value%identityAlignment
A4J927 Protein translocase subunit SecA4.5e-0424.54Show/hide
Query:  EISSESFTLLRFADELKNARRVGAFKQYIVGRSSEATFADAFEKQEAI-----IRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQ
        E+   + +L +  +E   A  +   ++ ++ R  +  + D  +  + +     +R  G  DP  E      K EA +  N  IA++++ + ++I+     
Subjt:  EISSESFTLLRFADELKNARRVGAFKQYIVGRSSEATFADAFEKQEAI-----IRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQ

Query:  KKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLARSNLAKSGQISRNALCPCGSKKRYKRCCGKD
           E+ + +   +    AE +   G  P       + K  QI RN  CPCGS K+YK+CCGK+
Subjt:  KKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLARSNLAKSGQISRNALCPCGSKKRYKRCCGKD

A4XJ42 Protein translocase subunit SecA5.9e-0442Show/hide
Query:  MPKSIAEVQKLVGSNPLDL-ARSNLAKSGQISRNALCPCGSKKRYKRCCG
        MPK    V+++  ++P D   R ++ K+ ++ RN  CPCGS K+YK+CCG
Subjt:  MPKSIAEVQKLVGSNPLDL-ARSNLAKSGQISRNALCPCGSKKRYKRCCG

A6TLE7 Protein translocase subunit SecA 19.0e-0524.07Show/hide
Query:  EISSESFTLLRFADELKNARRVGAFKQYIVGRSSEATFADAFEKQEAI-----IRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQ
        +I   S  L    +E   A R+   ++ IV +  +  + D  +  + +     +R +G  DP       + + E     N  I  ++    K+++  E Q
Subjt:  EISSESFTLLRFADELKNARRVGAFKQYIVGRSSEATFADAFEKQEAI-----IRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQ

Query:  KKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLARSNLAKSGQISRNALCPCGSKKRYKRCCGK
         K+E+ K+  KP+  S  +  +          ++ + K  +  RN  CPCGS K+YK+CCG+
Subjt:  KKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLARSNLAKSGQISRNALCPCGSKKRYKRCCGK

Arabidopsis top hitse value%identityAlignment
AT3G04950.1 CONTAINS InterPro DOMAIN/s: SEC-C motif (InterPro:IPR004027); Has 583 Blast hits to 583 proteins in 248 species: Archae - 0; Bacteria - 488; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 61 (source: NCBI BLink).2.8e-7874.48Show/hide
Query:  QSRSIFSTTQLHDSWMDKIKGVFTGKKNPP-EGTEISSESFTLLRFADELKNARRVGAFKQYIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQ
        Q RSI ST  L + WMD IKGVFTG K+ P E + +  E+FTLLRFADELKNARR+G FKQYIVGRSSEATFADAFEKQEA+IRYLG  D TGEN+Q SQ
Subjt:  QSRSIFSTTQLHDSWMDKIKGVFTGKKNPP-EGTEISSESFTLLRFADELKNARRVGAFKQYIVGRSSEATFADAFEKQEAIIRYLGGFDPTGENIQTSQ

Query:  KQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLARSNLAKSGQISRNALCPCGSKKRYKRCCGKD
        KQ+AAK+C CTI DVENTL+KF WA++A KK+ +LKE GKP+PK++ E+QK++GS P+DLARSNLAKSGQISRNALCPCGSKKRYKRCCGKD
Subjt:  KQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLARSNLAKSGQISRNALCPCGSKKRYKRCCGKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTCGTCCTCTCGACCCTTAATCAATCTTTCCAAGAAAAACAGCTCGAACCCCAATGGGTTCTTCCTCTTCTCTTCTTCCGATCACAGTCCCCGTAACTGTTATTC
CACTCTGTTACAGTCCCGATCCATTTTCTCCACGACGCAGCTTCACGATTCATGGATGGACAAGATCAAGGGAGTCTTCACTGGAAAGAAGAACCCTCCCGAAGGAACCG
AAATCAGCTCCGAGTCCTTTACTCTCCTCCGGTTTGCTGATGAGTTGAAAAATGCCCGTAGAGTAGGGGCATTCAAGCAATATATAGTGGGAAGAAGTAGTGAAGCTACT
TTTGCGGATGCTTTTGAGAAGCAAGAAGCCATTATTCGCTATTTGGGAGGTTTTGATCCCACTGGAGAGAACATCCAAACCAGTCAAAAGCAAGAAGCAGCAAAAAATTG
TAACTGCACAATCGCTGATGTTGAAAATACGCTGGCAAAGTTTATCTGGGCTAAAGAAGCACAAAAGAAGATCGAGAAGTTAAAGGAAGAAGGAAAACCAATGCCAAAGA
GCATTGCCGAGGTCCAGAAACTGGTGGGTTCAAATCCATTGGATCTTGCTAGGTCAAATTTGGCTAAGAGTGGTCAGATCAGCCGGAATGCTCTCTGTCCTTGCGGTTCC
AAGAAGAGATATAAACGGTGCTGCGGGAAGGATCAAACGGTATAG
mRNA sequenceShow/hide mRNA sequence
GTATTTTTGACACAACCCGATGGTTTTGGGCACTTTCTATAATTTAGCCAATGTTTTTTTAACCGATAACGCTGCATAGAGGTCGCCGATCCCTTCCGCCTGAGTATTCG
TTGTGAAAAACCTCCGGACCGACTAAAACCCTAATTCCAGCCTGAGGAAGCCGATTTAGAACTCTTTCATTTTCCCGAAAATGCGTTCGTCCTCTCGACCCTTAATCAAT
CTTTCCAAGAAAAACAGCTCGAACCCCAATGGGTTCTTCCTCTTCTCTTCTTCCGATCACAGTCCCCGTAACTGTTATTCCACTCTGTTACAGTCCCGATCCATTTTCTC
CACGACGCAGCTTCACGATTCATGGATGGACAAGATCAAGGGAGTCTTCACTGGAAAGAAGAACCCTCCCGAAGGAACCGAAATCAGCTCCGAGTCCTTTACTCTCCTCC
GGTTTGCTGATGAGTTGAAAAATGCCCGTAGAGTAGGGGCATTCAAGCAATATATAGTGGGAAGAAGTAGTGAAGCTACTTTTGCGGATGCTTTTGAGAAGCAAGAAGCC
ATTATTCGCTATTTGGGAGGTTTTGATCCCACTGGAGAGAACATCCAAACCAGTCAAAAGCAAGAAGCAGCAAAAAATTGTAACTGCACAATCGCTGATGTTGAAAATAC
GCTGGCAAAGTTTATCTGGGCTAAAGAAGCACAAAAGAAGATCGAGAAGTTAAAGGAAGAAGGAAAACCAATGCCAAAGAGCATTGCCGAGGTCCAGAAACTGGTGGGTT
CAAATCCATTGGATCTTGCTAGGTCAAATTTGGCTAAGAGTGGTCAGATCAGCCGGAATGCTCTCTGTCCTTGCGGTTCCAAGAAGAGATATAAACGGTGCTGCGGGAAG
GATCAAACGGTATAGGAAGAGAACTTGTATGTGCCCTTTTTTTGCCTGAATAATTGACGATAGTTGGACTGGTCATTCTATTGTATTTGAACTGGTTGGTTAAGATATTG
AGATTCTTCTTTCAGTGACTCTGCATGGTTTTTAATGCATAAAATCCTCATCCTTCATTAGAAAAGTGATCTTCCAAAATTTCTTCGAAGGTTTTGTGATTTTCTAGCTT
ATGGTTTTACTGTTCTAGTCTAGATGTTTTGTAAATGGG
Protein sequenceShow/hide protein sequence
MRSSSRPLINLSKKNSSNPNGFFLFSSSDHSPRNCYSTLLQSRSIFSTTQLHDSWMDKIKGVFTGKKNPPEGTEISSESFTLLRFADELKNARRVGAFKQYIVGRSSEAT
FADAFEKQEAIIRYLGGFDPTGENIQTSQKQEAAKNCNCTIADVENTLAKFIWAKEAQKKIEKLKEEGKPMPKSIAEVQKLVGSNPLDLARSNLAKSGQISRNALCPCGS
KKRYKRCCGKDQTV