; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022745 (gene) of Snake gourd v1 genome

Gene IDTan0022745
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionFilamentous hemagglutinin transporter
Genome locationLG02:710478..712006
RNA-Seq ExpressionTan0022745
SyntenyTan0022745
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139833.1 uncharacterized protein LOC101214550 [Cucumis sativus]1.1e-11694.69Show/hide
Query:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK
        MAAEVSSLIRVLAGYK++DNRTALGNGQ+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QMINQTE+K
Subjt:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAA--KEIQEEEAAEIRNSAA
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSP  SSTNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSSSAA  KEIQEEEAAEIRNSAA
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAA--KEIQEEEAAEIRNSAA

Query:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP IKKPRIDLNMSI
Subjt:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

XP_008447115.1 PREDICTED: uncharacterized protein LOC103489640 [Cucumis melo]3.9e-11795.49Show/hide
Query:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK
        MAAEVSSLIRVLAGYKE+DNRTALGNGQ+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QMINQTE+K
Subjt:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSS-AAKEIQEEEAAEIRNSAAP
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSP S STNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSSS AAKEIQEEEAAEIRNSAAP
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSS-AAKEIQEEEAAEIRNSAAP

Query:  MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP IKKPRIDLNMSI
Subjt:  MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

XP_022952111.1 uncharacterized protein LOC111454876 [Cucurbita moschata]9.1e-10686.33Show/hide
Query:  MAAEVSSLIRVLA-----GYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMIN
        MAA+VS+LIRVLA     GY +EDNRT LGNGQE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK HQM N
Subjt:  MAAEVSSLIRVLA-----GYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMIN

Query:  QTETKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS---SSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQ-EEEAA
        QTE+KFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS   S STNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS+AAKEIQ EEEA 
Subjt:  QTETKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS---SSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQ-EEEAA

Query:  EIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        E RN     AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL  +KKPRIDLNMSI
Subjt:  EIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

XP_022969452.1 uncharacterized protein LOC111468450 [Cucurbita maxima]1.4e-10688.8Show/hide
Query:  MAAEVSSLIRVL--AGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTE
        MAA+VSSLIRVL  AGY +EDNRT LGNGQE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQM NQTE
Subjt:  MAAEVSSLIRVL--AGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTE

Query:  TKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSS---STNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQ-EEEAAEIR
        +KFQDLNFPPSPSKRTLNLFNETSLDLKLTSS  SS   STNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS+AAKEIQ EEEA E R
Subjt:  TKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSS---STNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQ-EEEAAEIR

Query:  N-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        N +AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL  +KKPRIDLNMSI
Subjt:  N-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

XP_038887388.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120077543 [Benincasa hispida]2.6e-10588.07Show/hide
Query:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK
        MAAEVSSLIRVLAGYK+EDNRTALGNGQESTALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNS+SKQ +MINQTE+K
Subjt:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQEEEAAEIRNSAAPM
        FQDLNFPPSPSKRTLNL NETSLDLKLTSSPS++ T   SVCTLDKVKSALERADKELVKKR             SS SSAAKEIQEEEAAEIRNSAAPM
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQEEEAAEIRNSAAPM

Query:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        AVGCPGCLSYVLVMKNNPRCPRCNSVVPLP IKKPRIDLNMSI
Subjt:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

TrEMBL top hitse value%identityAlignment
A0A0A0K3J9 Uncharacterized protein5.5e-11794.69Show/hide
Query:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK
        MAAEVSSLIRVLAGYK++DNRTALGNGQ+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QMINQTE+K
Subjt:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAA--KEIQEEEAAEIRNSAA
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSP  SSTNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSSSAA  KEIQEEEAAEIRNSAA
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAA--KEIQEEEAAEIRNSAA

Query:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP IKKPRIDLNMSI
Subjt:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

A0A1S3BGM6 uncharacterized protein LOC1034896401.9e-11795.49Show/hide
Query:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK
        MAAEVSSLIRVLAGYKE+DNRTALGNGQ+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QMINQTE+K
Subjt:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSS-AAKEIQEEEAAEIRNSAAP
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSP S STNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSSS AAKEIQEEEAAEIRNSAAP
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSS-AAKEIQEEEAAEIRNSAAP

Query:  MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP IKKPRIDLNMSI
Subjt:  MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

A0A6J1EXD9 uncharacterized protein LOC1114370422.9e-10286.01Show/hide
Query:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK
        MAA+VSSLIRVLAGYK++DNR AL   +ESTAL TRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QM NQTE+K
Subjt:  MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQEEEAAEIRNSAAPM
         QDLNFPPSPSKRTLNLFNETSLDL LT     SSTNYASVCTLDKVKSALERADKEL+KKRS+LWKS SSP       SAAKEIQEEEAAE R SAAPM
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQEEEAAEIRNSAAPM

Query:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        AVGCPGCLSYVLV KNNPRCPRCNSVVPLP IKKPRIDLN+SI
Subjt:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

A0A6J1GJC7 uncharacterized protein LOC1114548764.4e-10686.33Show/hide
Query:  MAAEVSSLIRVLA-----GYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMIN
        MAA+VS+LIRVLA     GY +EDNRT LGNGQE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK HQM N
Subjt:  MAAEVSSLIRVLA-----GYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMIN

Query:  QTETKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS---SSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQ-EEEAA
        QTE+KFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS   S STNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS+AAKEIQ EEEA 
Subjt:  QTETKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS---SSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQ-EEEAA

Query:  EIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        E RN     AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL  +KKPRIDLNMSI
Subjt:  EIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

A0A6J1HWE1 uncharacterized protein LOC1114684506.8e-10788.8Show/hide
Query:  MAAEVSSLIRVL--AGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTE
        MAA+VSSLIRVL  AGY +EDNRT LGNGQE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQM NQTE
Subjt:  MAAEVSSLIRVL--AGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTE

Query:  TKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSS---STNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQ-EEEAAEIR
        +KFQDLNFPPSPSKRTLNLFNETSLDLKLTSS  SS   STNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS+AAKEIQ EEEA E R
Subjt:  TKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSS---STNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQ-EEEAAEIR

Query:  N-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
        N +AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL  +KKPRIDLNMSI
Subjt:  N-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein5.0e-4644.93Show/hide
Query:  MAAEVSSLIRVLAGYKEEDNRTALGNGQEST-ALVTRDLLGQSSNLA-------DSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQH-
        MAA+VSSL+R+L+ +K++        G  ST AL+TRDLLG    +         S ELDLD+QVP+GWEKRLDLKSGKVY+Q+     S  +S    H 
Subjt:  MAAEVSSLIRVLAGYKEEDNRTALGNGQEST-ALVTRDLLGQSSNLA-------DSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQH-

Query:  ---QMINQTETKFQDLNFPP----SPSKRTLNLF---NETSLDLKL--------------TSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWK
              NQT  +FQDLN PP     P+K  L+LF   ++TSL+LKL              + SP+ S +  +SVCTLDKVK ALERA+K+  K++S    
Subjt:  ---QMINQTETKFQDLNFPP----SPSKRTLNLF---NETSLDLKL--------------TSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWK

Query:  SSSSPSYSSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI
              Y  ++S+               +A+ +A GCPGCLSYV V KNNP+CPRC+S VPLP +KKP+IDLN+S+
Subjt:  SSSSPSYSSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKKPRIDLNMSI

AT1G79160.1 unknown protein4.1e-4849.61Show/hide
Query:  MAAEVSSLIRVLAGYKEEDNRTAL---GNGQESTALVTRDLLGQSSNLAD--SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMIN
        MAA+VSSL+R+L+GYK  D+R  +      + S AL+TRDLLG         S ELDLDLQVP+G+EKRLDLKSGKVY+QR  +  S   +++ Q    N
Subjt:  MAAEVSSLIRVLAGYKEEDNRTAL---GNGQESTALVTRDLLGQSSNLAD--SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMIN

Query:  QTETKFQDLNFPPSPSKRT--LNLFNETSLDLKLTSSPSSS---STNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQEEEA
        QT   FQDLNFPP     +  LNLF++T+ +LKL  S  SS   ++N  SVCTLDKVKSALERA+++      +++K   SP           +    EA
Subjt:  QTETKFQDLNFPPSPSKRT--LNLFNETSLDLKLTSSPSSS---STNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQEEEA

Query:  AEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPL---PIIKKPRIDLNMSI
              A+P+  GCPGCLSYVLVM NNP+CPRC+++VPL   P+ KKP+IDLN+SI
Subjt:  AEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPL---PIIKKPRIDLNMSI

AT3G11600.1 unknown protein8.2e-0435.21Show/hide
Query:  SLWKSSSSPSYSSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKK
        SL +S +  + +S SS  + E  +EE   I    + + VGCP CL YV++  ++P+CP+C S V L  +++
Subjt:  SLWKSSSSPSYSSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPIIKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCGAAGTAAGTAGTCTGATTCGAGTTCTCGCCGGCTACAAGGAAGAAGATAATCGGACGGCTCTCGGTAATGGCCAGGAATCAACGGCTCTCGTCACTCGCGA
TTTGCTCGGTCAATCTTCTAACCTCGCTGACTCCCAGGAATTGGACCTCGATTTGCAAGTTCCTTCCGGCTGGGAAAAACGCCTCGACTTGAAGTCGGGAAAAGTTTACA
TACAGAGAAGTCAAACTCCAGATTCTCCTCTGAATTCAGATTCCAAACAACACCAAATGATCAATCAAACAGAAACCAAATTCCAGGACTTGAATTTCCCCCCATCTCCT
TCAAAACGAACATTAAATCTCTTCAATGAAACCAGCTTGGATTTGAAATTGACGTCGTCGCCGTCGTCGTCGTCGACCAATTACGCCAGCGTTTGCACTCTGGATAAGGT
CAAATCTGCTCTAGAAAGGGCCGACAAGGAGCTGGTTAAAAAACGGTCTTCGCTATGGAAATCGTCGTCGTCGCCGTCGTACTCCTCCTCCTCCTCCTCGGCGGCGAAGG
AAATTCAAGAAGAAGAAGCCGCGGAAATTAGAAACTCGGCGGCGCCGATGGCGGTGGGTTGTCCTGGATGTTTATCTTATGTATTGGTAATGAAAAACAACCCAAGATGT
CCTCGTTGCAATTCTGTTGTTCCATTGCCGATCATCAAGAAACCTCGGATTGATCTAAACATGTCCATATAA
mRNA sequenceShow/hide mRNA sequence
AATTTAGTTCATAACGTCGTTGCAATAATATTTACTTGATAACTATCCTCCTCTATTTCCCATAGCGCTCTCCATACAATTACAAACACCAAACCCCCACCCACTCCTTT
CTTCTCCCTCTCTCTCTCCCAATCCTCTTTTTCCCTCTGCCTCTTATTTTTATCAGAAAGAAAAAATATATATATATAAAAAAAAAACAAAACCCCTTCAAAAAAGGGGT
TTCGTTTCTCTATAAAGCTCCAAAAATGGCTGCCGAAGTAAGTAGTCTGATTCGAGTTCTCGCCGGCTACAAGGAAGAAGATAATCGGACGGCTCTCGGTAATGGCCAGG
AATCAACGGCTCTCGTCACTCGCGATTTGCTCGGTCAATCTTCTAACCTCGCTGACTCCCAGGAATTGGACCTCGATTTGCAAGTTCCTTCCGGCTGGGAAAAACGCCTC
GACTTGAAGTCGGGAAAAGTTTACATACAGAGAAGTCAAACTCCAGATTCTCCTCTGAATTCAGATTCCAAACAACACCAAATGATCAATCAAACAGAAACCAAATTCCA
GGACTTGAATTTCCCCCCATCTCCTTCAAAACGAACATTAAATCTCTTCAATGAAACCAGCTTGGATTTGAAATTGACGTCGTCGCCGTCGTCGTCGTCGACCAATTACG
CCAGCGTTTGCACTCTGGATAAGGTCAAATCTGCTCTAGAAAGGGCCGACAAGGAGCTGGTTAAAAAACGGTCTTCGCTATGGAAATCGTCGTCGTCGCCGTCGTACTCC
TCCTCCTCCTCCTCGGCGGCGAAGGAAATTCAAGAAGAAGAAGCCGCGGAAATTAGAAACTCGGCGGCGCCGATGGCGGTGGGTTGTCCTGGATGTTTATCTTATGTATT
GGTAATGAAAAACAACCCAAGATGTCCTCGTTGCAATTCTGTTGTTCCATTGCCGATCATCAAGAAACCTCGGATTGATCTAAACATGTCCATATAAAAATGTTTGGGAA
TCATCCATCAATATCATATATAAAATATCCAAAGGGAGATTAAGAAATCTTCGTGGATCTTTTGTTTTGGGTATATGTAAAATGGATATATTTACACAAAATTATTAT
Protein sequenceShow/hide protein sequence
MAAEVSSLIRVLAGYKEEDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMINQTETKFQDLNFPPSP
SKRTLNLFNETSLDLKLTSSPSSSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRC
PRCNSVVPLPIIKKPRIDLNMSI