; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G24290 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G24290
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionFilamentous hemagglutinin transporter
Genome locationClcChr09:37629690..37631179
RNA-Seq ExpressionClc09G24290
SyntenyClc09G24290
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139833.1 uncharacterized protein LOC101214550 [Cucumis sativus]1.6e-11292.28Show/hide
Query:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK
        MAAEVSSLIRVLAGYKD+DNR         TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ+QMINQTESK
Subjt:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAA---KEIQEEEAAEIRNSA
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS   STNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS+A   KEIQEEEAAEIRNSA
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAA---KEIQEEEAAEIRNSA

Query:  APMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        APMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  APMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

XP_008447115.1 PREDICTED: uncharacterized protein LOC103489640 [Cucumis melo]1.9e-11393.06Show/hide
Query:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK
        MAAEVSSLIRVLAGYK++DNR         TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ+QMINQTESK
Subjt:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS--AAKEIQEEEAAEIRNSAA
        FQDLNFPPSPSKRTLNLFNETSLDLKLTS  SPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS  AAKEIQEEEAAEIRNSAA
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS--AAKEIQEEEAAEIRNSAA

Query:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

XP_022930640.1 uncharacterized protein LOC111437042 [Cucurbita moschata]5.5e-10085.42Show/hide
Query:  MAAEVSSLIRVLAGYKDEDNR------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESKFQD
        MAA+VSSLIRVLAGYKD+DNR      TAL TRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QM NQTESK QD
Subjt:  MAAEVSSLIRVLAGYKDEDNR------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESKFQD

Query:  LNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAAKEIQEEEAAEIRNSAAPMAVG
        LNFPPSPSKRTLNLFNETSLDL LTS      STNYASVCTLDKVKSALERADKEL+KKRS+LWKS SSP      SAAKEIQEEEAAE R SAAPMAVG
Subjt:  LNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAAKEIQEEEAAEIRNSAAPMAVG

Query:  CPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        CPGCLSYVLV KNNPRCPRCNSVVPLP+IKKPRIDLN+SI
Subjt:  CPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

XP_022969452.1 uncharacterized protein LOC111468450 [Cucurbita maxima]1.4e-10086.4Show/hide
Query:  MAAEVSSLIRVL--AGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTE
        MAA+VSSLIRVL  AGY DEDNR         T LVTRDLLGQSSNL  SQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QM NQTE
Subjt:  MAAEVSSLIRVL--AGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTE

Query:  SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS--PSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS-AAKEIQ-EEEAAEIR
        SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS   SPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS AAKEIQ EEEA E R
Subjt:  SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS--PSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS-AAKEIQ-EEEAAEIR

Query:  N-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        N +AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL ++KKPRIDLNMSI
Subjt:  N-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

XP_038887388.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120077543 [Benincasa hispida]3.2e-10086.01Show/hide
Query:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK
        MAAEVSSLIRVLAGYKDEDNR         TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNS+SKQ +MINQTESK
Subjt:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAAKEIQEEEAAEIRNSAAPM
        FQDLNFPPSPSKRTLNL NETSLDLKLTSSPS + +    SVCTLDKVKSALERADKELVKKR            SS SSAAKEIQEEEAAEIRNSAAPM
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAAKEIQEEEAAEIRNSAAPM

Query:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

TrEMBL top hitse value%identityAlignment
A0A0A0K3J9 Uncharacterized protein8.0e-11392.28Show/hide
Query:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK
        MAAEVSSLIRVLAGYKD+DNR         TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ+QMINQTESK
Subjt:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAA---KEIQEEEAAEIRNSA
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS   STNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS+A   KEIQEEEAAEIRNSA
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAA---KEIQEEEAAEIRNSA

Query:  APMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        APMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  APMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

A0A1S3BGM6 uncharacterized protein LOC1034896409.4e-11493.06Show/hide
Query:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK
        MAAEVSSLIRVLAGYK++DNR         TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ+QMINQTESK
Subjt:  MAAEVSSLIRVLAGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS--AAKEIQEEEAAEIRNSAA
        FQDLNFPPSPSKRTLNLFNETSLDLKLTS  SPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS  AAKEIQEEEAAEIRNSAA
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS--AAKEIQEEEAAEIRNSAA

Query:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

A0A6J1EXD9 uncharacterized protein LOC1114370422.7e-10085.42Show/hide
Query:  MAAEVSSLIRVLAGYKDEDNR------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESKFQD
        MAA+VSSLIRVLAGYKD+DNR      TAL TRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QM NQTESK QD
Subjt:  MAAEVSSLIRVLAGYKDEDNR------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESKFQD

Query:  LNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAAKEIQEEEAAEIRNSAAPMAVG
        LNFPPSPSKRTLNLFNETSLDL LTS      STNYASVCTLDKVKSALERADKEL+KKRS+LWKS SSP      SAAKEIQEEEAAE R SAAPMAVG
Subjt:  LNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAAKEIQEEEAAEIRNSAAPMAVG

Query:  CPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        CPGCLSYVLV KNNPRCPRCNSVVPLP+IKKPRIDLN+SI
Subjt:  CPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

A0A6J1GJC7 uncharacterized protein LOC1114548764.5e-10083.98Show/hide
Query:  MAAEVSSLIRVLA-----GYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMIN
        MAA+VS+LIRVLA     GY DEDNR         T LVTRDLLGQSSNL  SQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSK  QM N
Subjt:  MAAEVSSLIRVLA-----GYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMIN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTS--SPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS-AAKEIQ-EEEAA
        QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTS  SPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS AAKEIQ EEEA 
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTS--SPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS-AAKEIQ-EEEAA

Query:  EIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        E RN     AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL ++KKPRIDLNMSI
Subjt:  EIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

A0A6J1HWE1 uncharacterized protein LOC1114684507.0e-10186.4Show/hide
Query:  MAAEVSSLIRVL--AGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTE
        MAA+VSSLIRVL  AGY DEDNR         T LVTRDLLGQSSNL  SQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QM NQTE
Subjt:  MAAEVSSLIRVL--AGYKDEDNR---------TALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTE

Query:  SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS--PSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS-AAKEIQ-EEEAAEIR
        SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS   SPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKS+SSPSYSSSSS AAKEIQ EEEA E R
Subjt:  SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS--PSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSS-AAKEIQ-EEEAAEIR

Query:  N-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        N +AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL ++KKPRIDLNMSI
Subjt:  N-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein5.4e-4545.45Show/hide
Query:  MAAEVSSLIRVLAGYKDE----------DNRTALVTRDLLGQSSNL-------TDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSD----S
        MAA+VSSL+R+L+ +KD+           +  AL+TRDLLG    +         S ELDLD+QVP GWEKRLDLKSGKVY+Q+     S  +S      
Subjt:  MAAEVSSLIRVLAGYKDE----------DNRTALVTRDLLGQSSNL-------TDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSD----S

Query:  KQLQMINQTESKFQDLNFPP----SPSKRTLNLF---NETSLDLKLT------------SSPSPSPSTNY-ASVCTLDKVKSALERADKELVKKRSSLWK
              NQT  +FQDLN PP     P+K  L+LF   ++TSL+LKL             SS SP+ S +Y +SVCTLDKVK ALERA+K+  K++S    
Subjt:  KQLQMINQTESKFQDLNFPP----SPSKRTLNLF---NETSLDLKLT------------SSPSPSPSTNY-ASVCTLDKVKSALERADKELVKKRSSLWK

Query:  SSSSPSYSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
              Y  ++SA              +A+ +A GCPGCLSYV V KNNP+CPRC+S VPLP +KKP+IDLN+S+
Subjt:  SSSSPSYSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

AT1G79160.1 unknown protein3.6e-4950.99Show/hide
Query:  MAAEVSSLIRVLAGYKDE----------DNRTALVTRDLL--GQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQT
        MAA+VSSL+R+L+GYKD+           +  AL+TRDLL  G+      S ELDLDLQVPTG+EKRLDLKSGKVY+QR  +  S   +++ Q    NQT
Subjt:  MAAEVSSLIRVLAGYKDE----------DNRTALVTRDLL--GQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQT

Query:  ESKFQDLNFPPSPSKRT--LNLFNETSLDLK-LTSSPSPSPST-NYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAAKEIQEEEAAEI
           FQDLNFPP     +  LNLF++T+ +LK L SS S  P+T N  SVCTLDKVKSALERA+++      +++K   SP          +    EA   
Subjt:  ESKFQDLNFPPSPSKRT--LNLFNETSLDLK-LTSSPSPSPST-NYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAAKEIQEEEAAEI

Query:  RNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPT---IKKPRIDLNMSI
           A+P+  GCPGCLSYVLVM NNP+CPRC+++VPLPT    KKP+IDLN+SI
Subjt:  RNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPT---IKKPRIDLNMSI

AT5G06270.1 unknown protein9.3e-0532.53Show/hide
Query:  RSSLWKSSSSPSYSSSSSAAKEIQEEEAAEIRNSAAP-----MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLN
        RS    +++SP+   SS  + E+ ++E + +R S +P     + VGCP CL YV++ +++P+CP+C S V L  + +   + N
Subjt:  RSSLWKSSSSPSYSSSSSAAKEIQEEEAAEIRNSAAP-----MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLN

AT5G22270.1 unknown protein2.7e-0444.26Show/hide
Query:  SSSPSYSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLV-MKNNPRCPRCNSVVPL
        SS  + SSSSS   E  E    E ++    + VGCP C+ Y++  ++N+PRCPRCNS V L
Subjt:  SSSPSYSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLV-MKNNPRCPRCNSVVPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCGAAGTAAGCAGTCTGATTCGAGTACTCGCCGGTTACAAGGACGAAGACAATCGAACGGCTCTCGTTACTCGCGATTTGCTCGGACAATCTTCCAACCTTAC
TGACTCTCAAGAATTGGACCTCGATTTGCAGGTTCCCACCGGCTGGGAGAAAAGGCTCGACTTGAAGTCGGGAAAAGTTTACATACAGAGAAGTCAAACACCGGATTCTC
CTCTGAATTCAGATTCGAAACAACTCCAAATGATCAATCAAACAGAATCGAAATTTCAGGACTTGAATTTCCCCCCATCTCCTTCAAAACGAACATTAAATCTCTTCAAC
GAAACCAGTTTGGATTTGAAATTGACATCGTCGCCGTCGCCGTCGCCGTCGACCAATTACGCCAGCGTTTGTACTCTGGATAAGGTGAAATCTGCTCTGGAAAGGGCCGA
CAAGGAGCTCGTAAAGAAACGCTCTTCGCTATGGAAATCGTCTTCATCGCCGTCTTACTCCTCCTCCTCGTCGGCGGCGAAGGAAATTCAAGAAGAAGAAGCGGCGGAAA
TTAGAAACTCGGCGGCGCCGATGGCGGTGGGTTGCCCAGGATGTTTATCATATGTATTAGTAATGAAAAACAACCCCCGATGTCCTCGTTGCAACTCTGTTGTTCCATTG
CCGACCATCAAGAAACCTCGGATTGATCTAAACATGTCCATATAA
mRNA sequenceShow/hide mRNA sequence
GCCCAATTAATTGTCTTCCTCCACCCTCCCACTTTCCCATCATTAATCAAATTCGAACAATTAATTTCCATTTTCACTCCCACCATTAATATTGTGAAAATGAATGAATT
AATTAATTTACTTCATAACGTGGGTTCAATCTTCAATATTTAGAAGATAAAATTTTGGTCTTATATTTCCCTATCTCGCTCGCGCTCTCCATACAATTACCAACACCCAA
CCAATCCAAACCCACTCATTCTTCTCTTTCTCTTTCTCTCTCTCTCTCTCTTTTTTCCCTCTTCCTCTTATTTTTATCAAAATAATAATAATAATAACAACAACAAATTA
AATTAAATTAAATTATTAAACCACCTTCAAAAGGGGTTTCCTTTAGTTTCCGCTTTGTTTCTCTATAAAGCTCCCAGAAAATGGCTGCCGAAGTAAGCAGTCTGATTCGA
GTACTCGCCGGTTACAAGGACGAAGACAATCGAACGGCTCTCGTTACTCGCGATTTGCTCGGACAATCTTCCAACCTTACTGACTCTCAAGAATTGGACCTCGATTTGCA
GGTTCCCACCGGCTGGGAGAAAAGGCTCGACTTGAAGTCGGGAAAAGTTTACATACAGAGAAGTCAAACACCGGATTCTCCTCTGAATTCAGATTCGAAACAACTCCAAA
TGATCAATCAAACAGAATCGAAATTTCAGGACTTGAATTTCCCCCCATCTCCTTCAAAACGAACATTAAATCTCTTCAACGAAACCAGTTTGGATTTGAAATTGACATCG
TCGCCGTCGCCGTCGCCGTCGACCAATTACGCCAGCGTTTGTACTCTGGATAAGGTGAAATCTGCTCTGGAAAGGGCCGACAAGGAGCTCGTAAAGAAACGCTCTTCGCT
ATGGAAATCGTCTTCATCGCCGTCTTACTCCTCCTCCTCGTCGGCGGCGAAGGAAATTCAAGAAGAAGAAGCGGCGGAAATTAGAAACTCGGCGGCGCCGATGGCGGTGG
GTTGCCCAGGATGTTTATCATATGTATTAGTAATGAAAAACAACCCCCGATGTCCTCGTTGCAACTCTGTTGTTCCATTGCCGACCATCAAGAAACCTCGGATTGATCTA
AACATGTCCATATAA
Protein sequenceShow/hide protein sequence
MAAEVSSLIRVLAGYKDEDNRTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQLQMINQTESKFQDLNFPPSPSKRTLNLFN
ETSLDLKLTSSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSSSSPSYSSSSSAAKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPL
PTIKKPRIDLNMSI