; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh19G011290 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh19G011290
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionFilamentous hemagglutinin transporter
Genome locationCmo_Chr19:9563755..9565123
RNA-Seq ExpressionCmoCh19G011290
SyntenyCmoCh19G011290
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572471.1 hypothetical protein SDJN03_29199, partial [Cucurbita argyrosperma subsp. sororia]1.1e-11296.98Show/hide
Query:  MAAQVSTLIRVLA---GGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQ
        MAAQVSTLIRVLA   GGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQ
Subjt:  MAAQVSTLIRVLA---GGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQ

Query:  MTNQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEE
        MTNQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEE
Subjt:  MTNQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEE

Query:  EAVENRNEAAV---LAAPMAVGCRGCLSYVLV
        EAVENRNEAA    LAAPMAVGCRGCLSYVLV
Subjt:  EAVENRNEAAV---LAAPMAVGCRGCLSYVLV

KAG7012065.1 hypothetical protein SDJN02_26973, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-13197.33Show/hide
Query:  MAAQVSTLIRVLA---GGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQ
        MAAQVSTLIRVLA   GGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQ
Subjt:  MAAQVSTLIRVLA---GGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQ

Query:  MTNQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEE
        MTNQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEE
Subjt:  MTNQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEE

Query:  EAVENRNEAAV---LAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
        EAVENRNEAA    LAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
Subjt:  EAVENRNEAAV---LAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

XP_022952111.1 uncharacterized protein LOC111454876 [Cucurbita moschata]5.7e-135100Show/hide
Query:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN
        MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN
Subjt:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV
        QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV

Query:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
        ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
Subjt:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

XP_022969452.1 uncharacterized protein LOC111468450 [Cucurbita maxima]3.5e-12495.31Show/hide
Query:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN
        MAAQVS+LIRVLA   G GYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK HQMTN
Subjt:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV
        QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS   SPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV

Query:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
        ENRNEA   AAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
Subjt:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

XP_023553772.1 uncharacterized protein DDB_G0280205 [Cucurbita pepo subsp. pepo]1.0e-12897.67Show/hide
Query:  MAAQVSTLIRVLA-GGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMT
        MAAQVSTLIRVLA GGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMT
Subjt:  MAAQVSTLIRVLA-GGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMT

Query:  NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSP-SYSSSSSAAAKEIQEEEE
        NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSP SYSSSSSAAAKEIQEEEE
Subjt:  NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSP-SYSSSSSAAAKEIQEEEE

Query:  AVENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
        AVENRNE    AAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
Subjt:  AVENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

TrEMBL top hitse value%identityAlignment
A0A0A0K3J9 Uncharacterized protein1.6e-10384.11Show/hide
Query:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN
        MAA+VS+LIRVLA     GY D+DNRT LGNGQ+ T LVTRDLLGQSSNL  SQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSK  QM N
Subjt:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAA--KEIQEEEE
        QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS     STNYASVCTLDKVKSALERADKELVKKRSSLWKS SSPSYSSSSS+AA  KEIQ EEE
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAA--KEIQEEEE

Query:  AVENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
        A E RN     AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL ++KKPRIDLNMSI
Subjt:  AVENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

A0A1S3BGM6 uncharacterized protein LOC1034896401.9e-10484.82Show/hide
Query:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN
        MAA+VS+LIRVLA     GY ++DNRT LGNGQ+ T LVTRDLLGQSSNL  SQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSK  QM N
Subjt:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSY-SSSSSAAAKEIQEEEEA
        QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTS    SPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKS SSPSY SSSSSAAAKEIQ EEEA
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSY-SSSSSAAAKEIQEEEEA

Query:  VENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
         E RN     AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL ++KKPRIDLNMSI
Subjt:  VENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

A0A6J1EXD9 uncharacterized protein LOC1114370421.5e-9378.52Show/hide
Query:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN
        MAAQVS+LIRVLA     GY D+DNR  L   +E T L TRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK  QM N
Subjt:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV
        QTESK QDLNFPPSPSKRTLNLFNETSLDL LTS        STNYASVCTLDKVKSALERADKEL+KKRS+LWKS SSPS       AAKEIQEEE A 
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV

Query:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
            E+   AAPMAVGC GCLSYVLVTKNNPRCPRCNSVVPL S+KKPRIDLN+SI
Subjt:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

A0A6J1GJC7 uncharacterized protein LOC1114548762.8e-135100Show/hide
Query:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN
        MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN
Subjt:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV
        QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV

Query:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
        ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
Subjt:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

A0A6J1HWE1 uncharacterized protein LOC1114684501.7e-12495.31Show/hide
Query:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN
        MAAQVS+LIRVLA   G GYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK HQMTN
Subjt:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV
        QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS   SPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAV

Query:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
        ENRNEA   AAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
Subjt:  ENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein8.2e-4744.44Show/hide
Query:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEP---TPLVTRDLLGQSSNLAA-------SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLN
        MAA VS+L+R+L+         +D+RT + +   P     L+TRDLLG    +         S ELDLD+QVP+GWEKRLDLKSGKVY+Q+     S  +
Subjt:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEP---TPLVTRDLLGQSSNLAA-------SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLN

Query:  SDSKHH----QMTNQTESKFQDLNFPP----SPSKRTLNLF---NETSLDLKLTSS------PSP----SPSPSTNY-ASVCTLDKVKSALERADKELVK
        S   HH      TNQT  +FQDLN PP     P+K  L+LF   ++TSL+LKL  S      P P    SP+ S +Y +SVCTLDKVK ALERA+K+  K
Subjt:  SDSKHH----QMTNQTESKFQDLNFPP----SPSKRTLNLF---NETSLDLKLTSS------PSP----SPSPSTNY-ASVCTLDKVKSALERADKELVK

Query:  KRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAVENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI
        ++S                       E++   +    A V A+ +A GC GCLSYV V KNNP+CPRC+S VPL +MKKP+IDLN+S+
Subjt:  KRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAVENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI

AT1G79160.1 unknown protein7.7e-4545.08Show/hide
Query:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNG-QEPTPLVTRDLLGQSSNLAA--SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQ
        MAA VS+L+R+L+     GY D+        G +    L+TRDLLG         S ELDLDLQVP+G+EKRLDLKSGKVY+QR  +  S   S   +  
Subjt:  MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNG-QEPTPLVTRDLLGQSSNLAA--SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQ

Query:  MTNQTESKFQDLNFPPSPSKRT--LNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQE
         TNQT   FQDLNFPP     +  LNLF++T+ +LKL  S   S   ++N  SVCTLDKVKSALERA+++                      A  K+ Q 
Subjt:  MTNQTESKFQDLNFPPSPSKRT--LNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQE

Query:  EEEAVENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLAS---MKKPRIDLNMSI
         ++ V +      +A+P+  GC GCLSYVLV  NNP+CPRC+++VPL +    KKP+IDLN+SI
Subjt:  EEEAVENRNEAAVLAAPMAVGCRGCLSYVLVTKNNPRCPRCNSVVPLAS---MKKPRIDLNMSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCCAAGTCAGCACTCTGATTCGGGTACTCGCCGGCGGCGGAGGCGGAGGCTACAACGACGAAGATAACCGGACGCCCCTCGGTAATGGCCAGGAACCAACGCC
TCTCGTTACTCGCGATTTGCTCGGCCAATCTTCCAACCTCGCCGCCTCTCAGGAATTGGACCTCGATTTGCAGGTTCCCTCCGGCTGGGAAAAAAGACTCGACTTGAAGT
CGGGGAAAGTTTACATACAGAGAAGTCAAACACCGGATTCTCCTCTGAATTCAGATTCGAAACACCACCAAATGACAAACCAAACAGAGTCCAAATTCCAGGACTTGAAT
TTCCCCCCATCTCCTTCAAAACGAACATTAAATCTCTTCAATGAAACCAGCTTGGATTTGAAACTGACGTCGTCGCCGTCTCCGTCACCGTCTCCGTCGACCAATTACGC
CAGTGTTTGCACTCTGGATAAGGTGAAATCCGCTCTGGAAAGGGCCGACAAGGAGCTCGTTAAAAAAAGGTCTTCGCTTTGGAAATCGACGTCGTCGCCGTCGTACTCGT
CATCGTCGTCTGCGGCGGCGAAGGAAATTCAAGAAGAAGAAGAAGCGGTGGAAAACAGAAACGAGGCGGCAGTTTTGGCGGCGCCGATGGCGGTGGGTTGCCGTGGATGT
TTATCTTATGTATTGGTAACGAAAAACAACCCGAGATGTCCTCGTTGCAATTCTGTTGTTCCGTTGGCGAGCATGAAGAAACCTCGGATTGATCTAAACATGTCTATATA
A
mRNA sequenceShow/hide mRNA sequence
CCGCACAAGTTACAACCCCCCAAACCACTCGCCCACCCACCCACCTCTCTGCTTCCTCTCTCTCTCTCTCTATCCTTCTCGTTTTTATCAGAACAAAAAACAAAACCCCT
TCACAGGCCCTTCCGTTTCGTTTCCCTATAAAGCTCCAGAAAATGGCTGCCCAAGTCAGCACTCTGATTCGGGTACTCGCCGGCGGCGGAGGCGGAGGCTACAACGACGA
AGATAACCGGACGCCCCTCGGTAATGGCCAGGAACCAACGCCTCTCGTTACTCGCGATTTGCTCGGCCAATCTTCCAACCTCGCCGCCTCTCAGGAATTGGACCTCGATT
TGCAGGTTCCCTCCGGCTGGGAAAAAAGACTCGACTTGAAGTCGGGGAAAGTTTACATACAGAGAAGTCAAACACCGGATTCTCCTCTGAATTCAGATTCGAAACACCAC
CAAATGACAAACCAAACAGAGTCCAAATTCCAGGACTTGAATTTCCCCCCATCTCCTTCAAAACGAACATTAAATCTCTTCAATGAAACCAGCTTGGATTTGAAACTGAC
GTCGTCGCCGTCTCCGTCACCGTCTCCGTCGACCAATTACGCCAGTGTTTGCACTCTGGATAAGGTGAAATCCGCTCTGGAAAGGGCCGACAAGGAGCTCGTTAAAAAAA
GGTCTTCGCTTTGGAAATCGACGTCGTCGCCGTCGTACTCGTCATCGTCGTCTGCGGCGGCGAAGGAAATTCAAGAAGAAGAAGAAGCGGTGGAAAACAGAAACGAGGCG
GCAGTTTTGGCGGCGCCGATGGCGGTGGGTTGCCGTGGATGTTTATCTTATGTATTGGTAACGAAAAACAACCCGAGATGTCCTCGTTGCAATTCTGTTGTTCCGTTGGC
GAGCATGAAGAAACCTCGGATTGATCTAAACATGTCTATATAAAATATCCAAAGGGAAAATTAAGAAATTTCCCCCATGGCCCTCTCCCTGGATCTTGTGTTTTGGGTAT
ATGTAAAATGGACATATTCACACAAAATTATGGTTTCCATTATTCCATTCTCAATTCCTCCTCTGATGTCAAATTATTCAAA
Protein sequenceShow/hide protein sequence
MAAQVSTLIRVLAGGGGGGYNDEDNRTPLGNGQEPTPLVTRDLLGQSSNLAASQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKHHQMTNQTESKFQDLN
FPPSPSKRTLNLFNETSLDLKLTSSPSPSPSPSTNYASVCTLDKVKSALERADKELVKKRSSLWKSTSSPSYSSSSSAAAKEIQEEEEAVENRNEAAVLAAPMAVGCRGC
LSYVLVTKNNPRCPRCNSVVPLASMKKPRIDLNMSI