CuGenDBv2

PI0020590 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0020590
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: flocculation protein FLO11-like
Genome location: chr01:17786789..17792553
RNA-Seq Expression: PI0020590
Synteny: PI0020590
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type
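
The genome location above uses a `chromosome:start..end` layout. As a minimal sketch, the snippet below splits that string and computes the span of the gene model. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr01:17786789..17792553")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end - start + 1
print(chrom, start, end, span_bp)  # chr01 17786789 17792553 5765
```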


Homology
GenBank top hits | e value | % identity
KAA0036856.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 1.4e-41 | 38.79
Query:  DENFTDPPPDSVSDPA---PQSTPTKVKSKTKR-------VSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTL
        D N   P  D+V   A   P S PT++K   K+       ++T++GR+++PL++ SVPIDGISFH E ++H+WK+V+QRRIA E ++SD HHSC+++M L
Subjt:  DENFTDPPPDSVSDPA---PQSTPTKVKSKTKR-------VSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTL

Query:  ISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLG-ITLPSDSPSLPTSDDLASE------------------
        I +A+L  T+S+V P+YP+LIRE  VN+PT+ ++ SSPDY T+HIR   F+ SP  +N F+G +   + SPS  ++D LASE                  
Subjt:  ISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLG-ITLPSDSPSLPTSDDLASE------------------

Query:  ---------------------------------------------ITGDEAHGPSPTKIQLSYRLFQGAHVPDIDHSFHPS
                                                     +T  +A GP P  + LSYRLFQG+HVPDIDH  HPS
Subjt:  ---------------------------------------------ITGDEAHGPSPTKIQLSYRLFQGAHVPDIDHSFHPS

KAA0056211.1 uncharacterized protein E6C27_scaffold85G00030 [Cucumis melo var. makuwa] | 1.5e-40 | 33.17
Query:  SSDHHQSSASTPTLDENFTDPPPDSVSDP-----APQS--------TPTKVKSKT----KRVSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQR
        SS  H S +   T   + T+ PP+ +  P     AP S        +P   K KT    + V+T++ R+++P +V SVPIDGISFHHE  V  WK+V+Q+
Subjt:  SSDHHQSSASTPTLDENFTDPPPDSVSDP-----APQS--------TPTKVKSKT----KRVSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQR

Query:  RIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDL
        RI  E ++SD H SC+++M LI +A L  T+S+VGP+YP+LIRE  VN+P  F+N SS DY T+HI    F  S   ++ FLG T+  D SPS  T++ L
Subjt:  RIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDL

Query:  ASEITG--------------------------------DEAHG--------------------------------------------------PSPTKIQ
        A+ ++G                                  +H                                                   P P  I 
Subjt:  ASEITG--------------------------------DEAHG--------------------------------------------------PSPTKIQ

Query:  LSYRLFQGAHVPDIDHSFHPSSSIRP-NYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSNPDAPAPQPPS
        LSYRLFQG+HVPDIDH  HP+   R  +  +     +G      LA ++++ LT+ESR+++  I  L++RR E+D++I  ++   SS P     QPPS
Subjt:  LSYRLFQGAHVPDIDHSFHPSSSIRP-NYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSNPDAPAPQPPS

KAA0066044.1 uncharacterized protein E6C27_scaffold21G00170 [Cucumis melo var. makuwa] | 1.1e-46 | 36.6
Query:  DENFTDPPPDSVSDPAPQST---PTKVKSKTKR-------VSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTL
        D N   P  D+V   APQ T   PT+ K   K+       ++T+ GR+++PL++ SVPI+GISFH E +V +WK+V+QRRIA E ++SD HHSC+++M L
Subjt:  DENFTDPPPDSVSDPAPQST---PTKVKSKTKR-------VSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTL

Query:  ISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDLASEITG---------------
        I +A L  T+S+VGP+YP+LIRE  VN+P  F++ SSP+Y T+HI+   F+ SP  +N FLG  +  + SPS P +D LA  ++G               
Subjt:  ISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDLASEITG---------------

Query:  ---------------------------------------------------------DEAHGPSPTKIQLSYRLFQGAHVPDIDHSFHPSSSIRP-NYDN
                                                                  +A G  P  + LSYRLFQ +HVPDIDH  HPS   R  +  +
Subjt:  ---------------------------------------------------------DEAHGPSPTKIQLSYRLFQGAHVPDIDHSFHPSSSIRP-NYDN

Query:  LRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVI
             DG      LA+++L+ LT++SRS+ST I  +++RR EIDS+I
Subjt:  LRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVI

XP_008454855.1 PREDICTED: uncharacterized protein LOC103495162 [Cucumis melo] | 1.8e-41 | 33.92
Query:  RMSKSVRMLNSSTKDLNHILSSDHHQSSAST-----PT-----LDENFTDPPPDSVSDPAPQSTP---TKVKSKTKR-------VSTQSGRRQVPLHVNS
        R+ K V   N+  K++N  + SD  Q S+S+     PT     +D N      DSV   A Q TP   T+ K   K+       ++T+ GR+++PL++ S
Subjt:  RMSKSVRMLNSSTKDLNHILSSDHHQSSAST-----PT-----LDENFTDPPPDSVSDPAPQSTP---TKVKSKTKR-------VSTQSGRRQVPLHVNS

Query:  VPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHS
        VPIDGI FH E +V  WK+++QRRIA E ++SD HHSC++++ LI++A L  T+S+VGP+YP+LIRE  VN+PT F++SSSPDY T+HIR   FS +   
Subjt:  VPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHS

Query:  LNTFLGITLPSD-SPSLPTSDDLASEITG----------------------------------DEAHGPSPTKIQLSYRLFQG-----------------
        +N FLG  +  + +PS P++D LASE++G                                    A   S       YR+                    
Subjt:  LNTFLGITLPSD-SPSLPTSDDLASEITG----------------------------------DEAHGPSPTKIQLSYRLFQG-----------------

Query:  ------------AHVPDIDHSFHPSSSIR----PNYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSN
                    +HVPDIDH  HP+   R     ++D +    DG      LA+++L+ L +ESRS++T I  +++RR +IDS+I  ++    S+
Subjt:  ------------AHVPDIDHSFHPSSSIR----PNYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSN

XP_008465030.1 PREDICTED: uncharacterized protein LOC103502746 [Cucumis melo] | 5.7e-43 | 35.01
Query:  SSDHHQSSASTPTLDENFTDPPPDSVSDP-----APQS--------TPTKVKSKT----KRVSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQR
        SS  H S +   T   + T+ PP+ +  P     AP S        +P   K KT    + V+T++ R+++P +V SVPIDGISFHHE  V  WK+V+Q+
Subjt:  SSDHHQSSASTPTLDENFTDPPPDSVSDP-----APQS--------TPTKVKSKT----KRVSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQR

Query:  RIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDL
        RI  E ++SD H SC+++M LI +A L  T+S+VGP+YP+LIRE  VN+P  F+N SS DY T+HI    F  S   ++ FLG T+  D SPS  T++ L
Subjt:  RIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDL

Query:  ASEITG--------------------------------DEAHG-----------------------------PSPTKIQLSYRLFQGAHVPDIDHSFHPS
        A+ ++G                                  +H                              P P  I LSYRLFQG+HVPDIDH  HP+
Subjt:  ASEITG--------------------------------DEAHG-----------------------------PSPTKIQLSYRLFQGAHVPDIDHSFHPS

Query:  SSIRP-NYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSNPDAPAPQPPS
           R  +  +     +G      LA ++++ LT+ESR+++  I  L++RR E+D++I  ++   SS P     QPPS
Subjt:  SSIRP-NYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSNPDAPAPQPPS

TrEMBL top hits | e value | % identity
A0A1S3BZ31 uncharacterized protein LOC103495162 | 8.8e-42 | 33.92
Query:  RMSKSVRMLNSSTKDLNHILSSDHHQSSAST-----PT-----LDENFTDPPPDSVSDPAPQSTP---TKVKSKTKR-------VSTQSGRRQVPLHVNS
        R+ K V   N+  K++N  + SD  Q S+S+     PT     +D N      DSV   A Q TP   T+ K   K+       ++T+ GR+++PL++ S
Subjt:  RMSKSVRMLNSSTKDLNHILSSDHHQSSAST-----PT-----LDENFTDPPPDSVSDPAPQSTP---TKVKSKTKR-------VSTQSGRRQVPLHVNS

Query:  VPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHS
        VPIDGI FH E +V  WK+++QRRIA E ++SD HHSC++++ LI++A L  T+S+VGP+YP+LIRE  VN+PT F++SSSPDY T+HIR   FS +   
Subjt:  VPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHS

Query:  LNTFLGITLPSD-SPSLPTSDDLASEITG----------------------------------DEAHGPSPTKIQLSYRLFQG-----------------
        +N FLG  +  + +PS P++D LASE++G                                    A   S       YR+                    
Subjt:  LNTFLGITLPSD-SPSLPTSDDLASEITG----------------------------------DEAHGPSPTKIQLSYRLFQG-----------------

Query:  ------------AHVPDIDHSFHPSSSIR----PNYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSN
                    +HVPDIDH  HP+   R     ++D +    DG      LA+++L+ L +ESRS++T I  +++RR +IDS+I  ++    S+
Subjt:  ------------AHVPDIDHSFHPSSSIR----PNYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSN

A0A1S3CMY0 uncharacterized protein LOC103502746 | 2.7e-43 | 35.01
Query:  SSDHHQSSASTPTLDENFTDPPPDSVSDP-----APQS--------TPTKVKSKT----KRVSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQR
        SS  H S +   T   + T+ PP+ +  P     AP S        +P   K KT    + V+T++ R+++P +V SVPIDGISFHHE  V  WK+V+Q+
Subjt:  SSDHHQSSASTPTLDENFTDPPPDSVSDP-----APQS--------TPTKVKSKT----KRVSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQR

Query:  RIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDL
        RI  E ++SD H SC+++M LI +A L  T+S+VGP+YP+LIRE  VN+P  F+N SS DY T+HI    F  S   ++ FLG T+  D SPS  T++ L
Subjt:  RIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDL

Query:  ASEITG--------------------------------DEAHG-----------------------------PSPTKIQLSYRLFQGAHVPDIDHSFHPS
        A+ ++G                                  +H                              P P  I LSYRLFQG+HVPDIDH  HP+
Subjt:  ASEITG--------------------------------DEAHG-----------------------------PSPTKIQLSYRLFQGAHVPDIDHSFHPS

Query:  SSIRP-NYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSNPDAPAPQPPS
           R  +  +     +G      LA ++++ LT+ESR+++  I  L++RR E+D++I  ++   SS P     QPPS
Subjt:  SSIRP-NYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSNPDAPAPQPPS

A0A5A7UQ31 Uncharacterized protein | 8.8e-42 | 33.92
Query:  RMSKSVRMLNSSTKDLNHILSSDHHQSSAST-----PT-----LDENFTDPPPDSVSDPAPQSTP---TKVKSKTKR-------VSTQSGRRQVPLHVNS
        R+ K V   N+  K++N  + SD  Q S+S+     PT     +D N      DSV   A Q TP   T+ K   K+       ++T+ GR+++PL++ S
Subjt:  RMSKSVRMLNSSTKDLNHILSSDHHQSSAST-----PT-----LDENFTDPPPDSVSDPAPQSTP---TKVKSKTKR-------VSTQSGRRQVPLHVNS

Query:  VPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHS
        VPIDGI FH E +V  WK+++QRRIA E ++SD HHSC++++ LI++A L  T+S+VGP+YP+LIRE  VN+PT F++SSSPDY T+HIR   FS +   
Subjt:  VPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHS

Query:  LNTFLGITLPSD-SPSLPTSDDLASEITG----------------------------------DEAHGPSPTKIQLSYRLFQG-----------------
        +N FLG  +  + +PS P++D LASE++G                                    A   S       YR+                    
Subjt:  LNTFLGITLPSD-SPSLPTSDDLASEITG----------------------------------DEAHGPSPTKIQLSYRLFQG-----------------

Query:  ------------AHVPDIDHSFHPSSSIR----PNYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSN
                    +HVPDIDH  HP+   R     ++D +    DG      LA+++L+ L +ESRS++T I  +++RR +IDS+I  ++    S+
Subjt:  ------------AHVPDIDHSFHPSSSIR----PNYDNLRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVITSVQLLISSN

A0A5A7VFF7 Uncharacterized protein | 5.3e-47 | 36.6
Query:  DENFTDPPPDSVSDPAPQST---PTKVKSKTKR-------VSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTL
        D N   P  D+V   APQ T   PT+ K   K+       ++T+ GR+++PL++ SVPI+GISFH E +V +WK+V+QRRIA E ++SD HHSC+++M L
Subjt:  DENFTDPPPDSVSDPAPQST---PTKVKSKTKR-------VSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTL

Query:  ISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDLASEITG---------------
        I +A L  T+S+VGP+YP+LIRE  VN+P  F++ SSP+Y T+HI+   F+ SP  +N FLG  +  + SPS P +D LA  ++G               
Subjt:  ISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLGITLPSD-SPSLPTSDDLASEITG---------------

Query:  ---------------------------------------------------------DEAHGPSPTKIQLSYRLFQGAHVPDIDHSFHPSSSIRP-NYDN
                                                                  +A G  P  + LSYRLFQ +HVPDIDH  HPS   R  +  +
Subjt:  ---------------------------------------------------------DEAHGPSPTKIQLSYRLFQGAHVPDIDHSFHPSSSIRP-NYDN

Query:  LRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVI
             DG      LA+++L+ LT++SRS+ST I  +++RR EIDS+I
Subjt:  LRSPIDGLLFPPSLANQVLHVLTSESRSISTLIHDLTDRRNEIDSVI

A0A5D3B7L1 Gag-pol polyprotein | 6.7e-42 | 38.79
Query:  DENFTDPPPDSVSDPA---PQSTPTKVKSKTKR-------VSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTL
        D N   P  D+V   A   P S PT++K   K+       ++T++GR+++PL++ SVPIDGISFH E ++H+WK+V+QRRIA E ++SD HHSC+++M L
Subjt:  DENFTDPPPDSVSDPA---PQSTPTKVKSKTKR-------VSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTL

Query:  ISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLG-ITLPSDSPSLPTSDDLASE------------------
        I +A+L  T+S+V P+YP+LIRE  VN+PT+ ++ SSPDY T+HIR   F+ SP  +N F+G +   + SPS  ++D LASE                  
Subjt:  ISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHIRRLPFSFSPHSLNTFLG-ITLPSDSPSLPTSDDLASE------------------

Query:  ---------------------------------------------ITGDEAHGPSPTKIQLSYRLFQGAHVPDIDHSFHPS
                                                     +T  +A GP P  + LSYRLFQG+HVPDIDH  HPS
Subjt:  ---------------------------------------------ITGDEAHGPSPTKIQLSYRLFQGAHVPDIDHSFHPS

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAACTATGGATCCACTTCAAATTGGGATAATCAACATAAATTTAGAAATGGTAACTCTTTTCCGAAAAGAAAAGATGACAGAAATTATGCTTTTAAAAAGGATGAAGA
TTTCATATGCAGAGAATGTGGTGGTAAAGGTCATTACCAATCAGAATGTGCTACCTATTTGAGAAGACAGCGCAAAGGATTCTCAGCAACTCTGTCTGATGAATCAGAAT
CAAGTCATGATTCTGAGGATGAGGTACGTGCACTAGTTGCTTATTCTAATTTACAAACTAATAATTTTTCGACAGGAGATGATCAATCCTCTAGTCTTCCTGGAGGAGAT
CTTGAAGATATTATGGAGAAATGGCAGGAAGATCTCAAAGTTATTGAACAGCAGAAAGAAAGAATTTTAGAACTAGTTGAAGACAACCATAGACTGTTGCAAACTATCTC
AGATATCAAGAAGGAGTTAAGAATTGCAAAAAATGAAAATGATAGAATGTCAAAATCTGTTCGCATGCTGAACTCAAGTACCAAAGATTTAAATCACATTTTGAGTTCAG
ACCATCACCAGTCTTCTGCTTCCACTCCCACTCTTGATGAAAATTTCACTGATCCTCCTCCTGATTCGGTTTCGGATCCTGCTCCTCAATCTACTCCAACCAAAGTTAAA
TCCAAAACTAAAAGAGTGTCTACCCAGTCTGGTCGAAGACAGGTTCCTTTGCATGTGAATTCTGTTCCCATAGATGGCATCTCCTTTCATCACGAGTCTCATGTCCATAA
ATGGAAATACGTGATTCAGAGACGTATTGCTGCAGAGTCAGATGTTTCTGATGATCATCATTCCTGCTTGGCTGTTATGACTCTAATTTCTCAAGCCAAACTTCTGACCA
CGGTCTCTAATGTTGGCCCTTACTATCCCAAGCTGATTAGAGAAATGTTTGTCAACATTCCAACGTCTTTTGACAATTCTAGCAGTCCTGATTATCATACCATCCATATC
AGAAGATTGCCCTTCTCCTTTTCCCCTCATTCCTTGAATACCTTTCTTGGTATTACTCTTCCTTCTGATTCTCCCTCTCTTCCTACCTCTGATGATCTTGCCTCTGAGAT
AACAGGAGATGAAGCTCATGGCCCTTCTCCTACAAAAATTCAATTAAGTTATCGCCTCTTTCAAGGAGCTCATGTTCCTGACATTGACCATTCTTTTCATCCTTCATCTT
CCATTCGCCCAAACTATGACAATTTGAGATCTCCTATTGATGGTTTGTTATTTCCTCCCTCTTTAGCCAATCAAGTCCTTCATGTTTTGACCTCTGAGTCTCGCAGCATC
AGCACTCTCATCCATGACCTCACAGACAGAAGAAACGAGATTGACTCTGTTATCACTTCTGTTCAGTTATTGATTTCTTCAAATCCTGATGCTCCTGCTCCTCAACCTCC
ATCTTAA
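
As a quick consistency check between the CDS above and the protein sequence listed on this page, the sketch below translates the first and last codons of the CDS with the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative, and the use of the standard nuclear code is an assumption that this page does not state explicitly.

```python
# Standard genetic code in the conventional TCAG ordering
# (first base, then second base, then third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS shown above.
cds_start = "ATGAACTATGGATCCACTTCAAATTGGGAT"
cds_end = "CCTCCATCTTAA"

print(translate(cds_start))  # MNYGSTSNWD
print(translate(cds_end))    # PPS*
```

Both fragments agree with the start (`MNYGSTSNWD`) and end (`...PPS`) of the protein sequence on this page; the same function can be applied to the full CDS once its lines are joined.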
mRNA sequence
ATGAACTATGGATCCACTTCAAATTGGGATAATCAACATAAATTTAGAAATGGTAACTCTTTTCCGAAAAGAAAAGATGACAGAAATTATGCTTTTAAAAAGGATGAAGA
TTTCATATGCAGAGAATGTGGTGGTAAAGGTCATTACCAATCAGAATGTGCTACCTATTTGAGAAGACAGCGCAAAGGATTCTCAGCAACTCTGTCTGATGAATCAGAAT
CAAGTCATGATTCTGAGGATGAGGTACGTGCACTAGTTGCTTATTCTAATTTACAAACTAATAATTTTTCGACAGGAGATGATCAATCCTCTAGTCTTCCTGGAGGAGAT
CTTGAAGATATTATGGAGAAATGGCAGGAAGATCTCAAAGTTATTGAACAGCAGAAAGAAAGAATTTTAGAACTAGTTGAAGACAACCATAGACTGTTGCAAACTATCTC
AGATATCAAGAAGGAGTTAAGAATTGCAAAAAATGAAAATGATAGAATGTCAAAATCTGTTCGCATGCTGAACTCAAGTACCAAAGATTTAAATCACATTTTGAGTTCAG
ACCATCACCAGTCTTCTGCTTCCACTCCCACTCTTGATGAAAATTTCACTGATCCTCCTCCTGATTCGGTTTCGGATCCTGCTCCTCAATCTACTCCAACCAAAGTTAAA
TCCAAAACTAAAAGAGTGTCTACCCAGTCTGGTCGAAGACAGGTTCCTTTGCATGTGAATTCTGTTCCCATAGATGGCATCTCCTTTCATCACGAGTCTCATGTCCATAA
ATGGAAATACGTGATTCAGAGACGTATTGCTGCAGAGTCAGATGTTTCTGATGATCATCATTCCTGCTTGGCTGTTATGACTCTAATTTCTCAAGCCAAACTTCTGACCA
CGGTCTCTAATGTTGGCCCTTACTATCCCAAGCTGATTAGAGAAATGTTTGTCAACATTCCAACGTCTTTTGACAATTCTAGCAGTCCTGATTATCATACCATCCATATC
AGAAGATTGCCCTTCTCCTTTTCCCCTCATTCCTTGAATACCTTTCTTGGTATTACTCTTCCTTCTGATTCTCCCTCTCTTCCTACCTCTGATGATCTTGCCTCTGAGAT
AACAGGAGATGAAGCTCATGGCCCTTCTCCTACAAAAATTCAATTAAGTTATCGCCTCTTTCAAGGAGCTCATGTTCCTGACATTGACCATTCTTTTCATCCTTCATCTT
CCATTCGCCCAAACTATGACAATTTGAGATCTCCTATTGATGGTTTGTTATTTCCTCCCTCTTTAGCCAATCAAGTCCTTCATGTTTTGACCTCTGAGTCTCGCAGCATC
AGCACTCTCATCCATGACCTCACAGACAGAAGAAACGAGATTGACTCTGTTATCACTTCTGTTCAGTTATTGATTTCTTCAAATCCTGATGCTCCTGCTCCTCAACCTCC
ATCTTAA
Protein sequence
MNYGSTSNWDNQHKFRNGNSFPKRKDDRNYAFKKDEDFICRECGGKGHYQSECATYLRRQRKGFSATLSDESESSHDSEDEVRALVAYSNLQTNNFSTGDDQSSSLPGGD
LEDIMEKWQEDLKVIEQQKERILELVEDNHRLLQTISDIKKELRIAKNENDRMSKSVRMLNSSTKDLNHILSSDHHQSSASTPTLDENFTDPPPDSVSDPAPQSTPTKVK
SKTKRVSTQSGRRQVPLHVNSVPIDGISFHHESHVHKWKYVIQRRIAAESDVSDDHHSCLAVMTLISQAKLLTTVSNVGPYYPKLIREMFVNIPTSFDNSSSPDYHTIHI
RRLPFSFSPHSLNTFLGITLPSDSPSLPTSDDLASEITGDEAHGPSPTKIQLSYRLFQGAHVPDIDHSFHPSSSIRPNYDNLRSPIDGLLFPPSLANQVLHVLTSESRSI
STLIHDLTDRRNEIDSVITSVQLLISSNPDAPAPQPPS
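
The InterPro annotation on this page (IPR001878, Zinc finger, CCHC-type) can be loosely cross-checked against the protein sequence. CCHC-type zinc fingers are commonly described by the spacing C-X2-C-X4-H-X4-C, and the sketch below searches for that spacing with a plain regular expression. This is only an illustration under that assumption: it is not how InterPro assigns domains, and the variable names are hypothetical.

```python
import re

# C-X2-C-X4-H-X4-C spacing, written as a regular expression.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

# First 60 residues of the protein sequence shown above.
protein_start = "MNYGSTSNWDNQHKFRNGNSFPKRKDDRNYAFKKDEDFICRECGGKGHYQSECATYLRRQ"

match = CCHC_PATTERN.search(protein_start)
if match:
    # Report a 1-based start position alongside the matched residues.
    print(match.start() + 1, match.group())  # 40 CRECGGKGHYQSEC
else:
    print("no CCHC-like spacing found")
```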