; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g32100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g32100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:24166444..24171663
RNA-Seq ExpressionMoc06g32100
SyntenyMoc06g32100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145827.1 uncharacterized protein LOC111015187 [Momordica charantia]5.3e-4741.72Show/hide
Query:  DKAEVWREPKGLRQKSKEDPPKVDPAKERRGRDSSKR---SPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRR
        D  E+ R   G R K + D  K+  ++E+R  DS  +   S  S  R ++  R E    R RP  Y+R+TP  + + EIL NI+ S ++ LL  P  +R 
Subjt:  DKAEVWREPKGLRQKSKEDPPKVDPAKERRGRDSSKR---SPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRR

Query:  DPEKRNRSKFCRFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKAL
        DPEKRN+ K+CRFH+DHSH+T+NC+ELKRQIE  IQ GYFKK V        K   +S E   +R+ +++P +R DRP VIN IF GPSG Q   K K L
Subjt:  DPEKRNRSKFCRFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKAL

Query:  KRSHKRVLETQEDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAIL
            +R +    + K    ++F    L+      ++LP +  DA    + P     + R      GCI LP+T+ + +TQ+++M EFVV+++RSAYNAI 
Subjt:  KRSHKRVLETQEDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAIL

Query:  GRPVIHDLEVVPSTLHQMMKYPTPKG
        GRP+IH    VPSTLHQ++KY TP G
Subjt:  GRPVIHDLEVVPSTLHQMMKYPTPKG

XP_022154405.1 uncharacterized protein LOC111021682 [Momordica charantia]2.4e-4737.17Show/hide
Query:  LRQKSKEDPPKVD---PAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFC
        +R K+     ++D   P++E+R  DS  +             R  D    R   Y+ +TP  + + EIL NI+ S ++ LL  P  +RRD EKRN+ K+C
Subjt:  LRQKSKEDPPKVD---PAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFC

Query:  RFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQ
         FH+DH H+TSN +ELKRQIE  IQ GYFKK V        K   NS E + +++R+++  +R DRPT+IN+IFGGP+G Q R K K L R  KR +   
Subjt:  RFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQ

Query:  EDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVV
         + K    + F    L+      ++LP +  DA    + P     + R      GCI LP+T+ +  TQ+++M EFVV++ RSAYNAI GRP+IH   VV
Subjt:  EDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVV

Query:  PSTLHQMMKYPTPKGGDPELQPDKSLNE-----------------------------PNRGE-----PVEDLELVPLLGPEK
        PST+HQ++KY TP G        K+L E                             P  G+     P E+LELV LL PE+
Subjt:  PSTLHQMMKYPTPKGGDPELQPDKSLNE-----------------------------PNRGE-----PVEDLELVPLLGPEK

XP_022154876.1 uncharacterized protein LOC111022030 [Momordica charantia]1.0e-4538.14Show/hide
Query:  KVDPAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFCRFHKDHSHDTSNC
        K DP  + +G  SS R+           R E    + RP  Y+RFTP  + + EIL NI+ S ++ LL  P  +R  PE+R++ K+CRFH++H H+TS+C
Subjt:  KVDPAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFCRFHKDHSHDTSNC

Query:  YELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQEDSKEDSVVVFVG
        +ELKRQ+E   Q GYFKK VG       K   NS E + +R+R+++P +R DRP VIN IFGGPSG Q  +K K L R+ +  +    + +    + F  
Subjt:  YELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQEDSKEDSVVVFVG

Query:  KSLKK----------------------------RSSKFLYLPC-SGIDARPAQVKPHPTGRI-CRGAGQT-WGCISLPITLEEGSTQISRMTEFVVVEAR
          L++                             S+  L LP    +    +Q+K  PT  +   G   T  GCI L +T  +  TQ+++M EFVV++ R
Subjt:  KSLKK----------------------------RSSKFLYLPC-SGIDARPAQVKPHPTGRI-CRGAGQT-WGCISLPITLEEGSTQISRMTEFVVVEAR

Query:  SAYNAILGRPVIHDLEVVPSTLHQMMKYPTPKG
        SAYNAI GRP+IH    VPSTLHQ++KY TP G
Subjt:  SAYNAILGRPVIHDLEVVPSTLHQMMKYPTPKG

XP_022157474.1 uncharacterized protein LOC111024166 [Momordica charantia]2.6e-4638.53Show/hide
Query:  KAEVWREPKGLRQKSKEDPPKVDPAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEK
        + E W + K L Q++K   P V   K +    SS  S   ++R D  P       R RP  Y+R+TP  + +  IL NI+ + ++ LL  P  +R D EK
Subjt:  KAEVWREPKGLRQKSKEDPPKVDPAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEK

Query:  RNRSKFCRFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSH
         N+ K+CRFH+DH H+TS+C+ELKRQIE  IQ  YFKK VG       K  +N  E + +R+R+++P +  DRP VIN IFGGPSG Q   K K L R  
Subjt:  RNRSKFCRFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSH

Query:  KRVLETQEDSKEDSVVVFVGKSLK----------------------------KRSSKFLYLPC-SGIDARPAQVKPHPTGRI--CRGAGQTWGCISLPIT
        +R +    + K    + F    L+                              S+  L LP    +    +Q+K  PT  +   R +    GCI LPIT
Subjt:  KRVLETQEDSKEDSVVVFVGKSLK----------------------------KRSSKFLYLPC-SGIDARPAQVKPHPTGRI--CRGAGQTWGCISLPIT

Query:  LEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVVPSTLHQMMKYPTPKG
        + + STQ+++M EFVV++ RSAYNAI GRP+IH    VPSTLHQ++KY TP G
Subjt:  LEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVVPSTLHQMMKYPTPKG

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]4.8e-4836.81Show/hide
Query:  LRQKSKEDPPKVDPAK---ERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFC
        LR K+     ++D  K   E+R  DS  R   S         R  +    R   Y+R+T   + + EIL NI+ S ++ LL  P  +R D EKRN+ K+C
Subjt:  LRQKSKEDPPKVDPAK---ERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFC

Query:  RFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQ
        RFH+DH H+T++C+ELKRQIE  IQ GYFKK VG       K  +NS E + +R+R+++P +R DRP VIN IFGGP+G Q   K K L R  +R +   
Subjt:  RFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQ

Query:  EDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVV
         + K    + F    L+      ++LP +      + +      R+        GCI LP+T+ + +TQ+++M EFVV++ RSAYNAI GRP+IH    V
Subjt:  EDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVV

Query:  PSTLHQMMKYPTPKGGDPELQPDKSLNE-----------------PNRGE------------------PVEDLELVPLLGPEK
        PSTLHQ++KY TP          K+  E                  NRG+                  P E+LELVPLL PE+
Subjt:  PSTLHQMMKYPTPKGGDPELQPDKSLNE-----------------PNRGE------------------PVEDLELVPLLGPEK

TrEMBL top hitse value%identityAlignment
A0A6J1CXU1 uncharacterized protein LOC1110151872.6e-4741.72Show/hide
Query:  DKAEVWREPKGLRQKSKEDPPKVDPAKERRGRDSSKR---SPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRR
        D  E+ R   G R K + D  K+  ++E+R  DS  +   S  S  R ++  R E    R RP  Y+R+TP  + + EIL NI+ S ++ LL  P  +R 
Subjt:  DKAEVWREPKGLRQKSKEDPPKVDPAKERRGRDSSKR---SPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRR

Query:  DPEKRNRSKFCRFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKAL
        DPEKRN+ K+CRFH+DHSH+T+NC+ELKRQIE  IQ GYFKK V        K   +S E   +R+ +++P +R DRP VIN IF GPSG Q   K K L
Subjt:  DPEKRNRSKFCRFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKAL

Query:  KRSHKRVLETQEDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAIL
            +R +    + K    ++F    L+      ++LP +  DA    + P     + R      GCI LP+T+ + +TQ+++M EFVV+++RSAYNAI 
Subjt:  KRSHKRVLETQEDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAIL

Query:  GRPVIHDLEVVPSTLHQMMKYPTPKG
        GRP+IH    VPSTLHQ++KY TP G
Subjt:  GRPVIHDLEVVPSTLHQMMKYPTPKG

A0A6J1DJI4 uncharacterized protein LOC1110216821.2e-4737.17Show/hide
Query:  LRQKSKEDPPKVD---PAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFC
        +R K+     ++D   P++E+R  DS  +             R  D    R   Y+ +TP  + + EIL NI+ S ++ LL  P  +RRD EKRN+ K+C
Subjt:  LRQKSKEDPPKVD---PAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFC

Query:  RFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQ
         FH+DH H+TSN +ELKRQIE  IQ GYFKK V        K   NS E + +++R+++  +R DRPT+IN+IFGGP+G Q R K K L R  KR +   
Subjt:  RFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQ

Query:  EDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVV
         + K    + F    L+      ++LP +  DA    + P     + R      GCI LP+T+ +  TQ+++M EFVV++ RSAYNAI GRP+IH   VV
Subjt:  EDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVV

Query:  PSTLHQMMKYPTPKGGDPELQPDKSLNE-----------------------------PNRGE-----PVEDLELVPLLGPEK
        PST+HQ++KY TP G        K+L E                             P  G+     P E+LELV LL PE+
Subjt:  PSTLHQMMKYPTPKGGDPELQPDKSLNE-----------------------------PNRGE-----PVEDLELVPLLGPEK

A0A6J1DQ11 uncharacterized protein LOC1110220304.8e-4638.14Show/hide
Query:  KVDPAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFCRFHKDHSHDTSNC
        K DP  + +G  SS R+           R E    + RP  Y+RFTP  + + EIL NI+ S ++ LL  P  +R  PE+R++ K+CRFH++H H+TS+C
Subjt:  KVDPAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFCRFHKDHSHDTSNC

Query:  YELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQEDSKEDSVVVFVG
        +ELKRQ+E   Q GYFKK VG       K   NS E + +R+R+++P +R DRP VIN IFGGPSG Q  +K K L R+ +  +    + +    + F  
Subjt:  YELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQEDSKEDSVVVFVG

Query:  KSLKK----------------------------RSSKFLYLPC-SGIDARPAQVKPHPTGRI-CRGAGQT-WGCISLPITLEEGSTQISRMTEFVVVEAR
          L++                             S+  L LP    +    +Q+K  PT  +   G   T  GCI L +T  +  TQ+++M EFVV++ R
Subjt:  KSLKK----------------------------RSSKFLYLPC-SGIDARPAQVKPHPTGRI-CRGAGQT-WGCISLPITLEEGSTQISRMTEFVVVEAR

Query:  SAYNAILGRPVIHDLEVVPSTLHQMMKYPTPKG
        SAYNAI GRP+IH    VPSTLHQ++KY TP G
Subjt:  SAYNAILGRPVIHDLEVVPSTLHQMMKYPTPKG

A0A6J1DWK7 uncharacterized protein LOC1110241661.3e-4638.53Show/hide
Query:  KAEVWREPKGLRQKSKEDPPKVDPAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEK
        + E W + K L Q++K   P V   K +    SS  S   ++R D  P       R RP  Y+R+TP  + +  IL NI+ + ++ LL  P  +R D EK
Subjt:  KAEVWREPKGLRQKSKEDPPKVDPAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEK

Query:  RNRSKFCRFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSH
         N+ K+CRFH+DH H+TS+C+ELKRQIE  IQ  YFKK VG       K  +N  E + +R+R+++P +  DRP VIN IFGGPSG Q   K K L R  
Subjt:  RNRSKFCRFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSH

Query:  KRVLETQEDSKEDSVVVFVGKSLK----------------------------KRSSKFLYLPC-SGIDARPAQVKPHPTGRI--CRGAGQTWGCISLPIT
        +R +    + K    + F    L+                              S+  L LP    +    +Q+K  PT  +   R +    GCI LPIT
Subjt:  KRVLETQEDSKEDSVVVFVGKSLK----------------------------KRSSKFLYLPC-SGIDARPAQVKPHPTGRI--CRGAGQTWGCISLPIT

Query:  LEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVVPSTLHQMMKYPTPKG
        + + STQ+++M EFVV++ RSAYNAI GRP+IH    VPSTLHQ++KY TP G
Subjt:  LEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVVPSTLHQMMKYPTPKG

A0A6J1DZB9 uncharacterized protein LOC1110249042.3e-4836.81Show/hide
Query:  LRQKSKEDPPKVDPAK---ERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFC
        LR K+     ++D  K   E+R  DS  R   S         R  +    R   Y+R+T   + + EIL NI+ S ++ LL  P  +R D EKRN+ K+C
Subjt:  LRQKSKEDPPKVDPAK---ERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLD-LLTEPSLMRRDPEKRNRSKFC

Query:  RFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQ
        RFH+DH H+T++C+ELKRQIE  IQ GYFKK VG       K  +NS E + +R+R+++P +R DRP VIN IFGGP+G Q   K K L R  +R +   
Subjt:  RFHKDHSHDTSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQ

Query:  EDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVV
         + K    + F    L+      ++LP +      + +      R+        GCI LP+T+ + +TQ+++M EFVV++ RSAYNAI GRP+IH    V
Subjt:  EDSKEDSVVVFVGKSLKKRSSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVV

Query:  PSTLHQMMKYPTPKGGDPELQPDKSLNE-----------------PNRGE------------------PVEDLELVPLLGPEK
        PSTLHQ++KY TP          K+  E                  NRG+                  P E+LELVPLL PE+
Subjt:  PSTLHQMMKYPTPKGGDPELQPDKSLNE-----------------PNRGE------------------PVEDLELVPLLGPEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGAAGGCTCCCAAACGTCAAGCAATCATCACTCTCGAGATCCAACCAAGGATCCCAAAGTTCGGTCGGGCGCTTCCAAACCACCTGTAGGTCGGAAGCCGAAGAC
CCGGACCGACTCACAGAGGTCGAAGTCCCGCCTCCAGGTCAAGGTGACGTTCGTTCGACAGGAAGAAATAGACGCCATGAGGATTCAAGTGGCAGCTCTGACTGAGGCGC
TGCAAGAGACAGGTGTACCGCTCCCTGAGCCCCCCGAGCCCGAGGTTCTAGCGACCTCCGAAGAGGAAAAGCCAAAGCTATCGGGATCGACCAGGAAAGCGGGACCTTCG
CAACAAGCTGAACCATCAACGCTCCAAGAGAAACACCCCCTCAACAAGGTCGGAGTCAGTCAACAACGGAGACTTGTTGCTGCAATTCGACAGGTCCAGAAAAAGCTTAG
AAACGACCTTCGAAAGGTCCAGGGGGAAAACTCTCAAGCTTCGACAAATAAGATGGGTCTGCTGATCGGTGGACCACGTGGAGGAAATGGTACCGGCAGCTCGCGCCCCA
CTCATTTCCTTTTGGAAACAGCTCCGCAAGACCTTCATGGCCCAATTTGCGGCCCAGCAAGAAACTCAACACCCGACTCAGTTCTTGCTCACGATAAGGCAGAAGTCTGG
AGAGAGCCTAAGGGATTACGTCAGAAGTCAAAGGAGGATCCTCCCAAAGTCGACCCTGCGAAGGAAAGAAGAGGTCGGGACAGCTCTAAAAGGTCGCCACGTAGCCACCA
ACGATATGATCACGAGCCTCGACGTGAGCAAGACCAGAGACGCGGGAGACCCGAAAGGTACGACCGCTTCACGCCTCTGAACGTCTCGGTAATCGAAATTCTCGCAAACA
TACAAAACTCCAAGCTCGACCTACTCACCGAGCCTAGTCTCATGCGTCGAGACCCTGAGAAGCGCAATAGGTCGAAGTTCTGCCGCTTCCACAAGGATCACAGTCACGAT
ACCTCCAATTGCTATGAGCTGAAGAGACAGATCGAGGGCTTCATCCAAAAAGGGTACTTCAAAAAGCACGTGGGACAAGCTCACAACAGAGGAAGGAAAGAGGCAGCCAA
TTCAGGAGAAGGAAGGGCAAAGAGGGAGAGGACGCAATCTCCCACTAAGCGCACAGATCGACCTACCGTGATCAACATGATCTTCGGAGGTCCCAGCGGGAGGCAGCCAA
GAAAGAAGCACAAGGCGCTGAAACGCTCCCACAAGCGTGTTCTCGAAACCCAAGAAGATAGCAAGGAAGATTCGGTGGTGGTGTTCGTCGGGAAATCGTTGAAGAAACGT
TCTTCAAAGTTTCTCTACTTACCTTGCTCTGGGATTGACGCAAGACCAGCTCAGGTCAAGCCCCACCCCACTGGTCGAATTTGTAGGGGAGCCGGTCAGACCTGGGGATG
CATCAGCCTTCCAATAACCCTTGAGGAAGGAAGCACTCAAATCTCAAGGATGACGGAGTTTGTAGTGGTTGAGGCCAGGTCGGCGTACAATGCTATACTTGGCCGACCTG
TCATCCATGACTTGGAAGTCGTGCCATCAACTCTGCACCAAATGATGAAGTATCCAACCCCCAAGGGGGGGGACCCCGAACTCCAGCCAGACAAATCACTTAACGAGCCC
AACCGTGGAGAGCCCGTGGAGGACTTAGAGCTCGTCCCATTGCTCGGCCCGGAGAAATGCAGGAGCATCGACACAAAGCTGGAAGCTCCCTCTGTTCAGGGGCCCGAAGC
AATGAATATCGACGTAGCGGCCCCGATCTGGATGGACCACATAAAAGCTTTCCTCCGTGGATGGGAGCTACCAGAACAAGTTGACCTTCACAAGGTGCGGTGCAAAGCCG
CAGGGTACCTACTCCGAGAAACCGTGATACTTGTTGAGATTGGTCTACCAACTGCTCGGACCGAAGCCTTTGATGCGTCTAAAAACAACGAGGAGCTTCACCTGAACCTT
GACCTCCTTGAGGAGAGAAGAGGGACCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAGAAGGCTCCCAAACGTCAAGCAATCATCACTCTCGAGATCCAACCAAGGATCCCAAAGTTCGGTCGGGCGCTTCCAAACCACCTGTAGGTCGGAAGCCGAAGAC
CCGGACCGACTCACAGAGGTCGAAGTCCCGCCTCCAGGTCAAGGTGACGTTCGTTCGACAGGAAGAAATAGACGCCATGAGGATTCAAGTGGCAGCTCTGACTGAGGCGC
TGCAAGAGACAGGTGTACCGCTCCCTGAGCCCCCCGAGCCCGAGGTTCTAGCGACCTCCGAAGAGGAAAAGCCAAAGCTATCGGGATCGACCAGGAAAGCGGGACCTTCG
CAACAAGCTGAACCATCAACGCTCCAAGAGAAACACCCCCTCAACAAGGTCGGAGTCAGTCAACAACGGAGACTTGTTGCTGCAATTCGACAGGTCCAGAAAAAGCTTAG
AAACGACCTTCGAAAGGTCCAGGGGGAAAACTCTCAAGCTTCGACAAATAAGATGGGTCTGCTGATCGGTGGACCACGTGGAGGAAATGGTACCGGCAGCTCGCGCCCCA
CTCATTTCCTTTTGGAAACAGCTCCGCAAGACCTTCATGGCCCAATTTGCGGCCCAGCAAGAAACTCAACACCCGACTCAGTTCTTGCTCACGATAAGGCAGAAGTCTGG
AGAGAGCCTAAGGGATTACGTCAGAAGTCAAAGGAGGATCCTCCCAAAGTCGACCCTGCGAAGGAAAGAAGAGGTCGGGACAGCTCTAAAAGGTCGCCACGTAGCCACCA
ACGATATGATCACGAGCCTCGACGTGAGCAAGACCAGAGACGCGGGAGACCCGAAAGGTACGACCGCTTCACGCCTCTGAACGTCTCGGTAATCGAAATTCTCGCAAACA
TACAAAACTCCAAGCTCGACCTACTCACCGAGCCTAGTCTCATGCGTCGAGACCCTGAGAAGCGCAATAGGTCGAAGTTCTGCCGCTTCCACAAGGATCACAGTCACGAT
ACCTCCAATTGCTATGAGCTGAAGAGACAGATCGAGGGCTTCATCCAAAAAGGGTACTTCAAAAAGCACGTGGGACAAGCTCACAACAGAGGAAGGAAAGAGGCAGCCAA
TTCAGGAGAAGGAAGGGCAAAGAGGGAGAGGACGCAATCTCCCACTAAGCGCACAGATCGACCTACCGTGATCAACATGATCTTCGGAGGTCCCAGCGGGAGGCAGCCAA
GAAAGAAGCACAAGGCGCTGAAACGCTCCCACAAGCGTGTTCTCGAAACCCAAGAAGATAGCAAGGAAGATTCGGTGGTGGTGTTCGTCGGGAAATCGTTGAAGAAACGT
TCTTCAAAGTTTCTCTACTTACCTTGCTCTGGGATTGACGCAAGACCAGCTCAGGTCAAGCCCCACCCCACTGGTCGAATTTGTAGGGGAGCCGGTCAGACCTGGGGATG
CATCAGCCTTCCAATAACCCTTGAGGAAGGAAGCACTCAAATCTCAAGGATGACGGAGTTTGTAGTGGTTGAGGCCAGGTCGGCGTACAATGCTATACTTGGCCGACCTG
TCATCCATGACTTGGAAGTCGTGCCATCAACTCTGCACCAAATGATGAAGTATCCAACCCCCAAGGGGGGGGACCCCGAACTCCAGCCAGACAAATCACTTAACGAGCCC
AACCGTGGAGAGCCCGTGGAGGACTTAGAGCTCGTCCCATTGCTCGGCCCGGAGAAATGCAGGAGCATCGACACAAAGCTGGAAGCTCCCTCTGTTCAGGGGCCCGAAGC
AATGAATATCGACGTAGCGGCCCCGATCTGGATGGACCACATAAAAGCTTTCCTCCGTGGATGGGAGCTACCAGAACAAGTTGACCTTCACAAGGTGCGGTGCAAAGCCG
CAGGGTACCTACTCCGAGAAACCGTGATACTTGTTGAGATTGGTCTACCAACTGCTCGGACCGAAGCCTTTGATGCGTCTAAAAACAACGAGGAGCTTCACCTGAACCTT
GACCTCCTTGAGGAGAGAAGAGGGACCTCTTAG
Protein sequenceShow/hide protein sequence
MTEGSQTSSNHHSRDPTKDPKVRSGASKPPVGRKPKTRTDSQRSKSRLQVKVTFVRQEEIDAMRIQVAALTEALQETGVPLPEPPEPEVLATSEEEKPKLSGSTRKAGPS
QQAEPSTLQEKHPLNKVGVSQQRRLVAAIRQVQKKLRNDLRKVQGENSQASTNKMGLLIGGPRGGNGTGSSRPTHFLLETAPQDLHGPICGPARNSTPDSVLAHDKAEVW
REPKGLRQKSKEDPPKVDPAKERRGRDSSKRSPRSHQRYDHEPRREQDQRRGRPERYDRFTPLNVSVIEILANIQNSKLDLLTEPSLMRRDPEKRNRSKFCRFHKDHSHD
TSNCYELKRQIEGFIQKGYFKKHVGQAHNRGRKEAANSGEGRAKRERTQSPTKRTDRPTVINMIFGGPSGRQPRKKHKALKRSHKRVLETQEDSKEDSVVVFVGKSLKKR
SSKFLYLPCSGIDARPAQVKPHPTGRICRGAGQTWGCISLPITLEEGSTQISRMTEFVVVEARSAYNAILGRPVIHDLEVVPSTLHQMMKYPTPKGGDPELQPDKSLNEP
NRGEPVEDLELVPLLGPEKCRSIDTKLEAPSVQGPEAMNIDVAAPIWMDHIKAFLRGWELPEQVDLHKVRCKAAGYLLRETVILVEIGLPTARTEAFDASKNNEELHLNL
DLLEERRGTS