CuGenDBv2

MS024690 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS024690
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DUF4283 domain-containing protein
Genome location: scaffold92:2103223..2104554
RNA-Seq Expression: MS024690
Synteny: MS024690
Gene Ontology terms: NA
InterPro domains: NA
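
The genome location above is written as scaffold:start..end. The following is a minimal sketch of how such a string could be split and the span length computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; the helper names parse_location and span_length are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold92:2103223..2104554")
print(scaffold, span_length(start, end))
```

Under that assumption the span is 1332 bp, which equals 3 x 443 + 3, the length expected for a 443-residue protein plus a stop codon with no introns.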


Homology
GenBank top hits (E-value, % identity, alignment)
KAA0034062.1 uncharacterized protein E6C27_scaffold65G00480 [Cucumis melo var. makuwa] | E-value: 9.2e-64 | Identity: 45.82%
Query:  SHPPMLISSNPVTQKEKCGFCVQTLPNLSKVDVPQSHPSMSISSNPVTQKEKGGSSVKTLPNLFQSMALRDDPKSIPSYPEAA--STMSTKTLELQQNSL
        S  P++  S+ V+  E   F ++  P L          S++  + P ++  K G+SV+       S  +    K+IP   E    S     +L  QQ+S 
Subjt:  SHPPMLISSNPVTQKEKCGFCVQTLPNLSKVDVPQSHPSMSISSNPVTQKEKGGSSVKTLPNLFQSMALRDDPKSIPSYPEAA--STMSTKTLELQQNSL

Query:  IQSMALHDPHVSISSGPKAAPELKRNKLIQSMALSPSVVEDQFRAAKTSTPTILAVHSNQPLLSSPSSIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIIN
        I                  APELK         + PSVVEDQ + AKT   T++A H++QP  S  +SIP LQPS  SE +LKF S  I C T+K+ I N
Subjt:  IQSMALHDPHVSISSGPKAAPELKRNKLIQSMALSPSVVEDQFRAAKTSTPTILAVHSNQPLLSSPSSIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIIN

Query:  TPFKETSVDNRPIVYAIDP-QIKSLAIALSEV-PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSASCPEKMLCWNFRGTATVNLIRALKDLIQVHEP
        +P KET+  + P VY IDP +I SL I+LSEV  T +SN +QY I  VPT++GG++ G G E  S S  C +KML W F       L+RALKDLIQ+HEP
Subjt:  TPFKETSVDNRPIVYAIDP-QIKSLAIALSEV-PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSASCPEKMLCWNFRGTATVNLIRALKDLIQVHEP

Query:  SIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETN
        SIVLIFG+ I+G  A +V+QEL FCGSY  +P+GYNGGVWLLLS+QDVQT+VNSY+ QQVSASV FHSETN
Subjt:  SIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETN

KAG6600113.1 hypothetical protein SDJN03_05346, partial [Cucurbita argyrosperma subsp. sororia] | E-value: 9.8e-66 | Identity: 49.14%
Query:  SMSISSNPVTQKEKGGSSVKTL-----PNLFQSMALRDDPKS--IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLI
        S+S ++NP +      +   +L     PNL  S     + K   IPS P  AS   ++   LEL  N    S+ + +    +   P     P LK+  LI
Subjt:  SMSISSNPVTQKEKGGSSVKTL-----PNLFQSMALRDDPKS--IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLI

Query:  QSMALSPSVVED-QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAI
        QS+ L+P V+ED QFR  KTS+PT LAV +N+P  SS +  SI  LQPS   E  LKFYS  IQ ST +K I NTP +  SVD+ P +Y IDP I +LAI
Subjt:  QSMALSPSVVED-QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAI

Query:  ALSEV--PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFC
         L E+   T  SN +++ I  VPT          SE+VSMSAS C +KMLCWNFR T    L+RALKDLIQ+H+PSIVLIFG+ ISGA AD V +EL F 
Subjt:  ALSEV--PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFC

Query:  GSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL
        GSYCRKP+GY GG WLLLS+QDVQ EV+SY+ QQVSASV  HS+ NK  +
Subjt:  GSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL

KAG7030784.1 hypothetical protein SDJN02_04821, partial [Cucurbita argyrosperma subsp. argyrosperma] | E-value: 7.0e-64 | Identity: 49.85%
Query:  SSNPVTQKEKGGSSVKTLPNLFQSMALRDDP----KSIPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPK---AAPELKRNKLIQSMALS
        SS+ + Q +   SS+++ PN   S +   +     + IPS P  AS   ++   LEL  N    S+ + +    +   P     AP LK+  LIQS+ L+
Subjt:  SSNPVTQKEKGGSSVKTLPNLFQSMALRDDP----KSIPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPK---AAPELKRNKLIQSMALS

Query:  PSVVED-QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV-
        P V+ED QFR  KTS+PT LAV +N+P   S +  SI  LQPS   E  LKFYS  IQ ST +K I NTP +  SVD+ P +Y IDP I SLAI L E+ 
Subjt:  PSVVED-QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV-

Query:  -PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRK
          T  SN +++ I  VPT          SE+VSMSAS C +KMLCWNFR T    L+RALKDLIQ+H+PSIVLIFG+ I G  AD VV+EL F GSYCRK
Subjt:  -PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRK

Query:  PNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASV
        P+GY GG WLLLS+QDVQ EV+SY+ QQVSASV
Subjt:  PNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASV

XP_022941630.1 uncharacterized protein LOC111446932 isoform X1 [Cucurbita moschata] | E-value: 1.2e-66 | Identity: 50.72%
Query:  SMSISSNPVTQKEKGGSSVKTLPNLFQSMALRDDPKS--IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLIQSMAL
        S+S + NP        SS    PNL  S     + K   IPS P  AS   ++   LEL  N    S+ + +    +   P     P LK+  LIQS+ L
Subjt:  SMSISSNPVTQKEKGGSSVKTLPNLFQSMALRDDPKS--IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLIQSMAL

Query:  SPSVVED-QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV
        +P V+ED QFR  KTS+PT LAV +N+P  SS +  SI  LQPS   E  LKFYS  IQ ST +K I NTP +  SVD+ P +Y IDP I SLAI L E+
Subjt:  SPSVVED-QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV

Query:  --PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCR
           T  SN +++ I  VPT          SE+VSMSAS C +KMLCWNFR T    L+RALKDLIQ+H+PSIVLIFG+ ISGA AD VV+EL F GSYCR
Subjt:  --PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCR

Query:  KPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL
        KP+GY GG WLLLS+QDVQ EV+SY+ QQVSASV  HS+ NK  +
Subjt:  KPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL

XP_022941632.1 uncharacterized protein LOC111446932 isoform X2 [Cucurbita moschata] | E-value: 4.0e-67 | Identity: 50.89%
Query:  GSSVKTLPNLFQSMALRDDPKS----------IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLIQSMALSPSVVED
        GSS+ +  N   S  L  +P S          IPS P  AS   ++   LEL  N    S+ + +    +   P     P LK+  LIQS+ L+P V+ED
Subjt:  GSSVKTLPNLFQSMALRDDPKS----------IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLIQSMALSPSVVED

Query:  -QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV--PTMVS
         QFR  KTS+PT LAV +N+P  SS +  SI  LQPS   E  LKFYS  IQ ST +K I NTP +  SVD+ P +Y IDP I SLAI L E+   T  S
Subjt:  -QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV--PTMVS

Query:  NHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNG
        N +++ I  VPT          SE+VSMSAS C +KMLCWNFR T    L+RALKDLIQ+H+PSIVLIFG+ ISGA AD VV+EL F GSYCRKP+GY G
Subjt:  NHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNG

Query:  GVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL
        G WLLLS+QDVQ EV+SY+ QQVSASV  HS+ NK  +
Subjt:  GVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL
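
Each alignment block above has a middle line in which a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of counting these for a single block, using the final block of the KAG7030784.1 alignment. The function name midline_stats is hypothetical, and the per-block figure it gives is not the hit-level % identity reported in the hit header, which is computed over the whole alignment.

```python
def midline_stats(midline, aligned_length):
    """Count identities and positives in one BLAST-style midline.

    Letters are identities, '+' marks conservative substitutions,
    and spaces are mismatches or gaps. The midline is padded to the
    aligned length in case trailing spaces were stripped.
    """
    padded = midline.ljust(aligned_length)
    identities = sum(ch.isalpha() for ch in padded)
    conservative = padded.count("+")
    return identities, identities + conservative


# Final block of the KAG7030784.1 alignment, copied from the listing above.
query = "PNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASV"
midline = "P+GY GG WLLLS+QDVQ EV+SY+ QQVSASV"

identities, positives = midline_stats(midline, len(query))
percent_identity = round(100 * identities / len(query), 2)
print(identities, positives, percent_identity)
```

For this block the sketch counts 25 identities and 29 positives over 33 columns.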

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0KLB0 DUF4283 domain-containing protein | E-value: 7.8e-61 | Identity: 43.9%
Query:  SHPPMLISSNPVTQKEKCGFCVQT---------LPNLSKVDVPQSHPSMSISSNPV--------TQKEKGGSSVKTLPNLFQSMALRDDPKSIPSYPEAA
        S  P++  S+PV+  E   F  +           PNL K +  ++   + ISS  V         +KEK   SV+ LPNL                P+  
Subjt:  SHPPMLISSNPVTQKEKCGFCVQT---------LPNLSKVDVPQSHPSMSISSNPV--------TQKEKGGSSVKTLPNLFQSMALRDDPKSIPSYPEAA

Query:  STMSTKTLELQQNSLIQSMALHDPHVSISSGPKAAPELKRNKLIQSMALSPSVVEDQFRAAKTSTPTILAVHSNQPLLSSPSSIPSLQPSLVSEVSLKFY
        ST++ K                            APELKR        + PSVVED+ +  KT   T++A H++QP  S  +SIP LQPS  SE +LKF 
Subjt:  STMSTKTLELQQNSLIQSMALHDPHVSISSGPKAAPELKRNKLIQSMALSPSVVEDQFRAAKTSTPTILAVHSNQPLLSSPSSIPSLQPSLVSEVSLKFY

Query:  SAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDP-QIKSLAIALSEVPTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSASCPEKMLCWNFRGTATVN
        S  I C T+K+ I N+P K  +  + P VY IDP +I SL IALSEV T         I  VPT++GG+E G GSE  S S  C +K+L W F       
Subjt:  SAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDP-QIKSLAIALSEVPTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSASCPEKMLCWNFRGTATVN

Query:  LIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETN
        L+RALKDLIQ+HEPSIVLIFG+ ISG   D+V++EL FCGSY  KP+GYNGGVWLLLS+QDVQT+VNS++SQQVSASV FHSETN
Subjt:  LIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETN

A0A5A7SSJ3 DUF4283 domain-containing protein | E-value: 4.4e-64 | Identity: 45.82%
Query:  SHPPMLISSNPVTQKEKCGFCVQTLPNLSKVDVPQSHPSMSISSNPVTQKEKGGSSVKTLPNLFQSMALRDDPKSIPSYPEAA--STMSTKTLELQQNSL
        S  P++  S+ V+  E   F ++  P L          S++  + P ++  K G+SV+       S  +    K+IP   E    S     +L  QQ+S 
Subjt:  SHPPMLISSNPVTQKEKCGFCVQTLPNLSKVDVPQSHPSMSISSNPVTQKEKGGSSVKTLPNLFQSMALRDDPKSIPSYPEAA--STMSTKTLELQQNSL

Query:  IQSMALHDPHVSISSGPKAAPELKRNKLIQSMALSPSVVEDQFRAAKTSTPTILAVHSNQPLLSSPSSIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIIN
        I                  APELK         + PSVVEDQ + AKT   T++A H++QP  S  +SIP LQPS  SE +LKF S  I C T+K+ I N
Subjt:  IQSMALHDPHVSISSGPKAAPELKRNKLIQSMALSPSVVEDQFRAAKTSTPTILAVHSNQPLLSSPSSIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIIN

Query:  TPFKETSVDNRPIVYAIDP-QIKSLAIALSEV-PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSASCPEKMLCWNFRGTATVNLIRALKDLIQVHEP
        +P KET+  + P VY IDP +I SL I+LSEV  T +SN +QY I  VPT++GG++ G G E  S S  C +KML W F       L+RALKDLIQ+HEP
Subjt:  TPFKETSVDNRPIVYAIDP-QIKSLAIALSEV-PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSASCPEKMLCWNFRGTATVNLIRALKDLIQVHEP

Query:  SIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETN
        SIVLIFG+ I+G  A +V+QEL FCGSY  +P+GYNGGVWLLLS+QDVQT+VNSY+ QQVSASV FHSETN
Subjt:  SIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETN

A0A5A7SSW6 DUF4283 domain-containing protein | E-value: 4.5e-56 | Identity: 44.65%
Query:  SVKTLPNLFQSMALRDDPKSIPSYPEAASTMSTKTLELQQNSLIQSMALHDPHVSISSGPKAAPELKRNKLIQSMALSPSVVEDQFRAAKTSTPTILAVH
        ++K  P L     + ++ K++P++P  +ST + KT E                                    S+ L+ S+VEDQFRAAK S PT LA+H
Subjt:  SVKTLPNLFQSMALRDDPKSIPSYPEAASTMSTKTLELQQNSLIQSMALHDPHVSISSGPKAAPELKRNKLIQSMALSPSVVEDQFRAAKTSTPTILAVH

Query:  SNQPLLSSPSSIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETS-VDNRPIVYAIDPQIKSLAIALSEVPTMV-SNHSQYVISFVPTLRGGNER
        +N    SS             E  L  +S   Q  T  K +INTPF   + VD+ P VY IDP   SL I  SEVPT   SN +QY I+FV      NE 
Subjt:  SNQPLLSSPSSIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETS-VDNRPIVYAIDPQIKSLAIALSEVPTMV-SNHSQYVISFVPTLRGGNER

Query:  GAGSESVSMSASCPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNS
           S++ SM + C +KMLCWNF G     L +ALKDLI +HEPSIVLIFGS IS + AD+V++EL F G Y RKP+GYNGGVW++LSRQDVQTEVNS +S
Subjt:  GAGSESVSMSASCPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNS

Query:  QQVSASVHFHSETNKPTL
        Q+V ASVHFH + N+P L
Subjt:  QQVSASVHFHSETNKPTL

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X2 | E-value: 1.9e-67 | Identity: 50.89%
Query:  GSSVKTLPNLFQSMALRDDPKS----------IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLIQSMALSPSVVED
        GSS+ +  N   S  L  +P S          IPS P  AS   ++   LEL  N    S+ + +    +   P     P LK+  LIQS+ L+P V+ED
Subjt:  GSSVKTLPNLFQSMALRDDPKS----------IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLIQSMALSPSVVED

Query:  -QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV--PTMVS
         QFR  KTS+PT LAV +N+P  SS +  SI  LQPS   E  LKFYS  IQ ST +K I NTP +  SVD+ P +Y IDP I SLAI L E+   T  S
Subjt:  -QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV--PTMVS

Query:  NHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNG
        N +++ I  VPT          SE+VSMSAS C +KMLCWNFR T    L+RALKDLIQ+H+PSIVLIFG+ ISGA AD VV+EL F GSYCRKP+GY G
Subjt:  NHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNG

Query:  GVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL
        G WLLLS+QDVQ EV+SY+ QQVSASV  HS+ NK  +
Subjt:  GVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X1 | E-value: 5.6e-67 | Identity: 50.72%
Query:  SMSISSNPVTQKEKGGSSVKTLPNLFQSMALRDDPKS--IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLIQSMAL
        S+S + NP        SS    PNL  S     + K   IPS P  AS   ++   LEL  N    S+ + +    +   P     P LK+  LIQS+ L
Subjt:  SMSISSNPVTQKEKGGSSVKTLPNLFQSMALRDDPKS--IPSYPEAASTMSTK--TLELQQNSLIQSMALHDPHVSISSGPKAA--PELKRNKLIQSMAL

Query:  SPSVVED-QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV
        +P V+ED QFR  KTS+PT LAV +N+P  SS +  SI  LQPS   E  LKFYS  IQ ST +K I NTP +  SVD+ P +Y IDP I SLAI L E+
Subjt:  SPSVVED-QFRAAKTSTPTILAVHSNQPLLSSPS--SIPSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEV

Query:  --PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCR
           T  SN +++ I  VPT          SE+VSMSAS C +KMLCWNFR T    L+RALKDLIQ+H+PSIVLIFG+ ISGA AD VV+EL F GSYCR
Subjt:  --PTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSAS-CPEKMLCWNFRGTATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCR

Query:  KPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL
        KP+GY GG WLLLS+QDVQ EV+SY+ QQVSASV  HS+ NK  +
Subjt:  KPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTL

SwissProt top hits (E-value, % identity, alignment)
No hits found
Arabidopsis top hits (E-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGAGAAGGTGAAAAAACAGCTACAAATTTTTCCAAATTGGCGCAGAAGAAATATCAGTCACCTGGATTGATAGCTAAACCAGTTGAAACCCAAAAGGAGAAATGCGG
AGTCTATGTTCCAACACTGCCTAATTTGCCTAAAGTAGAAGTGCCCCAAAGCCACCCCCCCATGTTGATTTCCTCAAACCCTGTAACCCAAAAGGAGAAATGTGGATTCT
GTGTTCAAACACTGCCCAATTTGTCTAAAGTAGATGTGCCCCAAAGCCACCCCTCCATGTCGATTTCCTCAAACCCTGTAACCCAAAAGGAGAAAGGTGGATCTTCTGTT
AAAACACTGCCCAATTTGTTTCAGTCCATGGCTTTACGTGACGATCCCAAGTCGATTCCTTCATACCCTGAAGCAGCTTCAACAATGTCCACTAAAACTCTTGAGTTACA
ACAGAACAGCTTGATTCAATCCATGGCTTTACATGACCCTCATGTGTCGATTTCATCGGGTCCTAAAGCAGCTCCTGAGTTAAAACGTAACAAATTGATTCAATCCATGG
CTTTATCCCCTTCTGTTGTTGAAGATCAGTTCAGGGCAGCAAAAACCAGCACCCCCACCATTCTTGCAGTCCATAGCAACCAACCACTACTATCATCGCCTTCCAGCATT
CCATCCCTACAACCATCTCTTGTTTCAGAGGTTAGCCTCAAGTTCTATTCGGCTGGGATCCAATGCTCAACAAAAAAGAAATATATAATCAACACTCCATTTAAAGAAAC
CAGTGTCGATAATCGTCCCATTGTTTATGCAATCGACCCACAGATCAAAAGCCTTGCAATTGCTCTATCAGAAGTGCCAACCATGGTATCAAACCACAGCCAGTATGTTA
TCAGCTTCGTGCCGACTTTAAGAGGTGGTAATGAGCGTGGAGCTGGTTCGGAGTCAGTATCTATGTCAGCATCATGTCCAGAGAAAATGTTGTGCTGGAATTTTCGTGGG
ACAGCCACCGTCAATCTAATTCGAGCATTGAAAGATCTGATTCAAGTACACGAACCATCCATTGTACTGATCTTTGGCAGCAACATCAGTGGAGCTGCTGCAGATCAAGT
CGTGCAGGAGCTCACTTTCTGTGGTTCGTACTGCAGAAAGCCCAATGGCTACAATGGTGGCGTTTGGCTGTTGTTGTCCAGGCAAGATGTCCAAACTGAAGTCAACTCAT
ACAACTCACAGCAGGTTTCTGCATCAGTACATTTCCATTCTGAAACCAATAAACCAACGCTAGGTCTTTTCGATGCACTGTACAGATACCGAAACCTATACTTCGACGAA
CTCGATGGATAA
mRNA sequence
ATGGGAGAAGGTGAAAAAACAGCTACAAATTTTTCCAAATTGGCGCAGAAGAAATATCAGTCACCTGGATTGATAGCTAAACCAGTTGAAACCCAAAAGGAGAAATGCGG
AGTCTATGTTCCAACACTGCCTAATTTGCCTAAAGTAGAAGTGCCCCAAAGCCACCCCCCCATGTTGATTTCCTCAAACCCTGTAACCCAAAAGGAGAAATGTGGATTCT
GTGTTCAAACACTGCCCAATTTGTCTAAAGTAGATGTGCCCCAAAGCCACCCCTCCATGTCGATTTCCTCAAACCCTGTAACCCAAAAGGAGAAAGGTGGATCTTCTGTT
AAAACACTGCCCAATTTGTTTCAGTCCATGGCTTTACGTGACGATCCCAAGTCGATTCCTTCATACCCTGAAGCAGCTTCAACAATGTCCACTAAAACTCTTGAGTTACA
ACAGAACAGCTTGATTCAATCCATGGCTTTACATGACCCTCATGTGTCGATTTCATCGGGTCCTAAAGCAGCTCCTGAGTTAAAACGTAACAAATTGATTCAATCCATGG
CTTTATCCCCTTCTGTTGTTGAAGATCAGTTCAGGGCAGCAAAAACCAGCACCCCCACCATTCTTGCAGTCCATAGCAACCAACCACTACTATCATCGCCTTCCAGCATT
CCATCCCTACAACCATCTCTTGTTTCAGAGGTTAGCCTCAAGTTCTATTCGGCTGGGATCCAATGCTCAACAAAAAAGAAATATATAATCAACACTCCATTTAAAGAAAC
CAGTGTCGATAATCGTCCCATTGTTTATGCAATCGACCCACAGATCAAAAGCCTTGCAATTGCTCTATCAGAAGTGCCAACCATGGTATCAAACCACAGCCAGTATGTTA
TCAGCTTCGTGCCGACTTTAAGAGGTGGTAATGAGCGTGGAGCTGGTTCGGAGTCAGTATCTATGTCAGCATCATGTCCAGAGAAAATGTTGTGCTGGAATTTTCGTGGG
ACAGCCACCGTCAATCTAATTCGAGCATTGAAAGATCTGATTCAAGTACACGAACCATCCATTGTACTGATCTTTGGCAGCAACATCAGTGGAGCTGCTGCAGATCAAGT
CGTGCAGGAGCTCACTTTCTGTGGTTCGTACTGCAGAAAGCCCAATGGCTACAATGGTGGCGTTTGGCTGTTGTTGTCCAGGCAAGATGTCCAAACTGAAGTCAACTCAT
ACAACTCACAGCAGGTTTCTGCATCAGTACATTTCCATTCTGAAACCAATAAACCAACGCTAGGTCTTTTCGATGCACTGTACAGATACCGAAACCTATACTTCGACGAA
CTCGATGGATAA
Protein sequence
MGEGEKTATNFSKLAQKKYQSPGLIAKPVETQKEKCGVYVPTLPNLPKVEVPQSHPPMLISSNPVTQKEKCGFCVQTLPNLSKVDVPQSHPSMSISSNPVTQKEKGGSSV
KTLPNLFQSMALRDDPKSIPSYPEAASTMSTKTLELQQNSLIQSMALHDPHVSISSGPKAAPELKRNKLIQSMALSPSVVEDQFRAAKTSTPTILAVHSNQPLLSSPSSI
PSLQPSLVSEVSLKFYSAGIQCSTKKKYIINTPFKETSVDNRPIVYAIDPQIKSLAIALSEVPTMVSNHSQYVISFVPTLRGGNERGAGSESVSMSASCPEKMLCWNFRG
TATVNLIRALKDLIQVHEPSIVLIFGSNISGAAADQVVQELTFCGSYCRKPNGYNGGVWLLLSRQDVQTEVNSYNSQQVSASVHFHSETNKPTLGLFDALYRYRNLYFDE
LDG
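
The protein sequence above corresponds to a codon-by-codon translation of the CDS sequence. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1); the function name translate is hypothetical. It is applied only to the first 30 and last 12 nucleotides of the CDS, copied from the listing above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS sequence listed above.
cds_start = "ATGGGAGAAGGTGAAAAAACAGCTACAAAT"
cds_end = "CTCGATGGATAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MGEGEKTATN and LDG followed by a stop, matching the first ten and last three residues of the protein sequence.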