CuGenDBv2

MC05g0457 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC05g0457
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Vegetative cell wall protein gp1-like
Genome location: MC05:3361490..3366580
RNA-Seq Expression: MC05g0457
Synteny: MC05g0457
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR025067 - Protein of unknown function DUF4079
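
The genome location field has the form chromosome:start..end. A minimal Python sketch for parsing it follows. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Location` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive interval length, under the 1-based assumption above.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string such as the one in this record."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("MC05:3361490..3366580")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive assumption the gene spans 5091 bp on MC05; with a half-open convention the figure would be one lower.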


Homology
GenBank top hits (e value, % identity, alignment)
XP_004133799.1 uncharacterized protein LOC101206421 [Cucumis sativus]; e value: 1.55e-160; % identity: 84.76
Query:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA
        MAITL+LPKLPH +P S++KLP   IPTNL  SQNPK S  + F+++V++LK  T+PLTAL LPFFL PQDALAVGGEFGILEGRS ALIHP+VMG LF 
Subjt:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA

Query:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL
        YTLWAGYLGWQWRRVRT+QNEINELKKQ+APAAVTPDG P++APPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLE+IGGGVNTWFRTGKL
Subjt:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL

Query:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALN LNV+LFIWQIPTGI+IV KVFEFT WP
Subjt:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

XP_008437852.1 PREDICTED: uncharacterized protein LOC103483159 [Cucumis melo]; e value: 2.31e-162; % identity: 86.25
Query:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA
        MAITL+LPKLPH +P S++KLP   IPTNL  SQNPK S  + F+++V +LK  T+PLTAL LPFFL PQDALAVGGEFGILEGRS ALIHP+VMG LF 
Subjt:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA

Query:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL
        YTLWAGYLGWQWRRVRTIQNEINELKKQ+APAAVTPDG P++APPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLE+IGGGVNTWFRTGKL
Subjt:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL

Query:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGI+IVFKVFEFT WP
Subjt:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

XP_022147498.1 uncharacterized protein LOC111016406 [Momordica charantia]; e value: 7.28e-219; % identity: 100
Query:  MLPHQSKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQ
        MLPHQSKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQ
Subjt:  MLPHQSKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQ

Query:  DALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDR
        DALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDR
Subjt:  DALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDR

Query:  HFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        HFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
Subjt:  HFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

XP_022979305.1 uncharacterized protein LOC111479070 [Cucurbita maxima]; e value: 6.19e-162; % identity: 79.05
Query:  SKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDAL
        S P+L SLFP          F   N +MAITL+LPKLP I+P  ++KLP   IPT L FSQNPK SP++FF++SVR+LK  TLPLTAL LPFFL PQDAL
Subjt:  SKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDAL

Query:  AVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFN
        A GGEFGILEGRS+ALIHP+VMG LF +TLWAGYLGWQWRRVRTIQNEINELKKQ+ PAAV PDG P++APPSPTELKIQQLTEERKELIKGSFRDRHFN
Subjt:  AVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFN

Query:  AGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        AGSI+LGFGVLE+IGGG+NTW RTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNAL ++LF+WQIPTGI+IV KVFEFTTWP
Subjt:  AGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

XP_038884717.1 uncharacterized protein LOC120075410 [Benincasa hispida]; e value: 1.25e-166; % identity: 87.73
Query:  MAITLTLPKLPHISPNSTTKL--PPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA
        MAITL+LPKLPHI+P S++KL  P IPTNL FSQNPK  P +FF++++R LK  TLPLTAL LPFFL PQDALAVGGEFGILEGRS ALIHP+VMG+LF 
Subjt:  MAITLTLPKLPHISPNSTTKL--PPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA

Query:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL
        YTLWAGYLGWQWRRVRTIQNEINELKKQ+APAAVTPDG P++APPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLE+IGGGVNTWFRTGKL
Subjt:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL

Query:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALN LNVILFIWQIPTGI+IVFKVFEFTTWP
Subjt:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3S2 Uncharacterized protein; e value: 7.51e-161; % identity: 84.76
Query:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA
        MAITL+LPKLPH +P S++KLP   IPTNL  SQNPK S  + F+++V++LK  T+PLTAL LPFFL PQDALAVGGEFGILEGRS ALIHP+VMG LF 
Subjt:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA

Query:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL
        YTLWAGYLGWQWRRVRT+QNEINELKKQ+APAAVTPDG P++APPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLE+IGGGVNTWFRTGKL
Subjt:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL

Query:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALN LNV+LFIWQIPTGI+IV KVFEFT WP
Subjt:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

A0A1S3AUZ2 uncharacterized protein LOC103483159; e value: 1.12e-162; % identity: 86.25
Query:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA
        MAITL+LPKLPH +P S++KLP   IPTNL  SQNPK S  + F+++V +LK  T+PLTAL LPFFL PQDALAVGGEFGILEGRS ALIHP+VMG LF 
Subjt:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA

Query:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL
        YTLWAGYLGWQWRRVRTIQNEINELKKQ+APAAVTPDG P++APPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLE+IGGGVNTWFRTGKL
Subjt:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL

Query:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGI+IVFKVFEFT WP
Subjt:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

A0A5D3DB32 Uncharacterized protein; e value: 1.12e-162; % identity: 86.25
Query:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA
        MAITL+LPKLPH +P S++KLP   IPTNL  SQNPK S  + F+++V +LK  T+PLTAL LPFFL PQDALAVGGEFGILEGRS ALIHP+VMG LF 
Subjt:  MAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFA

Query:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL
        YTLWAGYLGWQWRRVRTIQNEINELKKQ+APAAVTPDG P++APPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLE+IGGGVNTWFRTGKL
Subjt:  YTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKL

Query:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGI+IVFKVFEFT WP
Subjt:  FPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

A0A6J1D1H0 uncharacterized protein LOC111016406; e value: 3.53e-219; % identity: 100
Query:  MLPHQSKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQ
        MLPHQSKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQ
Subjt:  MLPHQSKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQ

Query:  DALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDR
        DALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDR
Subjt:  DALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDR

Query:  HFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        HFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
Subjt:  HFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

A0A6J1ISV5 uncharacterized protein LOC111479070; e value: 3.00e-162; % identity: 79.05
Query:  SKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDAL
        S P+L SLFP          F   N +MAITL+LPKLP I+P  ++KLP   IPT L FSQNPK SP++FF++SVR+LK  TLPLTAL LPFFL PQDAL
Subjt:  SKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLP--PIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDAL

Query:  AVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFN
        A GGEFGILEGRS+ALIHP+VMG LF +TLWAGYLGWQWRRVRTIQNEINELKKQ+ PAAV PDG P++APPSPTELKIQQLTEERKELIKGSFRDRHFN
Subjt:  AVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFN

Query:  AGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        AGSI+LGFGVLE+IGGG+NTW RTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNAL ++LF+WQIPTGI+IV KVFEFTTWP
Subjt:  AGSILLGFGVLESIGGGVNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G61870.1 unknown protein; e value: 3.0e-101; % identity: 70.08
Query:  LPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYL
        LP+  H  P +   L PI T    SQ  K         +++ LKS +LPL  +ALPFFLDPQDA A GGEFGILEGRS ALIHPIVMG LFAYTLW GYL
Subjt:  LPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYL

Query:  GWQWRRVRTIQNEINELKKQLAPAAVTPDGNPI---DAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPH
        GWQWRRVRTIQ+EI++LKKQL P  V+PDG+      +PPS TEL+IQ+LTEERKEL+KGS+RD+HF+AGS+LLGFGVLE++ GGVNT+ RTGKLFPGPH
Subjt:  GWQWRRVRTIQNEINELKKQLAPAAVTPDGNPI---DAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPH

Query:  LFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
        L+AGAGITVLWA AAALVPAMQKGN+TAR+LHIALNA+NV+LFIWQIPTG++IV KVFEFT WP
Subjt:  LFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP

AT3G61870.2 unknown protein; e value: 5.6e-71; % identity: 66.02
Query:  LPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYL
        LP+  H  P +   L PI T    SQ  K         +++ LKS +LPL  +ALPFFLDPQDA A GGEFGILEGRS ALIHPIVMG LFAYTLW GYL
Subjt:  LPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFGILEGRSVALIHPIVMGSLFAYTLWAGYL

Query:  GWQWRRVRTIQNEINELKKQLAPAAVTPDGNPI---DAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPH
        GWQWRRVRTIQ+EI++LKKQL P  V+PDG+      +PPS TEL+IQ+LTEERKEL+KGS+RD+HF+AGS+LLGFGVLE++ GGVNT+ RTGKLFPGPH
Subjt:  GWQWRRVRTIQNEINELKKQLAPAAVTPDGNPI---DAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGGVNTWFRTGKLFPGPH

Query:  LFAGAG
        L+AGAG
Subjt:  LFAGAG


Sequences
CDS sequence
ATGTTGCCACATCAATCTAAACCCAATCTTCAATCTCTTTTTCCCTTATCCTTTCTTCTCCATCCTTCTCTCTGTTTCCACACCAAGAATCCGACCATGGCCATCACCCT
CACCCTCCCGAAACTCCCCCATATTTCCCCCAATTCCACCACCAAGCTTCCCCCCATTCCCACGAATCTCGACTTCTCCCAAAACCCTAAAAAATCCCCGTACAGTTTCT
TCCTTCGGAGCGTTCGCGATCTCAAATCATGTACTCTCCCTCTCACTGCTCTCGCATTGCCCTTCTTCCTCGATCCGCAGGATGCGCTTGCTGTTGGCGGGGAGTTCGGG
ATTTTGGAGGGGAGATCAGTTGCCCTAATTCACCCAATTGTTATGGGCTCCCTTTTCGCGTACACTCTGTGGGCAGGCTACTTGGGGTGGCAATGGCGGCGAGTGAGGAC
GATTCAGAATGAAATTAACGAGCTCAAGAAGCAACTGGCTCCTGCGGCGGTCACCCCTGATGGGAACCCAATTGACGCACCGCCATCTCCCACTGAATTGAAGATTCAGC
AGCTCACAGAGGAGAGGAAAGAGTTGATCAAAGGGTCCTTCAGAGATAGGCACTTCAATGCTGGCTCCATACTACTCGGATTCGGAGTTCTGGAATCGATCGGTGGAGGC
GTAAATACCTGGTTTAGAACAGGAAAGCTCTTTCCTGGGCCTCACTTGTTTGCTGGAGCAGGGATAACTGTTCTGTGGGCACTGGCAGCTGCTCTTGTACCTGCAATGCA
GAAGGGCAATGAGACAGCTAGAAATCTTCACATTGCGCTCAATGCTCTGAATGTTATTCTCTTTATATGGCAGATTCCCACTGGAATTGAAATTGTCTTCAAAGTCTTTG
AATTCACTACTTGGCCTTAA
mRNA sequence
CGAAATTACTTTATGCAAAGCAATAATTAAACTTGTGAGTTCTAATGAGAGCTACTTGTATTTAAACTTGATTATACTACACACAACTCATAAGAAGTGTATGGGGCTGT
CGTGGGGGCGTAGGTCCGAACAGAGCGGCGCGCCCTGGTCCACTTTATACAGTCGCCAACGCCCAAATGTTGCCACATCAATCTAAACCCAATCTTCAATCTCTTTTTCC
CTTATCCTTTCTTCTCCATCCTTCTCTCTGTTTCCACACCAAGAATCCGACCATGGCCATCACCCTCACCCTCCCGAAACTCCCCCATATTTCCCCCAATTCCACCACCA
AGCTTCCCCCCATTCCCACGAATCTCGACTTCTCCCAAAACCCTAAAAAATCCCCGTACAGTTTCTTCCTTCGGAGCGTTCGCGATCTCAAATCATGTACTCTCCCTCTC
ACTGCTCTCGCATTGCCCTTCTTCCTCGATCCGCAGGATGCGCTTGCTGTTGGCGGGGAGTTCGGGATTTTGGAGGGGAGATCAGTTGCCCTAATTCACCCAATTGTTAT
GGGCTCCCTTTTCGCGTACACTCTGTGGGCAGGCTACTTGGGGTGGCAATGGCGGCGAGTGAGGACGATTCAGAATGAAATTAACGAGCTCAAGAAGCAACTGGCTCCTG
CGGCGGTCACCCCTGATGGGAACCCAATTGACGCACCGCCATCTCCCACTGAATTGAAGATTCAGCAGCTCACAGAGGAGAGGAAAGAGTTGATCAAAGGGTCCTTCAGA
GATAGGCACTTCAATGCTGGCTCCATACTACTCGGATTCGGAGTTCTGGAATCGATCGGTGGAGGCGTAAATACCTGGTTTAGAACAGGAAAGCTCTTTCCTGGGCCTCA
CTTGTTTGCTGGAGCAGGGATAACTGTTCTGTGGGCACTGGCAGCTGCTCTTGTACCTGCAATGCAGAAGGGCAATGAGACAGCTAGAAATCTTCACATTGCGCTCAATG
CTCTGAATGTTATTCTCTTTATATGGCAGATTCCCACTGGAATTGAAATTGTCTTCAAAGTCTTTGAATTCACTACTTGGCCTTAATTTCCTGATATTTCAGTCTAAACT
TCATTCCCCATTTCCTTATGGCCTCCCTCTTCTCAACTACATGTCTTGGACTTCCTTTTCTCTCCACTTTCCCATCTCTCACTTCTTGCCTCCGGCAAATCTTCTTCCCT
TCAGCTATGACTTATTACTCGACAACTCGAAGTCGATCAAAACTTACATTTCTATGTCGTTTTTAACGATCTCACTGTTCTCCGAAACAGATATCAGTATGCGTTTGTCT
ATAGCTCTCAACAGGTAATTTTTAGAGTCTGCATTTGGATATTGGCCTGTCTCAATGGTCCAGAGAAGTGCTGCAGATTCCAGTTCCTACCAAATATTGTACCAACTTCA
AATGGTAGACTAACTGTGCATCTTCGAGAAAGCTCAGACTAACACCGGATCGAGACGTGAGCCTGAGCGCCCCAATCCACAGAAGAAAGTGCTTTCTAGCGAGCAGAAGC
ATCTCCTTGTTCCCTCAACAATGCGGTAGTCAGATGGAAGTCTGTCCAGTAGAGTTTCAGTTGTTTCTTCCATGTTCAAAAAGTTATGATATTGGCAGTTATAGTATAAT
CTAAAGAGGATGCTTCCCCATTGTTCTTCACATTTTTCTCCATTTAATGTATCTGTTCTTTGAAGAAGCATTTCAGGTTCTTTCTTTTACTATTTTTGCCTGTGTTTCCA
TTCACTCAGAATCGGTTCATCTTCTCTTTGAATTTTCCACCATTTTCTGCCTCACACCTTCTCTAACAGATCCACTGGTTCCTACTATAAATAATTGCAGAGACACAGCT
TCCATTATCACTGTCTTTTCATTCTTAAAACTTTCCATTCTTTAGGTTTTCATTTTCTTAGCTGCCAAACAATCTCATTCGTTATGGCAAATCTTCCTCGCTTTGGTCGT
ACATGGCAGCGTCTTTCCACAATTGCACGCCCTTTTCCGGCCACCGCAAATCCTGCTTCAGAACCAGAGCCTGAGATTCTGCCACTTGCTCCGGCCACCGAAACTCTCGA
ATCTTTCGAGTCCAACCCACCGCCGGCAGCTTCTCCGGCCAAGAAACCTGCCACGCCAAGTTTCAAAGCTACTGTCACACGTGTAGCCAGCCCGCCGGTGAAGCCTGCAC
GATCCCCTCCGCCCTCTCCTGCAAAGAAATCTTCCGACCGGAGACATTCAATTAGGCCCTAATTCGTATCAGAAAACCGTCAAGCCTCACCAACCAACCACGCCACCACT
TTCCCCTCTGGCTCTGCCTAAATCTGCAAATGCGACGACGGCTCAATCCAGAATCTCGCTGGAGACGAAGCAGAAAGCCGGTCCATACAAAAAGACCGTCGAGAAGGCGG
AGAAGTCGGACTGGCCGTCGGAGTACGGATCCGGAAAAGCCGCCGCAGAAGCAGCAGGCGGAGGCTATAAACCTTACCGGACATAACGTAGGCGCGGTCATGGAAGTACG
CCAATTCTCCGAATCCGATGATCGTTCAGGCGGAGAAATCGCCAAGAAGAACGAAACAGAAGGCGGCGTCCGCCATGGAAATGATGATGAGAGGAAGAAGGGAAGGAAGG
ATAAAAGTGACAGAGTAAAGGGATTTCCAAGGACGGCATTCATGAACAGTAATTTTCAAGAAGTGAATAATTCTGTAATGTATGATTCGACGTGCAGGGGCCGTGATCCA
GGGCTGCACCTTGATTTCTCCGGGAAGTCGAAGGATGATGGAGCCATTTTCGACGGCGCCCAGAACTCTTAAGTTCTAAGGAGAAAGTGCAATGGAAGTTGTTACCCTTT
TTAAATGTGTGATTATTAAATTAAATAAATAAGCTAAAATGGAATAATATTCCACCTTTTTGTATTTAGGATCTCATATTTTTAGACCACCAAAATGTTAGACATTTAAC
TTAGAATGATATTATCCACTTTTAACATGATTCTTCGTAAGCTCTCGTGATTTGCTTTTGGTATCATTCAAAATTGTGATGAAAATGGTGTCCCAAATTTATATACGATG
AACTTTC
Protein sequence
MLPHQSKPNLQSLFPLSFLLHPSLCFHTKNPTMAITLTLPKLPHISPNSTTKLPPIPTNLDFSQNPKKSPYSFFLRSVRDLKSCTLPLTALALPFFLDPQDALAVGGEFG
ILEGRSVALIHPIVMGSLFAYTLWAGYLGWQWRRVRTIQNEINELKKQLAPAAVTPDGNPIDAPPSPTELKIQQLTEERKELIKGSFRDRHFNAGSILLGFGVLESIGGG
VNTWFRTGKLFPGPHLFAGAGITVLWALAAALVPAMQKGNETARNLHIALNALNVILFIWQIPTGIEIVFKVFEFTTWP
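
The CDS and protein sequences listed above can be cross-checked by codon translation. The sketch below is a minimal Python illustration, assuming the standard genetic code (the record does not state which translation table was used); `translate` and `CODON_TABLE` are hypothetical names introduced here, not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters outside T/C/A/G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last three codons of the CDS listed in this record.
print(translate("ATGTTGCCACATCAATCT"))
print(translate("TGGCCTTAA"))
```

The two fragments translate to MLPHQS and WP, matching the start and end of the protein sequence in this record. Passing the full CDS (line breaks included) to `translate` should allow the same comparison over the whole sequence.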