; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10006571 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10006571
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCalcineurin-like metallo-phosphoesterase superfamily protein
Genome locationChr07:19980800..19984031
RNA-Seq ExpressionHG10006571
SyntenyHG10006571
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR029052 - Metallo-dependent phosphatase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139260.1 uncharacterized protein LOC101208944 isoform X1 [Cucumis sativus]1.1e-11878.29Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        MADPS+S TLL QLCLC AFYLSLNMGR K+YDFLKI D+NPLDFYFISVWGGLRSVKEETLLLKQ+              E+  KVSHAKFILHI EPG
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        ENDRLMQNGTWYFSSLKVPWHSI+ASRG DG +FIER KL+Y QTLDIIAIDT LLQE +AMGSAS  L SHLLWLKRTLQAS+SNWRIVVGFHPL+TCE
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPG--PFQTKHFPSQRSSFREFLLQRVSSVETV
        NNT+SLET  KH FES+HRIF+ENGVNAYLSRRGCT+NVRIGSIAYIG+PG  P Q  HF S++SSFREFLLQ VSS+E V
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPG--PFQTKHFPSQRSSFREFLLQRVSSVETV

XP_008456552.1 PREDICTED: uncharacterized protein LOC103496469 [Cucumis melo]1.5e-12379.93Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        MADPS+S T+L Q+CLC AFYL LNMGR K+YD LKIRDENPLDFYFISVWGGLRSVKEETLLLKQ+              E+  KVSHAKFILHI EPG
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        ENDRLMQNGTWYFSSLKVPWHSI+ASRG DG++FIERIKL+Y QTLDIIAIDTGLLQE VAMGSAS  LNSHLLWLKRTLQAS+SNWRIVVGFHPLVTCE
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV
        NNT+SLET  KH F+S+HRIF+ENGVNAYLSRRGC++NVRIGSIAYIG+PGP Q  HF S++SSFREFLLQRVSS+ETV
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV

XP_022142998.1 uncharacterized protein LOC111012988 isoform X1 [Momordica charantia]2.4e-11072.4Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        MAD S+  TL+ QLCLC  FY +LNMG PK+YDFLKI D NPLDFYFISVWGGLRS KEETLLLKQ+              E+  K SHAKF+LHI EPG
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        ENDRLM+NGT YFSSLKVPWH IQ SRGND  YFIER+KL+Y QTLDI+AIDTGLLQES+AMGSAS+M+N+ L WLKRTLQ SNSNWRIVVGFHPLVTCE
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV
        +NT+ +ET  KH FESIH+IF+E+ VNAYLSRRGC HN+RIGS AYIG PGP QT +F SQRS+ REFLL RVS +ETV
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV

XP_038889577.1 uncharacterized protein LOC120079458 isoform X1 [Benincasa hispida]5.0e-12481.36Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        MADPS+ RTLL QLCLCIAFYLSLNMG PK+YDFLKI D+NPLDFYFISVWGGLRSVKEE LLLKQ+              E+  KVSHAKF+LHI EPG
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        ENDRLMQNG+WYFSSLKVPWHSIQ SR N GDYFIERIKL+Y+QTLD+IAIDTGLLQESVA GSAS+ +N HLLWLKRTLQASNSNWRIVVGFHPLVTCE
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV
        NNT+SLET   H FESIH+IF+EN VNAYLSRRGCTHNVRI SIAYIGVPGP QTKHFPSQRSSF EFLLQRVSS+ETV
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV

XP_038889578.1 uncharacterized protein LOC120079458 isoform X2 [Benincasa hispida]3.6e-12281Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        MADPS+ RTLL QLCLCIAFYLSLNMG PK+YDFLKI D+NPLDFYFISVWGGLRSVKEE LLLKQ+              E+  KVSHAKF+LHI EPG
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        ENDRLMQNG+WYFSSLKVPWHSIQ SR N GDYFIERIKL+Y+QTLD+IAIDTGLLQESVA GSAS+ +N HLLWLKRTLQASNSNWRIVVGFHPLVTCE
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV
        NNT+SLET   H FESIH+IF+EN  NAYLSRRGCTHNVRI SIAYIGVPGP QTKHFPSQRSSF EFLLQRVSS+ETV
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV

TrEMBL top hitse value%identityAlignment
A0A0A0LG53 Uncharacterized protein5.3e-11978.29Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        MADPS+S TLL QLCLC AFYLSLNMGR K+YDFLKI D+NPLDFYFISVWGGLRSVKEETLLLKQ+              E+  KVSHAKFILHI EPG
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        ENDRLMQNGTWYFSSLKVPWHSI+ASRG DG +FIER KL+Y QTLDIIAIDT LLQE +AMGSAS  L SHLLWLKRTLQAS+SNWRIVVGFHPL+TCE
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPG--PFQTKHFPSQRSSFREFLLQRVSSVETV
        NNT+SLET  KH FES+HRIF+ENGVNAYLSRRGCT+NVRIGSIAYIG+PG  P Q  HF S++SSFREFLLQ VSS+E V
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPG--PFQTKHFPSQRSSFREFLLQRVSSVETV

A0A1S3C339 uncharacterized protein LOC1034964697.1e-12479.93Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        MADPS+S T+L Q+CLC AFYL LNMGR K+YD LKIRDENPLDFYFISVWGGLRSVKEETLLLKQ+              E+  KVSHAKFILHI EPG
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        ENDRLMQNGTWYFSSLKVPWHSI+ASRG DG++FIERIKL+Y QTLDIIAIDTGLLQE VAMGSAS  LNSHLLWLKRTLQAS+SNWRIVVGFHPLVTCE
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV
        NNT+SLET  KH F+S+HRIF+ENGVNAYLSRRGC++NVRIGSIAYIG+PGP Q  HF S++SSFREFLLQRVSS+ETV
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV

A0A5D3BFL8 Uncharacterized protein7.1e-12479.93Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        MADPS+S T+L Q+CLC AFYL LNMGR K+YD LKIRDENPLDFYFISVWGGLRSVKEETLLLKQ+              E+  KVSHAKFILHI EPG
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        ENDRLMQNGTWYFSSLKVPWHSI+ASRG DG++FIERIKL+Y QTLDIIAIDTGLLQE VAMGSAS  LNSHLLWLKRTLQAS+SNWRIVVGFHPLVTCE
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV
        NNT+SLET  KH F+S+HRIF+ENGVNAYLSRRGC++NVRIGSIAYIG+PGP Q  HF S++SSFREFLLQRVSS+ETV
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV

A0A6J1CN20 uncharacterized protein LOC111012988 isoform X23.0e-8276.26Show/hide
Query:  EERGKVSHAKFILHICEPGENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQ
        E+  K SHAKF+LHI EPGENDRLM+NGT YFSSLKVPWH IQ SRGND  YFIER+KL+Y QTLDI+AIDTGLLQES+AMGSAS+M+N+ L WLKRTLQ
Subjt:  EERGKVSHAKFILHICEPGENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQ

Query:  ASNSNWRIVVGFHPLVTCENNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV
         SNSNWRIVVGFHPLVTCE+NT+ +ET  KH FESIH+IF+E+ VNAYLSRRGC HN+RIGS AYIG PGP QT +F SQRS+ REFLL RVS +ETV
Subjt:  ASNSNWRIVVGFHPLVTCENNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV

A0A6J1CPN0 uncharacterized protein LOC111012988 isoform X11.2e-11072.4Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        MAD S+  TL+ QLCLC  FY +LNMG PK+YDFLKI D NPLDFYFISVWGGLRS KEETLLLKQ+              E+  K SHAKF+LHI EPG
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        ENDRLM+NGT YFSSLKVPWH IQ SRGND  YFIER+KL+Y QTLDI+AIDTGLLQES+AMGSAS+M+N+ L WLKRTLQ SNSNWRIVVGFHPLVTCE
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV
        +NT+ +ET  KH FESIH+IF+E+ VNAYLSRRGC HN+RIGS AYIG PGP QT +F SQRS+ REFLL RVS +ETV
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30993.1 Calcineurin-like metallo-phosphoesterase superfamily protein5.5e-2833.63Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        M   S   T+  QL LC+  Y+SL+ G P  +      +  PLD +FISV GG R +  +T LL+ +              E+  +   AKF++   E G
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        E D  +QN T   SSLK+PW++     G    YF E IK+ +  +LD++ +DTG L++ V  G+ +  + S L  L R L+A + +WRIVVG  PL+   
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGV
           +  E K+     + H+I  + GV
Subjt:  NNTQSLETKQKHFFESIHRIFIENGV

AT4G30993.2 Calcineurin-like metallo-phosphoesterase superfamily protein9.4e-3634.16Show/hide
Query:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG
        M   S   T+  QL LC+  Y+SL+ G P  +      +  PLD +FISV GG R +  +T LL+ +              E+  +   AKF++   E G
Subjt:  MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPG

Query:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE
        E D  +QN T   SSLK+PW++     G    YF E IK+ +  +LD++ +DTG L++ V  G+ +  + S L  L R L+A + +WRIVVG  PL+   
Subjt:  ENDRLMQNGTWYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCE

Query:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFRE--FLLQRVSSVETV
           +  E K+     + H+I  + GVN Y+S +GCT      S+  I VP   + +   +     RE  FLL RVS  E V
Subjt:  NNTQSLETKQKHFFESIHRIFIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFRE--FLLQRVSSVETV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGACCCATCACTTTCTCGGACTCTCCTTACTCAGCTCTGTCTTTGTATTGCTTTCTACTTGTCTCTCAACATGGGTCGTCCTAAACACTACGATTTCTTGAAGAT
TCGGGATGAAAATCCTCTTGATTTCTACTTTATTTCCGTCTGGGGAGGCTTACGATCTGTAAAAGAAGAGACCCTTCTTCTCAAACAGATTACACTTTTTTTCAATGCTA
AGAATGTTTCATGTGAACATGAAGAGGAAAGGGGCAAGGTTTCTCATGCAAAGTTCATCTTGCACATCTGTGAACCGGGTGAAAACGATCGCCTAATGCAGAATGGTACG
TGGTATTTCTCATCTCTGAAAGTTCCATGGCACAGCATACAGGCATCAAGAGGAAATGATGGAGATTACTTTATTGAGAGAATCAAGTTGCGATATCAGCAAACATTAGA
CATTATTGCCATAGATACAGGACTGTTACAGGAGTCCGTCGCAATGGGATCAGCAAGTAACATGCTGAACAGTCATTTACTATGGCTGAAAAGGACTCTACAAGCATCAA
ACAGTAACTGGCGTATAGTTGTTGGGTTTCACCCATTGGTTACTTGTGAAAACAATACTCAATCACTGGAGACAAAGCAAAAACATTTTTTCGAGTCAATCCATCGAATC
TTTATCGAAAATGGGGTGAATGCATACCTTAGTAGGCGTGGTTGCACCCACAATGTTCGTATTGGTAGCATAGCTTACATTGGGGTTCCCGGTCCTTTTCAGACAAAACA
CTTCCCATCTCAAAGATCCTCCTTCAGAGAATTCTTACTTCAGCGCGTCAGCTCGGTAGAGACGGTGGGTACCGAGCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGACCCATCACTTTCTCGGACTCTCCTTACTCAGCTCTGTCTTTGTATTGCTTTCTACTTGTCTCTCAACATGGGTCGTCCTAAACACTACGATTTCTTGAAGAT
TCGGGATGAAAATCCTCTTGATTTCTACTTTATTTCCGTCTGGGGAGGCTTACGATCTGTAAAAGAAGAGACCCTTCTTCTCAAACAGATTACACTTTTTTTCAATGCTA
AGAATGTTTCATGTGAACATGAAGAGGAAAGGGGCAAGGTTTCTCATGCAAAGTTCATCTTGCACATCTGTGAACCGGGTGAAAACGATCGCCTAATGCAGAATGGTACG
TGGTATTTCTCATCTCTGAAAGTTCCATGGCACAGCATACAGGCATCAAGAGGAAATGATGGAGATTACTTTATTGAGAGAATCAAGTTGCGATATCAGCAAACATTAGA
CATTATTGCCATAGATACAGGACTGTTACAGGAGTCCGTCGCAATGGGATCAGCAAGTAACATGCTGAACAGTCATTTACTATGGCTGAAAAGGACTCTACAAGCATCAA
ACAGTAACTGGCGTATAGTTGTTGGGTTTCACCCATTGGTTACTTGTGAAAACAATACTCAATCACTGGAGACAAAGCAAAAACATTTTTTCGAGTCAATCCATCGAATC
TTTATCGAAAATGGGGTGAATGCATACCTTAGTAGGCGTGGTTGCACCCACAATGTTCGTATTGGTAGCATAGCTTACATTGGGGTTCCCGGTCCTTTTCAGACAAAACA
CTTCCCATCTCAAAGATCCTCCTTCAGAGAATTCTTACTTCAGCGCGTCAGCTCGGTAGAGACGGTGGGTACCGAGCAATAA
Protein sequenceShow/hide protein sequence
MADPSLSRTLLTQLCLCIAFYLSLNMGRPKHYDFLKIRDENPLDFYFISVWGGLRSVKEETLLLKQITLFFNAKNVSCEHEEERGKVSHAKFILHICEPGENDRLMQNGT
WYFSSLKVPWHSIQASRGNDGDYFIERIKLRYQQTLDIIAIDTGLLQESVAMGSASNMLNSHLLWLKRTLQASNSNWRIVVGFHPLVTCENNTQSLETKQKHFFESIHRI
FIENGVNAYLSRRGCTHNVRIGSIAYIGVPGPFQTKHFPSQRSSFREFLLQRVSSVETVGTEQ