; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004465 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004465
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionMyocardial zonula adherens protein-like
Genome locationChr08:17331049..17333283
RNA-Seq ExpressionHG10004465
SyntenyHG10004465
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585310.1 hypothetical protein SDJN03_18043, partial [Cucurbita argyrosperma subsp. sororia]1.3e-7360.3Show/hide
Query:  LPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSII
        +PQE+D YIKESIDH LGLP+ST+TL LKL ASE+T +RLEAQCLSLQSKLNQHI +T +A+ EA ++A+GLKKAIE+N++LS+ENSR  +Q K+MK I 
Subjt:  LPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSII

Query:  NRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLI-GETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRAL
        +RGK+AI  AKE + R  +L DE  ++  DL FFK+QYQM KF  +SMEEVLVE+ L+ L+ G+T  +  +FL+AN+ NDSCRRLLHLSQSL P T RAL
Subjt:  NRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLI-GETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRAL

Query:  ILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSHENGGGIGKPPAGSSSAKVHEKK
         LAA+IK LE++N QL + L   QGEV LLSD+IG LLED+  S  +     KPPA SSS KV + +
Subjt:  ILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSHENGGGIGKPPAGSSSAKVHEKK

KAG7020228.1 hypothetical protein SDJN02_16911, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-7260.84Show/hide
Query:  IDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSIINRGK
        +D YIKESIDH LGLP+ST+TL LKL ASE+T +RLEAQCLSLQSKLNQHI +T +A+ EA ++A+GLKKAIE+N++LS+ENSR  +Q K+MK I +RGK
Subjt:  IDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSIINRGK

Query:  LAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLI-GETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAA
        +AI  AKE + R  +L DE  ++  DL FFK+QYQM KF  +SMEEVLVE+AL+ L+ G+T S+  +FL+AN+ NDSCRRLLHLSQSL P T RAL LAA
Subjt:  LAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLI-GETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAA

Query:  KIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSHENGGGIGKPPAGSSSAKVHEKK
        +IK LE++N QL + L   QGEV LLSD+IG LLED+  S  +     KPPA SSS KV + +
Subjt:  KIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSHENGGGIGKPPAGSSSAKVHEKK

XP_022131622.1 uncharacterized protein LOC111004753 [Momordica charantia]4.8e-7160.67Show/hide
Query:  SLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSI
        ++PQEID YIKESI H LGLP+S  TLELKL+ASE+  Q L  QCLSLQSKLN  I+A  S+K EAKL+A+ LKKAIEKN  LSEENS    QCKK+K+ 
Subjt:  SLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSI

Query:  INRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLIGETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRAL
         N+ K+A+DRA E   RV +LE E+ E+  DL FF++QYQ  KFH SSMEE LV+S L+  IGET++ A EFLL N    SCRRLL+LS SL P T +AL
Subjt:  INRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLIGETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRAL

Query:  ILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQT-TSHENGGGI---GKPPAGSSSAKV
         LAAKIK LEK+ E+L +LL T Q EVK L++QI YL+EDQT TSHE G      GKPPA S  A V
Subjt:  ILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQT-TSHENGGGI---GKPPAGSSSAKV

XP_022951817.1 uncharacterized protein LOC111454550 [Cucurbita moschata]9.7e-4856.1Show/hide
Query:  LNARGLKKAIEKNQKLSEENSRWVEQCKKMKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLI-GETD
        ++A+GLKKAIE+N++LS+ENSR  +Q KKMK I +RGK+AI   KE + R  +L +E  ++  DL FFK+Q+QM KF  +SMEEVLVE+AL+ L+  +T 
Subjt:  LNARGLKKAIEKNQKLSEENSRWVEQCKKMKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLI-GETD

Query:  SSALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSHENGGGIGKPPAGSSSAKVHEK
        S+  +FL+AN+ NDSCRRLLHLSQSL P T RAL LAA+IK LE++N QL + L   QGEV LLSD+IG LLED+  S  +     KPPA SSS K+ + 
Subjt:  SSALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSHENGGGIGKPPAGSSSAKVHEK

Query:  KRPLE
        KRP E
Subjt:  KRPLE

XP_038885825.1 uncharacterized protein LOC120076121 [Benincasa hispida]2.6e-8582.38Show/hide
Query:  EEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLIG
        EEAKLNA+GLKKAIEKNQ+LSEENSR VEQCKKMK+ +NRGKLA+DRAKETQ+RVH+LEDEVEE+KLDLKFFKDQ+QM KF  +SMEE+LVESALSQLIG
Subjt:  EEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLIG

Query:  ETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSH-ENGGGIGKPPAGSSSAK
        E +SSA EFLLANIENDSCRRLLHLS+SLGP + R L LAAKIK+LEKDNEQL +LLCT QGE KLLSDQIGYLLE+QTTS   NGGGIGKPPAGSSS K
Subjt:  ETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSH-ENGGGIGKPPAGSSSAK

Query:  VHEKKRPLED
        VHE+KR LED
Subjt:  VHEKKRPLED

TrEMBL top hitse value%identityAlignment
A0A2N9IYL3 Uncharacterized protein2.5e-4140.69Show/hide
Query:  MSLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKK---
        M LPQEID YIK+SIDH LGLP+ST+TLELKL  S+   +RL ++CLSLQ KL +     +  + E+ +NA+ LKK +E+NQKL+ E +  + QC K   
Subjt:  MSLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKK---

Query:  --------MKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSS----MEEVLVESALSQLIGETD-SSALEFLLANIENDSCR
                 +++++ G  A +RAKE +IRVH+LE+EV  +  +L+F+K + +M     S+    +E  L+ES L+ LI + D +SA  FL AN  ++SC+
Subjt:  --------MKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSS----MEEVLVESALSQLIGETD-SSALEFLLANIENDSCR

Query:  RLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLED------QTTSHENGGGIGKPPAGSSSAKVHEKK
        RLL +   L P T + L LAA++K LEKD + L + L   + EVKLL ++   L E+      Q     +  G G    G++SAK +++K
Subjt:  RLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLED------QTTSHENGGGIGKPPAGSSSAKVHEKK

A0A5N6RMD9 Uncharacterized protein9.5e-4141.03Show/hide
Query:  MSLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKK---
        M +P EID Y+KESIDH LGLP+ST TLELKL ASE   +RL  +CL LQ+KL +        + EA +NA+ LKK +E+NQKL+ E +  + QC K   
Subjt:  MSLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKK---

Query:  --------MKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLS----SMEEVLVESALSQLIGETD-SSALEFLLANIENDSCR
                 +++++ G  A  RAKE +IRVH+LE++V  +  +L+F+K +Y M    LS    S E  L++S L+ LI + D +S   FL AN  ++SC+
Subjt:  --------MKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLS----SMEEVLVESALSQLIGETD-SSALEFLLANIENDSCR

Query:  RLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLED------QTTSHENGGGIGKPPAGSSSAKVHEKK
        RLL +   L P T + L LAA++K+LEKD E L + L   + EVKLL  +   L E+      Q     +  G      GS+SAK +++K
Subjt:  RLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLED------QTTSHENGGGIGKPPAGSSSAKVHEKK

A0A6J1BRI9 uncharacterized protein LOC1110047532.3e-7160.67Show/hide
Query:  SLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSI
        ++PQEID YIKESI H LGLP+S  TLELKL+ASE+  Q L  QCLSLQSKLN  I+A  S+K EAKL+A+ LKKAIEKN  LSEENS    QCKK+K+ 
Subjt:  SLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSI

Query:  INRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLIGETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRAL
         N+ K+A+DRA E   RV +LE E+ E+  DL FF++QYQ  KFH SSMEE LV+S L+  IGET++ A EFLL N    SCRRLL+LS SL P T +AL
Subjt:  INRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLIGETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRAL

Query:  ILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQT-TSHENGGGI---GKPPAGSSSAKV
         LAAKIK LEK+ E+L +LL T Q EVK L++QI YL+EDQT TSHE G      GKPPA S  A V
Subjt:  ILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQT-TSHENGGGI---GKPPAGSSSAKV

A0A6J1GIQ2 uncharacterized protein LOC1114545504.7e-4856.1Show/hide
Query:  LNARGLKKAIEKNQKLSEENSRWVEQCKKMKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLI-GETD
        ++A+GLKKAIE+N++LS+ENSR  +Q KKMK I +RGK+AI   KE + R  +L +E  ++  DL FFK+Q+QM KF  +SMEEVLVE+AL+ L+  +T 
Subjt:  LNARGLKKAIEKNQKLSEENSRWVEQCKKMKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLI-GETD

Query:  SSALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSHENGGGIGKPPAGSSSAKVHEK
        S+  +FL+AN+ NDSCRRLLHLSQSL P T RAL LAA+IK LE++N QL + L   QGEV LLSD+IG LLED+  S  +     KPPA SSS K+ + 
Subjt:  SSALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYLLEDQTTSHENGGGIGKPPAGSSSAKVHEK

Query:  KRPLE
        KRP E
Subjt:  KRPLE

A0A6J5WBJ5 Uncharacterized protein1.2e-4042.41Show/hide
Query:  MSLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKK---
        M LPQ+ID YIKESIDH LGLP+ST TLELKL  SE + QRL+ Q   L +K+ +       A+ EA +NA+ L+K +E+NQ+L+ E +  V QC K   
Subjt:  MSLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKK---

Query:  --------MKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSS----MEEVLVESALSQLIGETDS-SALEFLLANIENDSCR
                 +++++ G  A  RAKE +IRV +LEDEV+E++ +L F+K + +M     SS    + E L+ES L+ LI + ++ SA  FL AN  N+SC+
Subjt:  --------MKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSS----MEEVLVESALSQLIGETDS-SALEFLLANIENDSCR

Query:  RLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYL------LEDQTTSHENGGGIGKPPAGSSSAKVHEKK
         LL +   L P T + L L AK+K LEKD E L M LCT + EVKLL ++   L      L  Q     N  G G     S+SAK +++K
Subjt:  RLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQIGYL------LEDQTTSHENGGGIGKPPAGSSSAKVHEKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G14680.1 unknown protein2.0e-3036.69Show/hide
Query:  MSLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKK---
        M L +E+D +IKE+IDH LGLPIS   L+ KL  +E + +RL  Q LSL S+L +     +  + EA +NA+ LKK +E+NQKL  E    V QCKK   
Subjt:  MSLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKK---

Query:  --------MKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLIGETDSSALE--FLLANIENDS-CRRL
                 +S++  G    +RA+E + RV +LE+EV ++  ++          K  + S E+ LV+S L+  + + +S +L   FL AN E D  C  L
Subjt:  --------MKSIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLIGETDSSALE--FLLANIENDS-CRRL

Query:  LHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQ
        L     L P T + + L + +K +EK+ E L M L   + EV+L+S+Q
Subjt:  LHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLSDQ

AT4G09060.1 unknown protein6.7e-3135.92Show/hide
Query:  SLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMK--
        +L QE++ YIK++IDH LGLPIS ++L+ KL  +E + +RL  Q   L S+L +     +  + EA +NA+ LKK +E+NQKL+ E    + QCKK++  
Subjt:  SLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMK--

Query:  ---------SIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFH--LSSMEEVLVESALSQLIGETDS-SALEFLLANIENDSCRRLL
                 +++  G  + +RA+E + RV +LEDE+  +  +++ FK Q   G+     + +EE L++S L  LI + ++     FL ANI + SC+ LL
Subjt:  ---------SIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFH--LSSMEEVLVESALSQLIGETDS-SALEFLLANIENDSCRRLL

Query:  HLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLS
             L P T + L L +K K  EK+ E + M L   + EV+L+S
Subjt:  HLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLS

AT4G09060.2 unknown protein5.3e-2833.21Show/hide
Query:  SLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMK--
        +L QE++ YIK++IDH LGLPIS ++L+ KL  +E + +RL  Q   L S+L +     +  + EA +NA+ LKK +E+NQKL+ E    + QCKK++  
Subjt:  SLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMK--

Query:  ---------SIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMG-------------KFHL---------------SSMEEVLVESALSQL
                 +++  G  + +RA+E + RV +LEDE+  +  +++ FK Q   G             K +L               + +EE L++S L  L
Subjt:  ---------SIINRGKLAIDRAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMG-------------KFHL---------------SSMEEVLVESALSQL

Query:  IGETDS-SALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLS
        I + ++     FL ANI + SC+ LL     L P T + L L +K K  EK+ E + M L   + EV+L+S
Subjt:  IGETDS-SALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHMLLCTTQGEVKLLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCTTCCCCAAGAAATCGACCATTACATCAAAGAATCAATAGACCATATTCTTGGCCTTCCGATCTCCACAAAAACTCTTGAATTGAAACTCCTTGCTTCTGAATC
CACAAACCAAAGACTTGAAGCCCAATGCCTTTCTTTACAGTCTAAATTAAACCAACACATTGTCGCTACTAACAGTGCCAAGGAGGAAGCCAAATTGAACGCACGAGGCT
TAAAGAAGGCAATTGAGAAAAACCAAAAATTGAGTGAAGAGAATTCACGTTGGGTGGAACAATGCAAGAAAATGAAATCCATTATAAACAGAGGAAAATTGGCAATTGAT
CGAGCCAAAGAAACCCAAATTCGTGTGCATCAGTTGGAAGATGAAGTTGAAGAAATAAAATTAGATTTGAAATTCTTCAAGGATCAATACCAAATGGGCAAGTTTCACTT
ATCTTCTATGGAGGAAGTTTTGGTTGAGTCAGCTTTATCCCAATTAATTGGAGAAACAGATTCCTCTGCTCTTGAGTTCTTGTTGGCAAATATTGAGAATGATTCATGTC
GGAGATTACTACATTTGTCGCAGAGTTTAGGGCCTCCAACTCTTAGAGCTTTGATTTTGGCTGCTAAAATAAAGGCCCTTGAGAAAGATAATGAGCAACTTCATATGCTA
CTCTGTACAACTCAAGGGGAGGTGAAACTGTTATCCGATCAAATCGGTTATCTTTTGGAGGACCAAACGACGTCGCATGAGAATGGAGGAGGGATTGGTAAACCACCAGC
TGGCAGTTCCTCTGCGAAGGTGCATGAAAAGAAAAGACCATTGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCTTCCCCAAGAAATCGACCATTACATCAAAGAATCAATAGACCATATTCTTGGCCTTCCGATCTCCACAAAAACTCTTGAATTGAAACTCCTTGCTTCTGAATC
CACAAACCAAAGACTTGAAGCCCAATGCCTTTCTTTACAGTCTAAATTAAACCAACACATTGTCGCTACTAACAGTGCCAAGGAGGAAGCCAAATTGAACGCACGAGGCT
TAAAGAAGGCAATTGAGAAAAACCAAAAATTGAGTGAAGAGAATTCACGTTGGGTGGAACAATGCAAGAAAATGAAATCCATTATAAACAGAGGAAAATTGGCAATTGAT
CGAGCCAAAGAAACCCAAATTCGTGTGCATCAGTTGGAAGATGAAGTTGAAGAAATAAAATTAGATTTGAAATTCTTCAAGGATCAATACCAAATGGGCAAGTTTCACTT
ATCTTCTATGGAGGAAGTTTTGGTTGAGTCAGCTTTATCCCAATTAATTGGAGAAACAGATTCCTCTGCTCTTGAGTTCTTGTTGGCAAATATTGAGAATGATTCATGTC
GGAGATTACTACATTTGTCGCAGAGTTTAGGGCCTCCAACTCTTAGAGCTTTGATTTTGGCTGCTAAAATAAAGGCCCTTGAGAAAGATAATGAGCAACTTCATATGCTA
CTCTGTACAACTCAAGGGGAGGTGAAACTGTTATCCGATCAAATCGGTTATCTTTTGGAGGACCAAACGACGTCGCATGAGAATGGAGGAGGGATTGGTAAACCACCAGC
TGGCAGTTCCTCTGCGAAGGTGCATGAAAAGAAAAGACCATTGGAAGATTGA
Protein sequenceShow/hide protein sequence
MSLPQEIDHYIKESIDHILGLPISTKTLELKLLASESTNQRLEAQCLSLQSKLNQHIVATNSAKEEAKLNARGLKKAIEKNQKLSEENSRWVEQCKKMKSIINRGKLAID
RAKETQIRVHQLEDEVEEIKLDLKFFKDQYQMGKFHLSSMEEVLVESALSQLIGETDSSALEFLLANIENDSCRRLLHLSQSLGPPTLRALILAAKIKALEKDNEQLHML
LCTTQGEVKLLSDQIGYLLEDQTTSHENGGGIGKPPAGSSSAKVHEKKRPLED