; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg010407 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg010407
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein MNN4-like
Genome locationscaffold5:11790975..11801905
RNA-Seq ExpressionSpg010407
SyntenySpg010407
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]4.8e-2829.1Show/hide
Query:  FAKRPRMRSMDASPVVPPTVSPAKPKAKSPKAPSPKNPFPEVVRDVNFQERMEIMRKGDFLNEKGY---SNRAGALPEFVSRIISQNKWQDFCAHPQEAV
        FAKRP   S    P +    + A   + S +  S    F +   +  ++E +      + + EKG+    +     P F+S +I    WQ FC HP + +
Subjt:  FAKRPRMRSMDASPVVPPTVSPAKPKAKSPKAPSPKNPFPEVVRDVNFQERMEIMRKGDFLNEKGY---SNRAGALPEFVSRIISQNKWQDFCAHPQEAV

Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGND----VIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWL
        VPLV+EFYA L+ +  +   V    ++F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S          +L+P   VW 
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGND----VIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWL

Query:  HFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNSIQRK
        HF+ +RL+ +THG TIS  R +LLY ++ G  INVG +I ++I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++   S  R 
Subjt:  HFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNSIQRK

Query:  DKASTSQATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCRLD-----------QLRDNLRTYWAYAKERDEAIREFY
        +K+   +   +        + HT    ++ S E L       +           Q ++ L  +W Y+++RD A+++ +
Subjt:  DKASTSQATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCRLD-----------QLRDNLRTYWAYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.4e-3535.59Show/hide
Query:  KSPKAPSPKNPFPEVVRDVNFQERMEIMRKGDFLNEKGYSNRAGALPEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSS
        K+ KA   +    E   + N Q R     KG  L+    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S 
Subjt:  KSPKAPSPKNPFPEVVRDVNFQERMEIMRKGDFLNEKGYSNRAGALPEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSS

Query:  VDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEIN
          IN V+ +  P+    ++ I N++   +   L+ VA  G +W  S       + S L P   VW HF+K+ L+PTTHG T+S +R++LL+ ++ G  IN
Subjt:  VDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEIN

Query:  VGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTID-LSLIRKLQQNSIQRKDKASTSQATPQSGS
        VG +I  EI AC  +  G LFF SLIT+LC+  +     +EE+      ID +++ R  Q+   +   + S+S+    S S
Subjt:  VGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTID-LSLIRKLQQNSIQRKDKASTSQATPQSGS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.1e-4033.8Show/hide
Query:  NFQERMEIMRKGDFLNEKGYSNRAGALPEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGND
        N Q R     KG  L+    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    ++
Subjt:  NFQERMEIMRKGDFLNEKGYSNRAGALPEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGND

Query:  VIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGK
         I+N++ + +   L+ VA  G +W  S       + S L P   VW HF+K+RL+PTTHG T+S +R++LL+ ++ G  INVG +I  EI AC  +  G 
Subjt:  VIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGK

Query:  LFFGSLITQLCQRVKIVPGKDEERHFFKPTID-LSLIRKLQQNSIQRKDKASTSQ-ATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCR--LDQLRDNL
        LFF SLIT+LC+  +     +EE+      ID +++ R  Q+   +   + S+S+ AT  S        Q         S + +   +    L       
Subjt:  LFFGSLITQLCQRVKIVPGKDEERHFFKPTID-LSLIRKLQQNSIQRKDKASTSQ-ATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCR--LDQLRDNL

Query:  RTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDEEDDENDEEENEE
        + +WAY+KERD A+++   +      P FP FPQ +L   D + + E D++   E  E
Subjt:  RTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDEEDDENDEEENEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]6.4e-3338.07Show/hide
Query:  PEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKES
        P F++ +I Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +  P+    ++ + +++  ++   L+ VA  G +W  S
Subjt:  PEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKES

Query:  QTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEER
               + S L P   VW HF+K+RL+PTTHG T+S E V LLY ++ G  INVG +I  EI AC  + +G LFF SLIT +C+  +     +EE+
Subjt:  QTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.9e-2932.34Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    ++ I N++  ++   L+ VA  G +W  S       + S L P   VW HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIK

Query:  NRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNSIQRKDKAS
        +RL+PTTHG  +S +R++LL+ ++ G  INVG +I  EI AC  +  G LFF SLIT+LC+    +   +EE+      ID   + ++ Q       +  
Subjt:  NRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNSIQRKDKAS

Query:  TSQATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCRLDQLRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDEEDDENDEEE
        T      S S  A+ S            +AL     + +      + +WAY+KERD A+++   +      P FP FPQ +L   D + + E D++   E
Subjt:  TSQATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCRLDQLRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDEEDDENDEEE

Query:  NEE
          E
Subjt:  NEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.1e-3535.59Show/hide
Query:  KSPKAPSPKNPFPEVVRDVNFQERMEIMRKGDFLNEKGYSNRAGALPEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSS
        K+ KA   +    E   + N Q R     KG  L+    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S 
Subjt:  KSPKAPSPKNPFPEVVRDVNFQERMEIMRKGDFLNEKGYSNRAGALPEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSS

Query:  VDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEIN
          IN V+ +  P+    ++ I N++   +   L+ VA  G +W  S       + S L P   VW HF+K+ L+PTTHG T+S +R++LL+ ++ G  IN
Subjt:  VDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEIN

Query:  VGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTID-LSLIRKLQQNSIQRKDKASTSQATPQSGS
        VG +I  EI AC  +  G LFF SLIT+LC+  +     +EE+      ID +++ R  Q+   +   + S+S+    S S
Subjt:  VGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTID-LSLIRKLQQNSIQRKDKASTSQATPQSGS

A0A2P5BCG4 Uncharacterized protein (Fragment)5.3e-4133.8Show/hide
Query:  NFQERMEIMRKGDFLNEKGYSNRAGALPEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGND
        N Q R     KG  L+    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    ++
Subjt:  NFQERMEIMRKGDFLNEKGYSNRAGALPEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGND

Query:  VIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGK
         I+N++ + +   L+ VA  G +W  S       + S L P   VW HF+K+RL+PTTHG T+S +R++LL+ ++ G  INVG +I  EI AC  +  G 
Subjt:  VIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGK

Query:  LFFGSLITQLCQRVKIVPGKDEERHFFKPTID-LSLIRKLQQNSIQRKDKASTSQ-ATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCR--LDQLRDNL
        LFF SLIT+LC+  +     +EE+      ID +++ R  Q+   +   + S+S+ AT  S        Q         S + +   +    L       
Subjt:  LFFGSLITQLCQRVKIVPGKDEERHFFKPTID-LSLIRKLQQNSIQRKDKASTSQ-ATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCR--LDQLRDNL

Query:  RTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDEEDDENDEEENEE
        + +WAY+KERD A+++   +      P FP FPQ +L   D + + E D++   E  E
Subjt:  RTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDEEDDENDEEENEE

A0A2P5DAQ2 Uncharacterized protein3.1e-3338.07Show/hide
Query:  PEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKES
        P F++ +I Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +  P+    ++ + +++  ++   L+ VA  G +W  S
Subjt:  PEFVSRIISQNKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKES

Query:  QTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEER
               + S L P   VW HF+K+RL+PTTHG T+S E V LLY ++ G  INVG +I  EI AC  + +G LFF SLIT +C+  +     +EE+
Subjt:  QTKVKFMVPSDLKPKLAVWLHFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEER

A0A2P5DXM3 Uncharacterized protein9.4e-3032.34Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    ++ I N++  ++   L+ VA  G +W  S       + S L P   VW HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFIK

Query:  NRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNSIQRKDKAS
        +RL+PTTHG  +S +R++LL+ ++ G  INVG +I  EI AC  +  G LFF SLIT+LC+    +   +EE+      ID   + ++ Q       +  
Subjt:  NRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNSIQRKDKAS

Query:  TSQATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCRLDQLRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDEEDDENDEEE
        T      S S  A+ S            +AL     + +      + +WAY+KERD A+++   +      P FP FPQ +L   D + + E D++   E
Subjt:  TSQATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCRLDQLRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDEEDDENDEEE

Query:  NEE
          E
Subjt:  NEE

W9RBS1 Uncharacterized protein2.3e-2829.1Show/hide
Query:  FAKRPRMRSMDASPVVPPTVSPAKPKAKSPKAPSPKNPFPEVVRDVNFQERMEIMRKGDFLNEKGY---SNRAGALPEFVSRIISQNKWQDFCAHPQEAV
        FAKRP   S    P +    + A   + S +  S    F +   +  ++E +      + + EKG+    +     P F+S +I    WQ FC HP + +
Subjt:  FAKRPRMRSMDASPVVPPTVSPAKPKAKSPKAPSPKNPFPEVVRDVNFQERMEIMRKGDFLNEKGY---SNRAGALPEFVSRIISQNKWQDFCAHPQEAV

Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGND----VIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWL
        VPLV+EFYA L+ +  +   V    ++F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S          +L+P   VW 
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGND----VIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWL

Query:  HFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNSIQRK
        HF+ +RL+ +THG TIS  R +LLY ++ G  INVG +I ++I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++   S  R 
Subjt:  HFIKNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNSIQRK

Query:  DKASTSQATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCRLD-----------QLRDNLRTYWAYAKERDEAIREFY
        +K+   +   +        + HT    ++ S E L       +           Q ++ L  +W Y+++RD A+++ +
Subjt:  DKASTSQATPQSGSNVASPSQHTPFTGSSPSSEALTIAYCRLD-----------QLRDNLRTYWAYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCTCAGAGACGTGCTACTCTTGAAGAAGAAGCAAATAGGCAAGATGAAGAAGAAGCCGCCAAAGCAGCAGGGAGCTCTCGGCAAGAAGGTTCAACGGGTAAACC
TTCCAAACCTACAACTAACCCCTCTTCGTCTTGCAGTGACAAATCATTCGTTAACTACAGTGCAAGGAAGAGGAGTCCCAAAAAAGTGGTACCCGAGAAGCCGCTTGTTA
TTGAGCCCCTCAAAACCGCAAGAATGCCTCCAGACGTGTTCGAAGACATAATCCGTCGAGCTGTGGCAAAGGCTCTTGTGATTGCTGAAGGGTACAGGGTTGAACAAGAT
GCTTTGAAAGATATTGAAGCTGAGAGAGAGATGGAAAATCAACACATGATGGAGGAAGATGATTTTGCAAAGAGAAGAGATGAGGAAGAGTGCGAAGAGAAAAGAAGGAG
AGAAGAAGAGCAAGAGGTCGATAAGGCCTTAGAAGCTGAAGAAGAGAGAAAGTTTGAGGAAAACCTCAGGAGGGCAGCAATTGACTTGCAACTTCTTGAGGAAGAGAAAA
AGAGAAGAGAAGAATTAAGAGAATATGAGAAAAGAAAGAAGGAAGCTGAAGACTTCTTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAAGCTGAAATGTTGCAA
GGGAGGGTAGAAGAAGAGGCCCAACAGGGGCCAACTGAAGAAATTTTAGAAAAAGAAAAAGAAAGAGAAGTAGAGGATGAAGGCCAGAATGCAACCGCATCTGGGCTGCA
TTTTGAAGAAGGCCTAGCCGAGGCCACCATTGAGCAGCCTGCTGATGAGGTTCTCGAACCTCTATTCAAGGATGACCCACCAGCAGCTGATAGAACCTCTTCGGGAGAGA
AGAGGGATAGAGAAGAAAAGGAAAGCGAGGAGGCCGAGACCTCCACTGACTCTGATACAGAATCCGATTCAAAGATAAAGGAGCTGGATGATGACCAAGTTCCTATCTCT
GCAGCGTTGAGGAGAAAGAGAAAAAGAAAGATAAAGGCTGAGAGGAGGACAAAGAACAAGAATGATCCAATCTTTGCCAAGAGGCCGAGGATGAGGTCCATGGATGCCTC
TCCTGTAGTTCCTCCTACCGTCTCACCCGCCAAACCCAAGGCCAAGTCACCGAAAGCTCCATCTCCCAAAAATCCCTTTCCTGAGGTAGTCAGAGATGTCAATTTTCAGG
AAAGGATGGAGATAATGAGAAAGGGAGATTTCCTCAATGAGAAGGGATACTCTAATAGAGCAGGAGCACTGCCAGAGTTCGTGAGCAGGATCATTTCGCAGAACAAATGG
CAGGACTTTTGTGCTCACCCCCAGGAGGCTGTAGTGCCTTTGGTTCGTGAATTTTATGCTGGCCTTAGGGAAGAGAGCATAAGCATGGCGGTGGTGAGAGGGAAGATGGT
CAGTTTCTCCTCAGTCGATATTAATAGGGTGTATAGGATCAAGGCACCTCTACATCCTAGAGGGAATGATGTGATAAGGAACCTTTCGGCCAAACAAATGAAGGAGGCTC
TGAAACTTGTGGCCAATAAAGGGGTCCAATGGAAAGAATCTCAGACAAAAGTGAAATTCATGGTGCCAAGCGATCTAAAGCCAAAATTGGCAGTTTGGCTTCACTTCATC
AAGAACCGTTTGATGCCAACCACCCACGGCAATACCATCTCAGTAGAGAGAGTTATGCTCCTCTACTGCATTATGAAGGGGTTGGAAATCAACGTGGGGAGCATAATAAG
GGAGGAGATCCTAGCCTGTGGAAGAAAAACAGCAGGTAAGCTTTTCTTTGGATCACTCATCACCCAGCTTTGTCAAAGGGTGAAGATAGTTCCGGGCAAGGACGAGGAGC
GTCATTTCTTCAAGCCGACCATTGACCTGTCTTTGATCAGGAAGCTCCAACAGAACAGTATCCAAAGGAAAGACAAAGCCTCGACATCTCAGGCTACTCCTCAATCAGGG
TCGAATGTAGCTTCTCCATCCCAGCACACTCCTTTTACAGGGTCGTCACCGTCTTCAGAAGCCTTAACTATTGCCTACTGCCGGTTAGATCAACTCAGGGACAACCTGAG
AACATATTGGGCATATGCAAAGGAGCGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCGCCCCAAGCATTGCTCCGGTCTTTCCCGATTTCCCTCAGTCGCTGCTGC
CTCAAGAAGACAAGGATTCTGATGAAGAAGATGATGAGAATGATGAAGAAGAGAATGAAGAGAAAGAGAGTTCCTCAGACGAGGACTATGGGAGTTTTCTGACCCCTTTA
CTTGCTGTTTTTGAAAACAGAGGAAATGCTGGAATCTGTCCAGAAATGCGACCGCATTTCTTGGAAGGCAAAATGAAATGCGACCGCATTTCTGGAAAAACCGAGACTGT
TCCGAGTCATCCGCGGGTCGTTGTTGACGAGTCTTCTTCGCACCTAACCGGCCGTTGCATTCGTGAATTATCTATGTCGGAGTCGAGTCACAGAAATGTGCATAACGTTA
GTCCTAATGGGATCGATACCCTTGGAATACTTCTAAGGATCCCACAATCAGTTCCAAGGCCTGAGGATAGTAGAGAAGATCCAAGTGGTTGTCCAAAAGTCGTTCGTGTG
AATTATAAAGCATCGAATTATACGAATTGGGAGCTCCAAGAGTGTTTCTTGGAGATTTTTCCAACAAGATTTGGAGATCTTGGGGCTGCTGTAAATCAGAGAAGAAAACT
CAAGGAGTTGGCGTTTCGATCAAGACTACGTTCGACAAAGTTGTCGCAGCGTCGCGACGCTACGCAGACAACGTCGCGACGCTACCGCGATTTTGGAATTTTCCTCGCTG
ATCACCTAGAGTCGAGACGCTACAGTCCTAGCGTCTCGACGCTAGGCCTTCTAGAAGCCTTAAACACGGATTTTGGCCTCCTTTCTTCTTTCTTTTGGGCTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGCTCAGAGACGTGCTACTCTTGAAGAAGAAGCAAATAGGCAAGATGAAGAAGAAGCCGCCAAAGCAGCAGGGAGCTCTCGGCAAGAAGGTTCAACGGGTAAACC
TTCCAAACCTACAACTAACCCCTCTTCGTCTTGCAGTGACAAATCATTCGTTAACTACAGTGCAAGGAAGAGGAGTCCCAAAAAAGTGGTACCCGAGAAGCCGCTTGTTA
TTGAGCCCCTCAAAACCGCAAGAATGCCTCCAGACGTGTTCGAAGACATAATCCGTCGAGCTGTGGCAAAGGCTCTTGTGATTGCTGAAGGGTACAGGGTTGAACAAGAT
GCTTTGAAAGATATTGAAGCTGAGAGAGAGATGGAAAATCAACACATGATGGAGGAAGATGATTTTGCAAAGAGAAGAGATGAGGAAGAGTGCGAAGAGAAAAGAAGGAG
AGAAGAAGAGCAAGAGGTCGATAAGGCCTTAGAAGCTGAAGAAGAGAGAAAGTTTGAGGAAAACCTCAGGAGGGCAGCAATTGACTTGCAACTTCTTGAGGAAGAGAAAA
AGAGAAGAGAAGAATTAAGAGAATATGAGAAAAGAAAGAAGGAAGCTGAAGACTTCTTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAAGCTGAAATGTTGCAA
GGGAGGGTAGAAGAAGAGGCCCAACAGGGGCCAACTGAAGAAATTTTAGAAAAAGAAAAAGAAAGAGAAGTAGAGGATGAAGGCCAGAATGCAACCGCATCTGGGCTGCA
TTTTGAAGAAGGCCTAGCCGAGGCCACCATTGAGCAGCCTGCTGATGAGGTTCTCGAACCTCTATTCAAGGATGACCCACCAGCAGCTGATAGAACCTCTTCGGGAGAGA
AGAGGGATAGAGAAGAAAAGGAAAGCGAGGAGGCCGAGACCTCCACTGACTCTGATACAGAATCCGATTCAAAGATAAAGGAGCTGGATGATGACCAAGTTCCTATCTCT
GCAGCGTTGAGGAGAAAGAGAAAAAGAAAGATAAAGGCTGAGAGGAGGACAAAGAACAAGAATGATCCAATCTTTGCCAAGAGGCCGAGGATGAGGTCCATGGATGCCTC
TCCTGTAGTTCCTCCTACCGTCTCACCCGCCAAACCCAAGGCCAAGTCACCGAAAGCTCCATCTCCCAAAAATCCCTTTCCTGAGGTAGTCAGAGATGTCAATTTTCAGG
AAAGGATGGAGATAATGAGAAAGGGAGATTTCCTCAATGAGAAGGGATACTCTAATAGAGCAGGAGCACTGCCAGAGTTCGTGAGCAGGATCATTTCGCAGAACAAATGG
CAGGACTTTTGTGCTCACCCCCAGGAGGCTGTAGTGCCTTTGGTTCGTGAATTTTATGCTGGCCTTAGGGAAGAGAGCATAAGCATGGCGGTGGTGAGAGGGAAGATGGT
CAGTTTCTCCTCAGTCGATATTAATAGGGTGTATAGGATCAAGGCACCTCTACATCCTAGAGGGAATGATGTGATAAGGAACCTTTCGGCCAAACAAATGAAGGAGGCTC
TGAAACTTGTGGCCAATAAAGGGGTCCAATGGAAAGAATCTCAGACAAAAGTGAAATTCATGGTGCCAAGCGATCTAAAGCCAAAATTGGCAGTTTGGCTTCACTTCATC
AAGAACCGTTTGATGCCAACCACCCACGGCAATACCATCTCAGTAGAGAGAGTTATGCTCCTCTACTGCATTATGAAGGGGTTGGAAATCAACGTGGGGAGCATAATAAG
GGAGGAGATCCTAGCCTGTGGAAGAAAAACAGCAGGTAAGCTTTTCTTTGGATCACTCATCACCCAGCTTTGTCAAAGGGTGAAGATAGTTCCGGGCAAGGACGAGGAGC
GTCATTTCTTCAAGCCGACCATTGACCTGTCTTTGATCAGGAAGCTCCAACAGAACAGTATCCAAAGGAAAGACAAAGCCTCGACATCTCAGGCTACTCCTCAATCAGGG
TCGAATGTAGCTTCTCCATCCCAGCACACTCCTTTTACAGGGTCGTCACCGTCTTCAGAAGCCTTAACTATTGCCTACTGCCGGTTAGATCAACTCAGGGACAACCTGAG
AACATATTGGGCATATGCAAAGGAGCGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCGCCCCAAGCATTGCTCCGGTCTTTCCCGATTTCCCTCAGTCGCTGCTGC
CTCAAGAAGACAAGGATTCTGATGAAGAAGATGATGAGAATGATGAAGAAGAGAATGAAGAGAAAGAGAGTTCCTCAGACGAGGACTATGGGAGTTTTCTGACCCCTTTA
CTTGCTGTTTTTGAAAACAGAGGAAATGCTGGAATCTGTCCAGAAATGCGACCGCATTTCTTGGAAGGCAAAATGAAATGCGACCGCATTTCTGGAAAAACCGAGACTGT
TCCGAGTCATCCGCGGGTCGTTGTTGACGAGTCTTCTTCGCACCTAACCGGCCGTTGCATTCGTGAATTATCTATGTCGGAGTCGAGTCACAGAAATGTGCATAACGTTA
GTCCTAATGGGATCGATACCCTTGGAATACTTCTAAGGATCCCACAATCAGTTCCAAGGCCTGAGGATAGTAGAGAAGATCCAAGTGGTTGTCCAAAAGTCGTTCGTGTG
AATTATAAAGCATCGAATTATACGAATTGGGAGCTCCAAGAGTGTTTCTTGGAGATTTTTCCAACAAGATTTGGAGATCTTGGGGCTGCTGTAAATCAGAGAAGAAAACT
CAAGGAGTTGGCGTTTCGATCAAGACTACGTTCGACAAAGTTGTCGCAGCGTCGCGACGCTACGCAGACAACGTCGCGACGCTACCGCGATTTTGGAATTTTCCTCGCTG
ATCACCTAGAGTCGAGACGCTACAGTCCTAGCGTCTCGACGCTAGGCCTTCTAGAAGCCTTAAACACGGATTTTGGCCTCCTTTCTTCTTTCTTTTGGGCTTTTTAG
Protein sequenceShow/hide protein sequence
MGAQRRATLEEEANRQDEEEAAKAAGSSRQEGSTGKPSKPTTNPSSSCSDKSFVNYSARKRSPKKVVPEKPLVIEPLKTARMPPDVFEDIIRRAVAKALVIAEGYRVEQD
ALKDIEAEREMENQHMMEEDDFAKRRDEEECEEKRRREEEQEVDKALEAEEERKFEENLRRAAIDLQLLEEEKKRREELREYEKRKKEAEDFFAAFEPLHKAQSEAEMLQ
GRVEEEAQQGPTEEILEKEKEREVEDEGQNATASGLHFEEGLAEATIEQPADEVLEPLFKDDPPAADRTSSGEKRDREEKESEEAETSTDSDTESDSKIKELDDDQVPIS
AALRRKRKRKIKAERRTKNKNDPIFAKRPRMRSMDASPVVPPTVSPAKPKAKSPKAPSPKNPFPEVVRDVNFQERMEIMRKGDFLNEKGYSNRAGALPEFVSRIISQNKW
QDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLHPRGNDVIRNLSAKQMKEALKLVANKGVQWKESQTKVKFMVPSDLKPKLAVWLHFI
KNRLMPTTHGNTISVERVMLLYCIMKGLEINVGSIIREEILACGRKTAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNSIQRKDKASTSQATPQSG
SNVASPSQHTPFTGSSPSSEALTIAYCRLDQLRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDEEDDENDEEENEEKESSSDEDYGSFLTPL
LAVFENRGNAGICPEMRPHFLEGKMKCDRISGKTETVPSHPRVVVDESSSHLTGRCIRELSMSESSHRNVHNVSPNGIDTLGILLRIPQSVPRPEDSREDPSGCPKVVRV
NYKASNYTNWELQECFLEIFPTRFGDLGAAVNQRRKLKELAFRSRLRSTKLSQRRDATQTTSRRYRDFGIFLADHLESRRYSPSVSTLGLLEALNTDFGLLSSFFWAF