; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031032 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031032
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein MNN4-like
Genome locationscaffold10:22422915..22428981
RNA-Seq ExpressionSpg031032
SyntenySpg031032
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]1.0e-2628.27Show/hide
Query:  FAKRPRTRSMDAFPTVPPTISPAKPKGKSQKAVSPKNPFPEVFKDVNFQERIEIMKKRDFLNEKGF---SNRAGALPEFITGVIFQYKRQEFCAHPQEAV
        FAKRP   S + +       + A P   S + VS    F +   +  ++E I     R+ + EKGF    +     P FI+ VI     Q FC HP + +
Subjt:  FAKRPRTRSMDAFPTVPPTISPAKPKGKSQKAVSPKNPFPEVFKDVNFQERIEIMKKRDFLNEKGF---SNRAGALPEFITGVIFQYKRQEFCAHPQEAV

Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWL
        VPLV+EFYA L+ +  +   V    ++F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     +    +L+P + +W 
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWL

Query:  HFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKLQQNSIQRK
        HF+ +RL+ +TH  TIS +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     + R      +DL  I ++     ++ 
Subjt:  HFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKLQQNSIQRK

Query:  DKA-STSQATPPAGPNID---------------------------------RLRDDLRTYWTYAKERDEAIREFY
        +K     +   P+ P+                                   + ++ L  +W Y+++RD A+++ +
Subjt:  DKA-STSQATPPAGPNID---------------------------------RLRDDLRTYWTYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.4e-3434.47Show/hide
Query:  KSQKAVSPKNPFPEVFKDVNFQERIEIMKKRDFLNEKGF----SNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMV
        K+ KAV  +    E   + N Q        R    EKGF    S   G LP FI  VI Q+  ++FCAHP++ +VPLVREFYA L +   +   VRG  V
Subjt:  KSQKAVSPKNPFPEVFKDVNFQERIEIMKKRDFLNEKGF----SNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMV

Query:  SFSSVDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKG
        S+S   IN V+ +  P++   ++ I N +   +   L+ VA  G +W  S     + + S L P + +W HF+K+ L+PTTH  T+S DR++LL+ ++ G
Subjt:  SFSSVDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKG

Query:  LEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPPAGPNIDRLRDDL
          INVG +I  EI AC  ++ G LFF SLIT+LC+  +     ++E+      ID   + ++ Q       +  +S  + PA  +  R   D+
Subjt:  LEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPPAGPNIDRLRDDL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.4e-3833.14Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP FI  VI Q+  ++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG
         + + +   L+ VA  G +W  S     + + S L P + +W HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKL---------QQNSIQRKDKASTSQATPPAGPNIDRLRDDL---------------------RTYW
        SLIT+LC+  +     ++E+      ID   + ++         QQ S  R   AS+++        +  L   L                     + +W
Subjt:  SLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKL---------QQNSIQRKDKASTSQATPPAGPNIDRLRDDL---------------------RTYW

Query:  TYAKERDEAIREFYLSITPRIAPVFPNFPQSLLPKEEEDSEEDEENDDEDDEE
         Y+KERD A+++   +   R  P FP FPQ +L  ++ D E + E+D +   E
Subjt:  TYAKERDEAIREFYLSITPRIAPVFPNFPQSLLPKEEEDSEEDEENDDEDDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.4e-3134.62Show/hide
Query:  SQKAVSPKNPFPEVFKDVNFQER-IEIMKKRDFLNEKGFSNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSS
        + KAV  ++   E+  + N Q R + + K+  + N K         P FI  VI Q+  Q FCAHP++ +VPLVREFY  +         +RG  V  S 
Subjt:  SQKAVSPKNPFPEVFKDVNFQER-IEIMKKRDFLNEKGFSNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSS

Query:  VDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN
          IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + +W HF+K+RL+PTTH  T+S + V LLY ++ G  IN
Subjt:  VDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN

Query:  VGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK
        VG +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  VGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]7.7e-3032.88Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     + + S L P + +W HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIK

Query:  NRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKL---------QQN
        +RL+PTTH   +S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+    +  ++K  +     ID   + ++         QQ 
Subjt:  NRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKL---------QQN

Query:  SIQRKDKASTSQATPPAGPNIDRLRDDL----------RTYWTYAKERDEAIREFYLSITPRIAPVFPNFPQSLLPKEEEDSEEDEENDDEDDEE
        S  R   AS+S+        +  L   L          + +W Y+KERD A+++   +   R  P FP FPQ +L  ++ D E + E+D +   E
Subjt:  SIQRKDKASTSQATPPAGPNIDRLRDDL----------RTYWTYAKERDEAIREFYLSITPRIAPVFPNFPQSLLPKEEEDSEEDEENDDEDDEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)6.5e-3534.47Show/hide
Query:  KSQKAVSPKNPFPEVFKDVNFQERIEIMKKRDFLNEKGF----SNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMV
        K+ KAV  +    E   + N Q        R    EKGF    S   G LP FI  VI Q+  ++FCAHP++ +VPLVREFYA L +   +   VRG  V
Subjt:  KSQKAVSPKNPFPEVFKDVNFQERIEIMKKRDFLNEKGF----SNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMV

Query:  SFSSVDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKG
        S+S   IN V+ +  P++   ++ I N +   +   L+ VA  G +W  S     + + S L P + +W HF+K+ L+PTTH  T+S DR++LL+ ++ G
Subjt:  SFSSVDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKG

Query:  LEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPPAGPNIDRLRDDL
          INVG +I  EI AC  ++ G LFF SLIT+LC+  +     ++E+      ID   + ++ Q       +  +S  + PA  +  R   D+
Subjt:  LEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPPAGPNIDRLRDDL

A0A2P5BCG4 Uncharacterized protein (Fragment)1.7e-3833.14Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP FI  VI Q+  ++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG
         + + +   L+ VA  G +W  S     + + S L P + +W HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKL---------QQNSIQRKDKASTSQATPPAGPNIDRLRDDL---------------------RTYW
        SLIT+LC+  +     ++E+      ID   + ++         QQ S  R   AS+++        +  L   L                     + +W
Subjt:  SLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKL---------QQNSIQRKDKASTSQATPPAGPNIDRLRDDL---------------------RTYW

Query:  TYAKERDEAIREFYLSITPRIAPVFPNFPQSLLPKEEEDSEEDEENDDEDDEE
         Y+KERD A+++   +   R  P FP FPQ +L  ++ D E + E+D +   E
Subjt:  TYAKERDEAIREFYLSITPRIAPVFPNFPQSLLPKEEEDSEEDEENDDEDDEE

A0A2P5DAQ2 Uncharacterized protein6.8e-3234.62Show/hide
Query:  SQKAVSPKNPFPEVFKDVNFQER-IEIMKKRDFLNEKGFSNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSS
        + KAV  ++   E+  + N Q R + + K+  + N K         P FI  VI Q+  Q FCAHP++ +VPLVREFY  +         +RG  V  S 
Subjt:  SQKAVSPKNPFPEVFKDVNFQER-IEIMKKRDFLNEKGFSNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSS

Query:  VDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN
          IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + +W HF+K+RL+PTTH  T+S + V LLY ++ G  IN
Subjt:  VDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN

Query:  VGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK
        VG +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  VGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK

A0A2P5DXM3 Uncharacterized protein3.7e-3032.88Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     + + S L P + +W HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWLHFIK

Query:  NRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKL---------QQN
        +RL+PTTH   +S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+    +  ++K  +     ID   + ++         QQ 
Subjt:  NRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKL---------QQN

Query:  SIQRKDKASTSQATPPAGPNIDRLRDDL----------RTYWTYAKERDEAIREFYLSITPRIAPVFPNFPQSLLPKEEEDSEEDEENDDEDDEE
        S  R   AS+S+        +  L   L          + +W Y+KERD A+++   +   R  P FP FPQ +L  ++ D E + E+D +   E
Subjt:  SIQRKDKASTSQATPPAGPNIDRLRDDL----------RTYWTYAKERDEAIREFYLSITPRIAPVFPNFPQSLLPKEEEDSEEDEENDDEDDEE

W9RBS1 Uncharacterized protein5.0e-2728.27Show/hide
Query:  FAKRPRTRSMDAFPTVPPTISPAKPKGKSQKAVSPKNPFPEVFKDVNFQERIEIMKKRDFLNEKGF---SNRAGALPEFITGVIFQYKRQEFCAHPQEAV
        FAKRP   S + +       + A P   S + VS    F +   +  ++E I     R+ + EKGF    +     P FI+ VI     Q FC HP + +
Subjt:  FAKRPRTRSMDAFPTVPPTISPAKPKGKSQKAVSPKNPFPEVFKDVNFQERIEIMKKRDFLNEKGF---SNRAGALPEFITGVIFQYKRQEFCAHPQEAV

Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWL
        VPLV+EFYA L+ +  +   V    ++F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     +    +L+P + +W 
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQMKVKSLVPSDLKPESAIWL

Query:  HFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKLQQNSIQRK
        HF+ +RL+ +TH  TIS +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     + R      +DL  I ++     ++ 
Subjt:  HFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIGKLQQNSIQRK

Query:  DKA-STSQATPPAGPNID---------------------------------RLRDDLRTYWTYAKERDEAIREFY
        +K     +   P+ P+                                   + ++ L  +W Y+++RD A+++ +
Subjt:  DKA-STSQATPPAGPNID---------------------------------RLRDDLRTYWTYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAAGGGGTATCCACTCCTTGACATTGATCTAGAGATAGAGAGAACCTTTCGTCGACGAAGGAAGGAACAAAGACGAAAGAAGAAGGAGCAACAAGACTTGAGCGC
ACGGAAATTTCTAGAGGAAGCATCTTACATTCAAGTGTTTCCAATGGATCCTCCAGGAGTCGATCTTCAAGTTGATCCACAAAGTAATCCGAACACCCACAAGTCTGAAG
CAAGCAACCGACGGCAAGAAGAGAGCCCCGTTACGCCCATGCAAGGCACGCAAAGGATGAGACCCACGGGATTCTCGCCGGCGGTCGTGAACCAAGCCCCCAACGCTCAA
GCTCCATCCTCTTCGGCAGTGCCGGCCACGTCGAGGGAGATGTCGAGTTCATCTACACCGAGACGGTTCACGCGCGCCACTGCTGTTCGAGACGAGCTGCCCTTGAAGAA
GAAGGGAATCGGCAAGACGAAGAAAAAGCTGCCAAGGCAGTTGGAAGCTCTCGGCAAGGAGAAGCTTCAACGGGTAAGGTTTCCGAACCTTCAACTAACCCTTCTCTCTT
GCAGGACCAAGCCCGTTGTTACTTACAGCGCAAGAAAGAGGAGCCCAAAGAAAAATGTGTCTGAAAAGCCGCTTGAAATTCAGCACCTAGAAACCGCAAGGATGCCTCCT
TATGTATTCGAAGGAATAATCTGCCAAGCAGTGGCAAAGGCCCTTGAGGTTGCAGAGGGGTACAAGGCTAAACAAGATGCTTTGAAAGAAGTTGAAGCAGAGAGAGAGAT
GGGAAATCAGAAAATGGTTGAGGAAGACGAGCTTGCAAGGGAAAGAGATGAGAGGGAAGAGAAAAGGAAAAGAGAAGAAGAGAAAGAGGCCGAGAGGGCATTATTAGCTG
AGGAAGAAGAGGGAAGATTAGGTGAAAGCCTCAGAAGGGCAGCCATTGACTTACAACTCCTTGAGGAAGAGGAAAAGAGAAGGGAAGAAATAAAAGAAGATGAAAGGCGA
AGAAAGGAAGCCGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAGCACTGCAAGGAAGGGTAGAAGAAGAGGCCCAGCAGGGGCCAAC
AGAAGAAATTTTTGAAAAAGAAAAAGAAAGAGAAGTGGAGAATGAAGGCCAGAATGCAACTGCATCTGGGCCGCATTCTGAAGAAGGCCCAACCGAGGCCACTATCGATC
AGCCAGCTGAAGAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCTGCTGATAGCACCTCTTCGGGAGAGAAGAGGGTTGAAGAAGAAAAAGAAGACGAGGAGGCC
GAGACCTCCAGTGATTCAGATTCAGATTCAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGATCAAGTCCCTATCTCTGTAGCGCTGAGAAGAAAGATGAAGAGAGA
GATAAAAGCTGAGAGGAGGACAAAGAAAAAAAATGACCCGATATTTGCCAAGAGGCCGAGGACTAGGTCCATGGACGCCTTTCCTACAGTCCCTCCTACTATCTCACCCG
CCAAGCCTAAGGGCAAGTCACAGAAGGCCGTATCTCCTAAAAATCCATTCCCCGAGGTATTTAAAGATGTTAATTTTCAGGAACGGATAGAGATCATGAAGAAAAGAGAT
TTCCTCAATGAGAAGGGATTCTCTAACAGAGCAGGAGCACTGCCAGAGTTCATAACAGGAGTTATCTTCCAGTACAAGAGGCAGGAGTTCTGTGCTCACCCTCAGGAGGC
TGTTGTGCCTCTAGTTCGTGAATTTTACGCCGGCCTGAGGGAGGAAAGCATTAGCATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAATAGGG
TCTACAAGATCAAGGCACCCCTGAATCCGAGAGGGAATGATGTGATCAGGAACCCTTCGGCCAAGCAAATGAAGGAAGCTCTTAAGCTTGTGGCCAACAAGGGGGTCCAA
TGGAAAGAATCACAGATGAAAGTGAAGTCTTTAGTGCCAAGTGACCTAAAGCCAGAATCAGCAATTTGGCTTCACTTCATCAAGAACCGCTTGATGCCAACCACCCACGA
CAGCACAATTTCAGTGGATAGAGTTATGTTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGTGGGGAGCATAATCAGGGATGAGATTTTAGCCTGTGGGAGAAAGC
GAGCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAGCTCTGTCAAAGGGTGAAGATCGTTCCGGGCAAGGACAAGGAGCGTCATTTCTTCAAGCCGACCATTGACCTG
TCCTTGATCGGAAAGCTCCAGCAGAACAGTATCCAGAGGAAGGACAAAGCCTCTACATCTCAGGCTACTCCACCTGCAGGGCCGAATATAGATCGACTTAGGGACGACCT
GAGGACATATTGGACATATGCAAAGGAGCGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCACCCCTAGGATTGCTCCAGTCTTTCCAAATTTCCCTCAGTCGTTGC
TGCCAAAGGAAGAAGAGGATTCTGAAGAGGATGAAGAAAATGATGATGAAGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCAAGGGGTATCCACTCCTTGACATTGATCTAGAGATAGAGAGAACCTTTCGTCGACGAAGGAAGGAACAAAGACGAAAGAAGAAGGAGCAACAAGACTTGAGCGC
ACGGAAATTTCTAGAGGAAGCATCTTACATTCAAGTGTTTCCAATGGATCCTCCAGGAGTCGATCTTCAAGTTGATCCACAAAGTAATCCGAACACCCACAAGTCTGAAG
CAAGCAACCGACGGCAAGAAGAGAGCCCCGTTACGCCCATGCAAGGCACGCAAAGGATGAGACCCACGGGATTCTCGCCGGCGGTCGTGAACCAAGCCCCCAACGCTCAA
GCTCCATCCTCTTCGGCAGTGCCGGCCACGTCGAGGGAGATGTCGAGTTCATCTACACCGAGACGGTTCACGCGCGCCACTGCTGTTCGAGACGAGCTGCCCTTGAAGAA
GAAGGGAATCGGCAAGACGAAGAAAAAGCTGCCAAGGCAGTTGGAAGCTCTCGGCAAGGAGAAGCTTCAACGGGTAAGGTTTCCGAACCTTCAACTAACCCTTCTCTCTT
GCAGGACCAAGCCCGTTGTTACTTACAGCGCAAGAAAGAGGAGCCCAAAGAAAAATGTGTCTGAAAAGCCGCTTGAAATTCAGCACCTAGAAACCGCAAGGATGCCTCCT
TATGTATTCGAAGGAATAATCTGCCAAGCAGTGGCAAAGGCCCTTGAGGTTGCAGAGGGGTACAAGGCTAAACAAGATGCTTTGAAAGAAGTTGAAGCAGAGAGAGAGAT
GGGAAATCAGAAAATGGTTGAGGAAGACGAGCTTGCAAGGGAAAGAGATGAGAGGGAAGAGAAAAGGAAAAGAGAAGAAGAGAAAGAGGCCGAGAGGGCATTATTAGCTG
AGGAAGAAGAGGGAAGATTAGGTGAAAGCCTCAGAAGGGCAGCCATTGACTTACAACTCCTTGAGGAAGAGGAAAAGAGAAGGGAAGAAATAAAAGAAGATGAAAGGCGA
AGAAAGGAAGCCGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAGCACTGCAAGGAAGGGTAGAAGAAGAGGCCCAGCAGGGGCCAAC
AGAAGAAATTTTTGAAAAAGAAAAAGAAAGAGAAGTGGAGAATGAAGGCCAGAATGCAACTGCATCTGGGCCGCATTCTGAAGAAGGCCCAACCGAGGCCACTATCGATC
AGCCAGCTGAAGAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCTGCTGATAGCACCTCTTCGGGAGAGAAGAGGGTTGAAGAAGAAAAAGAAGACGAGGAGGCC
GAGACCTCCAGTGATTCAGATTCAGATTCAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGATCAAGTCCCTATCTCTGTAGCGCTGAGAAGAAAGATGAAGAGAGA
GATAAAAGCTGAGAGGAGGACAAAGAAAAAAAATGACCCGATATTTGCCAAGAGGCCGAGGACTAGGTCCATGGACGCCTTTCCTACAGTCCCTCCTACTATCTCACCCG
CCAAGCCTAAGGGCAAGTCACAGAAGGCCGTATCTCCTAAAAATCCATTCCCCGAGGTATTTAAAGATGTTAATTTTCAGGAACGGATAGAGATCATGAAGAAAAGAGAT
TTCCTCAATGAGAAGGGATTCTCTAACAGAGCAGGAGCACTGCCAGAGTTCATAACAGGAGTTATCTTCCAGTACAAGAGGCAGGAGTTCTGTGCTCACCCTCAGGAGGC
TGTTGTGCCTCTAGTTCGTGAATTTTACGCCGGCCTGAGGGAGGAAAGCATTAGCATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAATAGGG
TCTACAAGATCAAGGCACCCCTGAATCCGAGAGGGAATGATGTGATCAGGAACCCTTCGGCCAAGCAAATGAAGGAAGCTCTTAAGCTTGTGGCCAACAAGGGGGTCCAA
TGGAAAGAATCACAGATGAAAGTGAAGTCTTTAGTGCCAAGTGACCTAAAGCCAGAATCAGCAATTTGGCTTCACTTCATCAAGAACCGCTTGATGCCAACCACCCACGA
CAGCACAATTTCAGTGGATAGAGTTATGTTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGTGGGGAGCATAATCAGGGATGAGATTTTAGCCTGTGGGAGAAAGC
GAGCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAGCTCTGTCAAAGGGTGAAGATCGTTCCGGGCAAGGACAAGGAGCGTCATTTCTTCAAGCCGACCATTGACCTG
TCCTTGATCGGAAAGCTCCAGCAGAACAGTATCCAGAGGAAGGACAAAGCCTCTACATCTCAGGCTACTCCACCTGCAGGGCCGAATATAGATCGACTTAGGGACGACCT
GAGGACATATTGGACATATGCAAAGGAGCGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCACCCCTAGGATTGCTCCAGTCTTTCCAAATTTCCCTCAGTCGTTGC
TGCCAAAGGAAGAAGAGGATTCTGAAGAGGATGAAGAAAATGATGATGAAGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
Protein sequenceShow/hide protein sequence
MSKGYPLLDIDLEIERTFRRRRKEQRRKKKEQQDLSARKFLEEASYIQVFPMDPPGVDLQVDPQSNPNTHKSEASNRRQEESPVTPMQGTQRMRPTGFSPAVVNQAPNAQ
APSSSAVPATSREMSSSSTPRRFTRATAVRDELPLKKKGIGKTKKKLPRQLEALGKEKLQRVRFPNLQLTLLSCRTKPVVTYSARKRSPKKNVSEKPLEIQHLETARMPP
YVFEGIICQAVAKALEVAEGYKAKQDALKEVEAEREMGNQKMVEEDELARERDEREEKRKREEEKEAERALLAEEEEGRLGESLRRAAIDLQLLEEEEKRREEIKEDERR
RKEAEDFLAAFEPLHKAQSEAEALQGRVEEEAQQGPTEEIFEKEKEREVENEGQNATASGPHSEEGPTEATIDQPAEEVFEPLFTHDPPAADSTSSGEKRVEEEKEDEEA
ETSSDSDSDSESDSEIRELDGDQVPISVALRRKMKREIKAERRTKKKNDPIFAKRPRTRSMDAFPTVPPTISPAKPKGKSQKAVSPKNPFPEVFKDVNFQERIEIMKKRD
FLNEKGFSNRAGALPEFITGVIFQYKRQEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYKIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQ
WKESQMKVKSLVPSDLKPESAIWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDL
SLIGKLQQNSIQRKDKASTSQATPPAGPNIDRLRDDLRTYWTYAKERDEAIREFYLSITPRIAPVFPNFPQSLLPKEEEDSEEDEENDDEDDEEKESSSDEE