; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004660 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004660
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein MNN4-like
Genome locationscaffold5:20460763..20466852
RNA-Seq ExpressionSpg004660
SyntenySpg004660
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]8.4e-3030.16Show/hide
Query:  FAKRPRTRSMDASPAVPPTTSPAKPKGKSLKAASPKNPFPEVFKDVNFQERMDIMKKRDFLNEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV
        FAKRP + S    PA+    + A     S +  S    F +   +  ++E    +  R+ + EKGF    +     P F+S +I    WQ FC HP + +
Subjt:  FAKRPRTRSMDASPAVPPTTSPAKPKGKSLKAASPKNPFPEVFKDVNFQERMDIMKKRDFLNEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV

Query:  VPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGND----VIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWL
        VPL++EFY  L+ +  N   V    ++F+S  IN V  I     P  +D    +I +    Q+K+ LK +A  G QW  S     +    +L+P + VW 
Subjt:  VPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGND----VIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWL

Query:  HFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRK
        HF+ +RL+ +TH  TIS +R  LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++     ++ 
Subjt:  HFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRK

Query:  DKASTSQATPQSGPNVASPSQHTSFTGPSPASEALGM-----------VHRQLDQIRENLKTYWTYAKERDEAIREFY
        +K    +   Q  P+  S S HT     + + E L             +   L Q +E L  +W Y+++RD A+++ +
Subjt:  DKASTSQATPQSGPNVASPSQHTSFTGPSPASEALGM-----------VHRQLDQIRENLKTYWTYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.0e-3535.98Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPL+REFY  L     N   VRG  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFG
         +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR+ LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQ
        SLIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    S P  AS S+
Subjt:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.1e-4032.49Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPL+REFY  L     N   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFG
         + + +   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR+ LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTSFTGPSPASEALGMVHRQ--LDQIRENLKTYW
        SLIT+LC+  +     +EE+      ID   + ++ Q   +   +  +S+  AT  S        Q         + + +   H    L    +  + +W
Subjt:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTSFTGPSPASEALGMVHRQ--LDQIRENLKTYW

Query:  TYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDDEEKE
         Y+KERD A+++   +      P FP FPQ +L   + + + E D++  +E  E
Subjt:  TYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDDEEKE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]6.8e-3233.61Show/hide
Query:  KAASPKNPFPEVFKDVNFQER-MDIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVD
        KA   ++   E+  + N Q R + + K+  + N K         P F++ +I Q+ WQ FCAHP++ +VPL+REFY  +     +   +RG  V  S   
Subjt:  KAASPKNPFPEVFKDVNFQER-MDIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNPRGNDVIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVG

Query:  SIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEER
         +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]7.1e-2930.72Show/hide
Query:  VPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK
        +PL+REFY  L     N   VRG  VS+S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     + + S L P + VW HF+K
Subjt:  VPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK

Query:  NRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS
        +RL+PTTH   +S DR+ LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+    +  +++          L   G++   ++ R  +  
Subjt:  NRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPQSGPNVASPSQHTSFTGPSPASEALGMVHRQLDQ---IRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEEN
         +++T Q  P+ + P+  +S        + L  + ++L Q     +  + +W Y+KERD A+++   +      P FP FPQ +L   + + + E D++ 
Subjt:  TSQATPQSGPNVASPSQHTSFTGPSPASEALGMVHRQLDQ---IRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEEN

Query:  DDEEKE
         +E  E
Subjt:  DDEEKE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)4.9e-3635.98Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPL+REFY  L     N   VRG  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFG
         +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR+ LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQ
        SLIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    S P  AS S+
Subjt:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment)5.1e-4132.49Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPL+REFY  L     N   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFG
         + + +   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR+ LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTSFTGPSPASEALGMVHRQ--LDQIRENLKTYW
        SLIT+LC+  +     +EE+      ID   + ++ Q   +   +  +S+  AT  S        Q         + + +   H    L    +  + +W
Subjt:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTSFTGPSPASEALGMVHRQ--LDQIRENLKTYW

Query:  TYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDDEEKE
         Y+KERD A+++   +      P FP FPQ +L   + + + E D++  +E  E
Subjt:  TYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDDEEKE

A0A2P5DAQ2 Uncharacterized protein3.3e-3233.61Show/hide
Query:  KAASPKNPFPEVFKDVNFQER-MDIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVD
        KA   ++   E+  + N Q R + + K+  + N K         P F++ +I Q+ WQ FCAHP++ +VPL+REFY  +     +   +RG  V  S   
Subjt:  KAASPKNPFPEVFKDVNFQER-MDIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINMAVVRGKIVSFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNPRGNDVIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVG

Query:  SIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEER
         +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEER

A0A2P5DXM3 Uncharacterized protein3.4e-2930.72Show/hide
Query:  VPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK
        +PL+REFY  L     N   VRG  VS+S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     + + S L P + VW HF+K
Subjt:  VPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK

Query:  NRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS
        +RL+PTTH   +S DR+ LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+    +  +++          L   G++   ++ R  +  
Subjt:  NRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPQSGPNVASPSQHTSFTGPSPASEALGMVHRQLDQ---IRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEEN
         +++T Q  P+ + P+  +S        + L  + ++L Q     +  + +W Y+KERD A+++   +      P FP FPQ +L   + + + E D++ 
Subjt:  TSQATPQSGPNVASPSQHTSFTGPSPASEALGMVHRQLDQ---IRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEEN

Query:  DDEEKE
         +E  E
Subjt:  DDEEKE

W9RBS1 Uncharacterized protein4.0e-3030.16Show/hide
Query:  FAKRPRTRSMDASPAVPPTTSPAKPKGKSLKAASPKNPFPEVFKDVNFQERMDIMKKRDFLNEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV
        FAKRP + S    PA+    + A     S +  S    F +   +  ++E    +  R+ + EKGF    +     P F+S +I    WQ FC HP + +
Subjt:  FAKRPRTRSMDASPAVPPTTSPAKPKGKSLKAASPKNPFPEVFKDVNFQERMDIMKKRDFLNEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV

Query:  VPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGND----VIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWL
        VPL++EFY  L+ +  N   V    ++F+S  IN V  I     P  +D    +I +    Q+K+ LK +A  G QW  S     +    +L+P + VW 
Subjt:  VPLLREFYPGLRKESINMAVVRGKIVSFSSVDINRVYRIKAPLNPRGND----VIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWL

Query:  HFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRK
        HF+ +RL+ +TH  TIS +R  LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++     ++ 
Subjt:  HFIKNRLMPTTHDSTISVDRVRLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRK

Query:  DKASTSQATPQSGPNVASPSQHTSFTGPSPASEALGM-----------VHRQLDQIRENLKTYWTYAKERDEAIREFY
        +K    +   Q  P+  S S HT     + + E L             +   L Q +E L  +W Y+++RD A+++ +
Subjt:  DKASTSQATPQSGPNVASPSQHTSFTGPSPASEALGM-----------VHRQLDQIRENLKTYWTYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAAGGGGTATCCACTCCTTGACATTGATCCCGAGATAGAAAGAACCTTTCGTCGTCGAAGGAAGGAACAGAGACGAAAGAAGAAGGAGCAACAAGAGTTGAGCGC
ACGGGAACTTCTAGAGGAAGCATCTTACATTCAAGAGTTTCCAATGGATCCTCCTGGAGTCGAGCCTCAAGTATATCCACAAGCTCATGAAGAATTGGACGAAAAAGATG
AAGAAGAGATTGAAGTGGAAGACATCAGTCCAACCATGAAGTGGCAAAGGATAAAACCATATTGGGGAAGAGGCTTCGAGGATGAGAAAGCCCATGTCTCCATGATTGAT
TTGTCAAACCCTCAAGCTCTCCATATTCTCCATCGTGAAACCGAAAGACCCACCAGCAAGAAGATGAAGAACACTCAAGGATCGTCATCCTCACGCAAGAACACTCGATC
TCAGAGTGTCCGAGCGACCCACGAAGCTGAAGCAAGCAACCGACGGCAAGAAGAGACCCCCGTTACGCCCATGCACGGCACGCGAAGGACAAGACCCATGGGATTCTCGC
CGGCGGTCGTGAACCAAGCGCCCGATGCTCCTACTCCATCTTCTTCGGCAATGTCGGCCACGTCGAGGGAGATGCCGAGTTCATCTACGCCAAGGAGGTTCACGCGCGCC
ACTGCCGTCCGCCAAACCCAAAAACCCGCTACTCAACAATACAAAAAATGTTCGCGGGAGTGGTTTGAAATGATCTGTGAGATGGGTGCCAAGAGACGAGCTGCCCTTGA
AGAAGAAGGGAATCAGAAAGACGAAGAAAAAGCCGCCAAGGCAGCTGAAAGCTCTCGGCAAGGAGAAGCTTCAATGAGTAAGGTTTCTGAACCTTCATCTAACCCTTCTT
TTTCTTGCAGGTCAAAACCGTTTGTAACTTACTGTGCAAGAAAGAAGAGCCCGAAGAAGGTTGTGTATGAAAGGCCATTAAAAATTGAGCCCCTAAAAACCGCAAGGATG
CCTCCCAATGTATTCGAAGGAATAATCCGTCAAGTTGTGGCAAAGGCCCTTGAGATTGCTGAGGGGTACAAGGCTGAACAGGATGCATTAAAAGAGGTTGAAGCTGAGAG
GGAGATGGAAAATCAGAAAATGACTGAGGAGGATGAGTTTGCAAAGAAAAGAGATGAGGAAGAAGAGAAAAGAAAGAGAGAAGAAGAGCAAGAGGCCGAGAGAGCCTTAG
AGGCTGAAGAAGAAAGAAAGAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAGCCACTCCATAAGGCTCAAAGTGAAGTTGAAGCACTGCAAGGAAGGGTAGAAGAAAAG
GCCCAACAGGGGCCAACAGAAGAAAATTTTGAAAAAGAAAAAGAAAGAGAAGTGGAGAATGAAGGCCAGAATGCAACCGCATCTGGGCCACATTCTGAGGAAGGCCTAAC
CGAGGCCACCGTTCATCAGCCAGCTGAAGAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCTGCTAATAGCACCTCTTCGGGAGAGAGGAAGGTAAGTGAGCTAG
ATGATGACCAAGTCCCTATCTCTGCAGCATTGAGAAGAAAGAGGAAGAGAGAGATTAAGGCTGAGAGGAGGACAAAGAACAAGAATGACCCGATATTTGCCAAGAGGCCG
AGGACAAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCGACTACCTCACCCGCCAAGCCTAAGGGCAAGTCACTGAAGGCCGCATCTCCTAAAAATCCATTCCCTGAGGT
ATTTAAAGATGTTAATTTTCAGGAAAGGATGGATATCATGAAGAAAAGAGACTTCCTCAACGAGAAAGGATTCTCTAACAGAGCTGGAGCACTGCCAGAGTTCGTGAGCA
GGATCATATCTCAATACAAATGGCAGGACTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCTCTACTTCGAGAGTTTTACCCTGGCCTGAGGAAGGAGAGTATTAACATG
GCGGTGGTGAGGGGGAAGATAGTCAGTTTCTCCTCAGTCGACATTAACAGGGTGTACAGGATCAAAGCACCCCTGAATCCAAGAGGGAATGATGTTATCAGGAACCCTTC
GGCCAGACAGATGAAAGACGCTTTGAAACTTGTGGCCAACAAGGGGGTCCAATGGAAAGAATCGCAGACGAAAGTGAAGTCTTTAGTGCCAAGCGACCTAAAGCCAGAAT
CGGCAGTTTGGCTTCACTTCATCAAAAACCGCTTGATGCCAACCACCCATGACAGCACGATCTCAGTAGATAGAGTGAGGCTACTCTATTGTCTTATGAAGGGGTTGGAG
ATCAACGTGGGGAGCATAATTAGGGACGAGATCTTAGCCTGTGGATGGAAAAGGGCAGGCAAGCTTTTCTTCGGCTCACTCATCACCCAACTCTGTCAGAGGGTGAAGAT
TGTGCCAGGCAAGGACGAGGAGCGCCATTTCTTTAAACCAACCATCGACTTGTCCTTGATAGGGAAGCTCCAGCAGAACAGTATCCAGAGGAAAGACAAAGCCTCTACAT
CTCAGGCCACTCCTCAATCAGGGCCGAATGTAGCCTCTCCATCCCAACACACTTCTTTTACAGGGCCTTCACCAGCATCGGAAGCCCTAGGTATGGTCCACCGCCAGCTT
GATCAAATCAGGGAGAACCTGAAGACATATTGGACATATGCCAAGGAGAGGGATGAAGCAATTAGAGAGTTCTATCTCTCGATTGCCCCTAGTATCGCTCCGGTCTTTCC
AAATTTCCCTCAGTCGCTGCTGCCCCAGGAAGAGAAAGATTCTGATGAAGAGGAAGATGAAGAGAATGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCAAGGGGTATCCACTCCTTGACATTGATCCCGAGATAGAAAGAACCTTTCGTCGTCGAAGGAAGGAACAGAGACGAAAGAAGAAGGAGCAACAAGAGTTGAGCGC
ACGGGAACTTCTAGAGGAAGCATCTTACATTCAAGAGTTTCCAATGGATCCTCCTGGAGTCGAGCCTCAAGTATATCCACAAGCTCATGAAGAATTGGACGAAAAAGATG
AAGAAGAGATTGAAGTGGAAGACATCAGTCCAACCATGAAGTGGCAAAGGATAAAACCATATTGGGGAAGAGGCTTCGAGGATGAGAAAGCCCATGTCTCCATGATTGAT
TTGTCAAACCCTCAAGCTCTCCATATTCTCCATCGTGAAACCGAAAGACCCACCAGCAAGAAGATGAAGAACACTCAAGGATCGTCATCCTCACGCAAGAACACTCGATC
TCAGAGTGTCCGAGCGACCCACGAAGCTGAAGCAAGCAACCGACGGCAAGAAGAGACCCCCGTTACGCCCATGCACGGCACGCGAAGGACAAGACCCATGGGATTCTCGC
CGGCGGTCGTGAACCAAGCGCCCGATGCTCCTACTCCATCTTCTTCGGCAATGTCGGCCACGTCGAGGGAGATGCCGAGTTCATCTACGCCAAGGAGGTTCACGCGCGCC
ACTGCCGTCCGCCAAACCCAAAAACCCGCTACTCAACAATACAAAAAATGTTCGCGGGAGTGGTTTGAAATGATCTGTGAGATGGGTGCCAAGAGACGAGCTGCCCTTGA
AGAAGAAGGGAATCAGAAAGACGAAGAAAAAGCCGCCAAGGCAGCTGAAAGCTCTCGGCAAGGAGAAGCTTCAATGAGTAAGGTTTCTGAACCTTCATCTAACCCTTCTT
TTTCTTGCAGGTCAAAACCGTTTGTAACTTACTGTGCAAGAAAGAAGAGCCCGAAGAAGGTTGTGTATGAAAGGCCATTAAAAATTGAGCCCCTAAAAACCGCAAGGATG
CCTCCCAATGTATTCGAAGGAATAATCCGTCAAGTTGTGGCAAAGGCCCTTGAGATTGCTGAGGGGTACAAGGCTGAACAGGATGCATTAAAAGAGGTTGAAGCTGAGAG
GGAGATGGAAAATCAGAAAATGACTGAGGAGGATGAGTTTGCAAAGAAAAGAGATGAGGAAGAAGAGAAAAGAAAGAGAGAAGAAGAGCAAGAGGCCGAGAGAGCCTTAG
AGGCTGAAGAAGAAAGAAAGAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAGCCACTCCATAAGGCTCAAAGTGAAGTTGAAGCACTGCAAGGAAGGGTAGAAGAAAAG
GCCCAACAGGGGCCAACAGAAGAAAATTTTGAAAAAGAAAAAGAAAGAGAAGTGGAGAATGAAGGCCAGAATGCAACCGCATCTGGGCCACATTCTGAGGAAGGCCTAAC
CGAGGCCACCGTTCATCAGCCAGCTGAAGAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCTGCTAATAGCACCTCTTCGGGAGAGAGGAAGGTAAGTGAGCTAG
ATGATGACCAAGTCCCTATCTCTGCAGCATTGAGAAGAAAGAGGAAGAGAGAGATTAAGGCTGAGAGGAGGACAAAGAACAAGAATGACCCGATATTTGCCAAGAGGCCG
AGGACAAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCGACTACCTCACCCGCCAAGCCTAAGGGCAAGTCACTGAAGGCCGCATCTCCTAAAAATCCATTCCCTGAGGT
ATTTAAAGATGTTAATTTTCAGGAAAGGATGGATATCATGAAGAAAAGAGACTTCCTCAACGAGAAAGGATTCTCTAACAGAGCTGGAGCACTGCCAGAGTTCGTGAGCA
GGATCATATCTCAATACAAATGGCAGGACTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCTCTACTTCGAGAGTTTTACCCTGGCCTGAGGAAGGAGAGTATTAACATG
GCGGTGGTGAGGGGGAAGATAGTCAGTTTCTCCTCAGTCGACATTAACAGGGTGTACAGGATCAAAGCACCCCTGAATCCAAGAGGGAATGATGTTATCAGGAACCCTTC
GGCCAGACAGATGAAAGACGCTTTGAAACTTGTGGCCAACAAGGGGGTCCAATGGAAAGAATCGCAGACGAAAGTGAAGTCTTTAGTGCCAAGCGACCTAAAGCCAGAAT
CGGCAGTTTGGCTTCACTTCATCAAAAACCGCTTGATGCCAACCACCCATGACAGCACGATCTCAGTAGATAGAGTGAGGCTACTCTATTGTCTTATGAAGGGGTTGGAG
ATCAACGTGGGGAGCATAATTAGGGACGAGATCTTAGCCTGTGGATGGAAAAGGGCAGGCAAGCTTTTCTTCGGCTCACTCATCACCCAACTCTGTCAGAGGGTGAAGAT
TGTGCCAGGCAAGGACGAGGAGCGCCATTTCTTTAAACCAACCATCGACTTGTCCTTGATAGGGAAGCTCCAGCAGAACAGTATCCAGAGGAAAGACAAAGCCTCTACAT
CTCAGGCCACTCCTCAATCAGGGCCGAATGTAGCCTCTCCATCCCAACACACTTCTTTTACAGGGCCTTCACCAGCATCGGAAGCCCTAGGTATGGTCCACCGCCAGCTT
GATCAAATCAGGGAGAACCTGAAGACATATTGGACATATGCCAAGGAGAGGGATGAAGCAATTAGAGAGTTCTATCTCTCGATTGCCCCTAGTATCGCTCCGGTCTTTCC
AAATTTCCCTCAGTCGCTGCTGCCCCAGGAAGAGAAAGATTCTGATGAAGAGGAAGATGAAGAGAATGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
Protein sequenceShow/hide protein sequence
MSKGYPLLDIDPEIERTFRRRRKEQRRKKKEQQELSARELLEEASYIQEFPMDPPGVEPQVYPQAHEELDEKDEEEIEVEDISPTMKWQRIKPYWGRGFEDEKAHVSMID
LSNPQALHILHRETERPTSKKMKNTQGSSSSRKNTRSQSVRATHEAEASNRRQEETPVTPMHGTRRTRPMGFSPAVVNQAPDAPTPSSSAMSATSREMPSSSTPRRFTRA
TAVRQTQKPATQQYKKCSREWFEMICEMGAKRRAALEEEGNQKDEEKAAKAAESSRQGEASMSKVSEPSSNPSFSCRSKPFVTYCARKKSPKKVVYERPLKIEPLKTARM
PPNVFEGIIRQVVAKALEIAEGYKAEQDALKEVEAEREMENQKMTEEDEFAKKRDEEEEKRKREEEQEAERALEAEEERKKEAEDFLAAFEPLHKAQSEVEALQGRVEEK
AQQGPTEENFEKEKEREVENEGQNATASGPHSEEGLTEATVHQPAEEVFEPLFTHDPPAANSTSSGERKVSELDDDQVPISAALRRKRKREIKAERRTKNKNDPIFAKRP
RTRSMDASPAVPPTTSPAKPKGKSLKAASPKNPFPEVFKDVNFQERMDIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLLREFYPGLRKESINM
AVVRGKIVSFSSVDINRVYRIKAPLNPRGNDVIRNPSARQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVRLLYCLMKGLE
INVGSIIRDEILACGWKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQHTSFTGPSPASEALGMVHRQL
DQIRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDDEEKESSSDEE