; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039311 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039311
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionWIYLD domain-containing protein
Genome locationscaffold10:45942829..45945188
RNA-Seq ExpressionSpg039311
SyntenySpg039311
Gene Ontology termsGO:0018024 - histone-lysine N-methyltransferase activity (molecular function)
InterPro domainsIPR018848 - WIYLD domain
IPR043017 - WIYLD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602621.1 putative zinc metalloprotease EGY2, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.2e-5551.69Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVR TVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKV                        
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER
                                           E+   AD Q T  AGCSS+V++EAS+SNPGAEITV          +DN   RIT+ +PANDS+ER
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER

Query:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAK
        Y K++D          + S++NQS   A TPK  RR+PYHGWIS+G  D  DLV L PA LPEE A+
Subjt:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAK

KAG7033305.1 hypothetical protein SDJN02_07360 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-5650.72Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVR TVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKV                        
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITVMDNGALRITSALPANDSEERYGKDDD----
                                           E+   AD Q T  AGCSS+V         G   + +DN   RIT+ +PANDS+ERY K++D    
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITVMDNGALRITSALPANDSEERYGKDDD----

Query:  ----DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS
              + S++NQS   A TPK  RR+PYHGWIS+G  D  DLV L PA LPEE A+L I  AQRKRK RWDVK ++S
Subjt:  ----DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS

XP_022953876.1 uncharacterized protein LOC111456280 isoform X1 [Cucurbita moschata]2.1e-6352.78Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVR TVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKV                        
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER
                                           E+   AD Q T  AGCSS+V++EAS+SNPGAEITV          +DN   RIT+ +PANDS+ER
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER

Query:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS
        Y K++D          + S++NQS   A TPK  RR+PYHGWIS+G  D  DLV L PA LPEE A+L I  AQRKRK RWDVK ++S
Subjt:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS

XP_022990775.1 uncharacterized protein LOC111487557 [Cucurbita maxima]1.2e-5551.69Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPR RSKKRGNLRIDAALDAMNPFGF PKLVR TVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKV                        
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER
                                           E+   AD Q T  AGCSS+V++EAS+SNPGAEITV          +DN   RIT+ +PANDS+ER
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER

Query:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAK
        Y K++D          + S++NQS   A  PK  RR+PYHGWIS+G  D  DLV L PA LPEE A+
Subjt:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAK

XP_023531273.1 uncharacterized protein LOC111793562 isoform X1 [Cucurbita pepo subsp. pepo]1.0e-6252.43Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVR TVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKV                        
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER
                                           E+   AD Q T  AGCSS+V++EAS+SNPGAEITV          +DN   RIT+ +PANDS+ER
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER

Query:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS
        Y K++D          + S++NQS   A TPK  RR+PYHGWIS+   D  DLV L PA LPEE A+L I  AQRKRK RWDVK ++S
Subjt:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS

TrEMBL top hitse value%identityAlignment
A0A1S3B3F8 uncharacterized protein LOC103485563 isoform X11.2e-5148.56Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPRVRSKKR NLRIDAALDAM  FGFPPKLVR TVKELL VYGGDDGWVFIEEGSYTLLIDTLLEKQ +GAIE                          
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITVMDNGALRITSALPANDSEERYGKD------
                                         V  +N   D Q T VAGCS S ID   S           N A+  T+ LPAN+ +  +  D      
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITVMDNGALRITSALPANDSEERYGKD------

Query:  ----DDDHVGSTLNQSSPARTPKPSRRRPYHGWISNGDM-GDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS
            DDDH  ST NQS PA TPK  RR+ YHGWI   +   DLV L P  LPEELAKL I  A +KRK+RWDV+SA+S
Subjt:  ----DDDHVGSTLNQSSPARTPKPSRRRPYHGWISNGDM-GDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS

A0A6J1BY42 uncharacterized protein LOC111006514 isoform X15.8e-5149.82Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPRVRS+KRGNLRIDAALDAM PFGF PKLVR TVKELLSVYGGDDGWVFIEEGSYTLLIDT+L+K KDG I                           
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADC-QATLVAGCSSSVIDEASSSNPGAEI--TVMDNGALRITSALPANDSEERY-----G
                                       ++V EEN  A+  + T +AGCSS+  DE +       +     DN A RIT+ L   DSE RY     G
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADC-QATLVAGCSSSVIDEASSSNPGAEI--TVMDNGALRITSALPANDSEERY-----G

Query:  KDDDDHVGSTLNQSSP-ARTPKPSRRRPYHGWISNGD-MGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS
          DDDH  S  NQS+P A TPK S RRPYHGWIS+ D   DLV L P P   E A+L + P QRKRKQRWDVK A+S
Subjt:  KDDDDHVGSTLNQSSP-ARTPKPSRRRPYHGWISNGD-MGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS

A0A6J1GPA9 uncharacterized protein LOC111456280 isoform X11.0e-6352.78Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVR TVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKV                        
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER
                                           E+   AD Q T  AGCSS+V++EAS+SNPGAEITV          +DN   RIT+ +PANDS+ER
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER

Query:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS
        Y K++D          + S++NQS   A TPK  RR+PYHGWIS+G  D  DLV L PA LPEE A+L I  AQRKRK RWDVK ++S
Subjt:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS

A0A6J1GQX1 uncharacterized protein LOC111456280 isoform X27.8e-5651.69Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVR TVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKV                        
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER
                                           E+   AD Q T  AGCSS+V++EAS+SNPGAEITV          +DN   RIT+ +PANDS+ER
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER

Query:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAK
        Y K++D          + S++NQS   A TPK  RR+PYHGWIS+G  D  DLV L PA LPEE A+
Subjt:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAK

A0A6J1JR10 uncharacterized protein LOC1114875576.0e-5651.69Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM
        MAPR RSKKRGNLRIDAALDAMNPFGF PKLVR TVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKV                        
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFM

Query:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER
                                           E+   AD Q T  AGCSS+V++EAS+SNPGAEITV          +DN   RIT+ +PANDS+ER
Subjt:  DRSRYQQRTIVVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITV----------MDNGALRITSALPANDSEER

Query:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAK
        Y K++D          + S++NQS   A  PK  RR+PYHGWIS+G  D  DLV L PA LPEE A+
Subjt:  YGKDDD--------DHVGSTLNQS-SPARTPKPSRRRPYHGWISNG--DMGDLVRLAPAPLPEELAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45248.2 Nucleolar histone methyltransferase-related protein2.8e-0534.92Show/hide
Query:  KKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD
        K  G  R DAA D M  FGF   ++  ++K++L VY G+D W  IE+ +Y + +   L+ +++
Subjt:  KKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD

AT1G45248.3 Nucleolar histone methyltransferase-related protein2.8e-0534.92Show/hide
Query:  KKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD
        K  G  R DAA D M  FGF   ++  ++K++L VY G+D W  IE+ +Y + +   L+ +++
Subjt:  KKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD

AT2G40020.1 Nucleolar histone methyltransferase-related protein1.2e-0828.02Show/hide
Query:  LRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQ--KDGAIEKVSFSTIFVEFGLYMHAFFNYALWFMDRSRYQQRTI
        +R DAA D M  FGF   ++  ++KELL VY  +D W  IE+ SY  L+   LEKQ  K+  + +V  + +            N+     +  +  Q  +
Subjt:  LRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQ--KDGAIEKVSFSTIFVEFGLYMHAFFNYALWFMDRSRYQQRTI

Query:  VVC---CVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITVMDNGALRITSALPANDSEERYGKDDDDHVGSTLNQSS
         +      QE    ++     A  L +  E V     A +  G +SS  D A SS  G E   +  G L         D EE    D ++       ++ 
Subjt:  VVC---CVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITVMDNGALRITSALPANDSEERYGKDDDDHVGSTLNQSS

Query:  PARTPKPSRRRPYHGWISNGDMGDLVRLAPAPLPEELAKLFIL---PAQRKRKQRWD
        P   PK     P     S+GD  ++++L P PL EEL +L        +RK++ RWD
Subjt:  PARTPKPSRRRPYHGWISNGDMGDLVRLAPAPLPEELAKLFIL---PAQRKRKQRWD

AT2G40020.2 Nucleolar histone methyltransferase-related protein3.4e-1148.57Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD
        MAPR R KK G +R DAA D M  FGF   ++  ++KELL VY  +D W  IE+ SY  L+   LEKQ++
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD

AT2G40020.3 Nucleolar histone methyltransferase-related protein1.4e-1229.74Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQ--KDGAIEKVSFSTIFVEFGLYMHAFFNYALW
        MAPR R KK G +R DAA D M  FGF   ++  ++KELL VY  +D W  IE+ SY  L+   LEKQ  K+  + +V  + +            N+   
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQ--KDGAIEKVSFSTIFVEFGLYMHAFFNYALW

Query:  FMDRSRYQQRTIVVC---CVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITVMDNGALRITSALPANDSEERYGKDD
          +  +  Q  + +      QE    ++     A  L +  E V     A +  G +SS  D A SS  G E   +  G L         D EE    D 
Subjt:  FMDRSRYQQRTIVVC---CVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITVMDNGALRITSALPANDSEERYGKDD

Query:  DDHVGSTLNQSSPARTPKPSRRRPYHGWISNGDMGDLVRLAPAPLPEELAKLFIL---PAQRKRKQRWD
        ++       ++ P   PK     P     S+GD  ++++L P PL EEL +L        +RK++ RWD
Subjt:  DDHVGSTLNQSSPARTPKPSRRRPYHGWISNGDMGDLVRLAPAPLPEELAKLFIL---PAQRKRKQRWD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCCAGAGTGCGAAGCAAAAAGAGGGGTAACTTACGAATTGATGCTGCGCTCGATGCTATGAACCCTTTTGGATTTCCCCCAAAGTTGGTTCGTCATACGGTGAA
GGAGCTCCTTAGTGTCTATGGAGGAGACGATGGATGGGTATTCATCGAGGAAGGCTCTTATACTCTTTTGATCGATACCCTTCTCGAAAAACAGAAAGATGGTGCAATTG
AGAAGGTTAGTTTTTCAACTATTTTTGTTGAATTTGGCTTGTACATGCACGCTTTCTTTAACTATGCATTATGGTTTATGGATCGTTCACGATACCAACAGAGAACCATT
GTTGTTTGTTGTGTACAAGAAACTACTTGTTCTGTAGATTTACGACTATTGAATGCCCATTTTCTGAAGGTTCCTGAAGAGAATGTAGGAGCAGATTGTCAGGCGACCCT
TGTAGCTGGCTGTTCATCGAGTGTCATTGATGAAGCTTCCTCATCCAATCCAGGGGCTGAGATTACTGTGATGGACAATGGAGCTTTAAGGATCACATCCGCATTGCCTG
CAAATGATTCAGAAGAAAGATACGGGAAGGATGATGATGACCATGTTGGGAGTACTCTTAACCAGTCTTCACCGGCACGTACCCCAAAACCGAGTAGGCGAAGACCTTAT
CATGGCTGGATCTCTAACGGCGACATGGGAGATCTCGTGCGCTTAGCACCAGCTCCATTGCCTGAAGAGTTGGCCAAGTTATTCATTCTTCCTGCACAGAGAAAAAGAAA
GCAGCGTTGGGATGTTAAGTCTGCAGACTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCCAGAGTGCGAAGCAAAAAGAGGGGTAACTTACGAATTGATGCTGCGCTCGATGCTATGAACCCTTTTGGATTTCCCCCAAAGTTGGTTCGTCATACGGTGAA
GGAGCTCCTTAGTGTCTATGGAGGAGACGATGGATGGGTATTCATCGAGGAAGGCTCTTATACTCTTTTGATCGATACCCTTCTCGAAAAACAGAAAGATGGTGCAATTG
AGAAGGTTAGTTTTTCAACTATTTTTGTTGAATTTGGCTTGTACATGCACGCTTTCTTTAACTATGCATTATGGTTTATGGATCGTTCACGATACCAACAGAGAACCATT
GTTGTTTGTTGTGTACAAGAAACTACTTGTTCTGTAGATTTACGACTATTGAATGCCCATTTTCTGAAGGTTCCTGAAGAGAATGTAGGAGCAGATTGTCAGGCGACCCT
TGTAGCTGGCTGTTCATCGAGTGTCATTGATGAAGCTTCCTCATCCAATCCAGGGGCTGAGATTACTGTGATGGACAATGGAGCTTTAAGGATCACATCCGCATTGCCTG
CAAATGATTCAGAAGAAAGATACGGGAAGGATGATGATGACCATGTTGGGAGTACTCTTAACCAGTCTTCACCGGCACGTACCCCAAAACCGAGTAGGCGAAGACCTTAT
CATGGCTGGATCTCTAACGGCGACATGGGAGATCTCGTGCGCTTAGCACCAGCTCCATTGCCTGAAGAGTTGGCCAAGTTATTCATTCTTCCTGCACAGAGAAAAAGAAA
GCAGCGTTGGGATGTTAAGTCTGCAGACTCATGA
Protein sequenceShow/hide protein sequence
MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRHTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVSFSTIFVEFGLYMHAFFNYALWFMDRSRYQQRTI
VVCCVQETTCSVDLRLLNAHFLKVPEENVGADCQATLVAGCSSSVIDEASSSNPGAEITVMDNGALRITSALPANDSEERYGKDDDDHVGSTLNQSSPARTPKPSRRRPY
HGWISNGDMGDLVRLAPAPLPEELAKLFILPAQRKRKQRWDVKSADS