; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023588 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023588
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like
Genome locationtig00000892:4779089..4779627
RNA-Seq ExpressionSgr023588
SyntenySgr023588
Gene Ontology termsGO:0009299 - mRNA transcription (biological process)
GO:0009416 - response to light stimulus (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR006936 - ALOG domain
IPR040222 - ALOG family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOX92850.1 Pleckstrin domain-containing family N member 1 [Theobroma cacao]3.3e-5873.03Show/hide
Query:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------
        MS ERGKD AEGS   P    QQP TPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCN NHVLDFLRYLDQFGKTKVH+Q                    
Subjt:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------

Query:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS
                 GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQ KG++E   SSS MQFS
Subjt:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS

XP_007048693.2 PREDICTED: protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10 [Theobroma cacao]3.3e-5873.03Show/hide
Query:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------
        MS ERGKD AEGS   P    QQP TPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCN NHVLDFLRYLDQFGKTKVH+Q                    
Subjt:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------

Query:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS
                 GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQ KG++E   SSS MQFS
Subjt:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS

XP_021273752.1 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10 [Herrania umbratica]4.3e-5873.03Show/hide
Query:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------
        MS ERGKD AEGS   P    QQP TPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCN NHVLDFLRYLDQFGKTKVH+Q                    
Subjt:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------

Query:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS
                 GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIP+KKKKKKPNQ KGS+E   SSS MQFS
Subjt:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS

XP_022149714.1 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like [Momordica charantia]7.4e-6679.21Show/hide
Query:  SGERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ----------------------
        SGERGKDYAEGS S   SSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ                      
Subjt:  SGERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ----------------------

Query:  -------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFSTP
               GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKP+QNKG+ EESSSSS MQFSTP
Subjt:  -------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFSTP

XP_022773780.1 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like [Durio zibethinus]9.7e-5872.47Show/hide
Query:  MSGERGKDYAEGSP-SSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------
        MS ERGKD AEGS  ++P   QQP TPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCN NHVLDFLRYLDQFGKTKVH+Q                    
Subjt:  MSGERGKDYAEGSP-SSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------

Query:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS
                 GRLRAAYEENGGSPETNPFAS AIRVYLREVRECQAKARGIPYKKKKKKPNQ KGS+E   SSS M FS
Subjt:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS

TrEMBL top hitse value%identityAlignment
A0A061DJQ9 Pleckstrin domain-containing family N member 11.6e-5873.03Show/hide
Query:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------
        MS ERGKD AEGS   P    QQP TPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCN NHVLDFLRYLDQFGKTKVH+Q                    
Subjt:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------

Query:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS
                 GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQ KG++E   SSS MQFS
Subjt:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS

A0A6J0ZGE6 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 102.1e-5873.03Show/hide
Query:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------
        MS ERGKD AEGS   P    QQP TPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCN NHVLDFLRYLDQFGKTKVH+Q                    
Subjt:  MSGERGKDYAEGSPSSP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------

Query:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS
                 GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIP+KKKKKKPNQ KGS+E   SSS MQFS
Subjt:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS

A0A6J1D7V0 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like3.6e-6679.21Show/hide
Query:  SGERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ----------------------
        SGERGKDYAEGS S   SSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ                      
Subjt:  SGERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ----------------------

Query:  -------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFSTP
               GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKP+QNKG+ EESSSSS MQFSTP
Subjt:  -------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFSTP

A0A6P6B9S8 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like4.7e-5872.47Show/hide
Query:  MSGERGKDYAEGSP-SSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------
        MS ERGKD AEGS  ++P   QQP TPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCN NHVLDFLRYLDQFGKTKVH+Q                    
Subjt:  MSGERGKDYAEGSP-SSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ--------------------

Query:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS
                 GRLRAAYEENGGSPETNPFAS AIRVYLREVRECQAKARGIPYKKKKKKPNQ KGS+E   SSS M FS
Subjt:  ---------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS

M5WSR1 ALOG domain-containing protein4.0e-5770.56Show/hide
Query:  MSGERGKDYAEGSPS--SP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ------------------
        MS ERGKD+A+GS S  SP    QQP TPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCN+NHVL+FLRYLDQFGKTKVH+Q                  
Subjt:  MSGERGKDYAEGSPS--SP-PSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ------------------

Query:  -----------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS
                   GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKP+ +KG N++ SSS+ M FS
Subjt:  -----------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFS

SwissProt top hitse value%identityAlignment
B8AH02 Protein G1-like38.9e-3854.43Show/hide
Query:  GERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVH-------------------------
        G  G   A  S S+   +  P TPSRYE+QKRRDWNTFGQYL+N RPP+ L+QC+  HVL+FLRYLDQFGKTKVH                         
Subjt:  GERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVH-------------------------

Query:  ----VQGRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQ
            + GRLRAA+EENGG PE+NPFA  A+R+YLREVRE QA+ARG+ Y+KKK+K  Q
Subjt:  ----VQGRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQ

Q0DZF3 Protein G1-like38.9e-3854.43Show/hide
Query:  GERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVH-------------------------
        G  G   A  S S+   +  P TPSRYE+QKRRDWNTFGQYL+N RPP+ L+QC+  HVL+FLRYLDQFGKTKVH                         
Subjt:  GERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVH-------------------------

Query:  ----VQGRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQ
            + GRLRAA+EENGG PE+NPFA  A+R+YLREVRE QA+ARG+ Y+KKK+K  Q
Subjt:  ----VQGRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQ

Q9LMK2 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 66.1e-3956.86Show/hide
Query:  RGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ-------------------------
        +G D      SSPP+     TPSRYESQKRRDWNTF QYLKN +PP+ LS+C+  HV++FL+YLDQFGKTKVHV                          
Subjt:  RGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ-------------------------

Query:  ----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKK
            GRLRAAYEENGG P++NPFA+ A+R+YLREVRE QAKARGIPY+KKK+K
Subjt:  ----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKK

Q9S7R3 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 103.7e-5266.29Show/hide
Query:  ERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ------------------------
        ERGK   E S S P     P TPSRYESQKRRDWNTFGQYLKNQRPPVP+S C+ NHVLDFLRYLDQFGKTKVHV                         
Subjt:  ERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ------------------------

Query:  -----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPY-KKKKKKPNQNKGSNEESSSSSVMQFS
             GRLRAAYEENGG PETNPFASGAIRVYLREVRECQAKARGIPY KKKKKKP    G   E SSSS   FS
Subjt:  -----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPY-KKKKKKPNQNKGSNEESSSSSVMQFS

Q9ZVA0 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 73.3e-4052.94Show/hide
Query:  RGKDYAEGS--PSSPPSSQ--QPTTP------SRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ---------------
        +GK  AEGS  P S P  Q  QP +P      SRYESQKRRDWNTF QYL+NQ+PPV +SQC  NH+LDFL+YLDQFGKTKVH+                
Subjt:  RGKDYAEGS--PSSPPSSQ--QPTTP------SRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ---------------

Query:  --------------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQN--------KGSNEESSSSSV
                      GRLRAA+EENGG PE NPFA G IRV+LREVR+ QAKARG+PYKK+KK+  +N         G+   SSSS++
Subjt:  --------------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQN--------KGSNEESSSSSV

Arabidopsis top hitse value%identityAlignment
AT1G07090.1 Protein of unknown function (DUF640)4.4e-4056.86Show/hide
Query:  RGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ-------------------------
        +G D      SSPP+     TPSRYESQKRRDWNTF QYLKN +PP+ LS+C+  HV++FL+YLDQFGKTKVHV                          
Subjt:  RGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ-------------------------

Query:  ----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKK
            GRLRAAYEENGG P++NPFA+ A+R+YLREVRE QAKARGIPY+KKK+K
Subjt:  ----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKK

AT1G78815.1 Protein of unknown function (DUF640)2.3e-4152.94Show/hide
Query:  RGKDYAEGS--PSSPPSSQ--QPTTP------SRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ---------------
        +GK  AEGS  P S P  Q  QP +P      SRYESQKRRDWNTF QYL+NQ+PPV +SQC  NH+LDFL+YLDQFGKTKVH+                
Subjt:  RGKDYAEGS--PSSPPSSQ--QPTTP------SRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ---------------

Query:  --------------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQN--------KGSNEESSSSSV
                      GRLRAA+EENGG PE NPFA G IRV+LREVR+ QAKARG+PYKK+KK+  +N         G+   SSSS++
Subjt:  --------------GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQN--------KGSNEESSSSSV

AT2G42610.1 Protein of unknown function (DUF640)2.6e-5366.29Show/hide
Query:  ERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ------------------------
        ERGK   E S S P     P TPSRYESQKRRDWNTFGQYLKNQRPPVP+S C+ NHVLDFLRYLDQFGKTKVHV                         
Subjt:  ERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ------------------------

Query:  -----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPY-KKKKKKPNQNKGSNEESSSSSVMQFS
             GRLRAAYEENGG PETNPFASGAIRVYLREVRECQAKARGIPY KKKKKKP    G   E SSSS   FS
Subjt:  -----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPY-KKKKKKPNQNKGSNEESSSSSVMQFS

AT2G42610.2 Protein of unknown function (DUF640)2.6e-5366.29Show/hide
Query:  ERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ------------------------
        ERGK   E S S P     P TPSRYESQKRRDWNTFGQYLKNQRPPVP+S C+ NHVLDFLRYLDQFGKTKVHV                         
Subjt:  ERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ------------------------

Query:  -----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPY-KKKKKKPNQNKGSNEESSSSSVMQFS
             GRLRAAYEENGG PETNPFASGAIRVYLREVRECQAKARGIPY KKKKKKP    G   E SSSS   FS
Subjt:  -----GRLRAAYEENGGSPETNPFASGAIRVYLREVRECQAKARGIPY-KKKKKKPNQNKGSNEESSSSSVMQFS

AT4G18610.1 Protein of unknown function (DUF640)6.3e-3959.18Show/hide
Query:  PPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ-----------------------------GRLRAAYE
        PP  QQP   SRYESQKRRDWNTF QYLK+Q PP+ +SQ ++ HVL FLRYLDQFGKTKVH Q                             GRLRAAYE
Subjt:  PPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQ-----------------------------GRLRAAYE

Query:  EN-GGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNK
        E+ GGSP+TNPFA+G+IRV+LREVRE QAKARGIPY+KKK++  +N+
Subjt:  EN-GGSPETNPFASGAIRVYLREVRECQAKARGIPYKKKKKKPNQNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGGTGAAAGAGGCAAAGACTATGCAGAAGGATCGCCGAGCTCGCCTCCGTCGTCGCAGCAACCGACGACTCCAAGTCGGTACGAGTCGCAGAAACGGAGGGATTG
GAACACTTTTGGGCAATACTTGAAGAATCAAAGACCTCCTGTTCCGCTCTCTCAGTGCAACTTCAACCATGTGTTGGACTTTCTTAGATATCTAGATCAGTTTGGGAAGA
CAAAAGTTCATGTCCAAGGCAGGTTGAGAGCTGCCTACGAAGAAAACGGAGGATCGCCCGAGACAAACCCTTTTGCAAGTGGTGCAATCAGGGTTTATCTGAGAGAGGTG
AGGGAGTGTCAAGCGAAAGCGAGAGGAATTCCTTATAAAAAGAAGAAGAAGAAGCCGAACCAAAACAAGGGAAGCAATGAAGAATCAAGTAGTAGTTCAGTGATGCAGTT
TTCAACTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTGGTGAAAGAGGCAAAGACTATGCAGAAGGATCGCCGAGCTCGCCTCCGTCGTCGCAGCAACCGACGACTCCAAGTCGGTACGAGTCGCAGAAACGGAGGGATTG
GAACACTTTTGGGCAATACTTGAAGAATCAAAGACCTCCTGTTCCGCTCTCTCAGTGCAACTTCAACCATGTGTTGGACTTTCTTAGATATCTAGATCAGTTTGGGAAGA
CAAAAGTTCATGTCCAAGGCAGGTTGAGAGCTGCCTACGAAGAAAACGGAGGATCGCCCGAGACAAACCCTTTTGCAAGTGGTGCAATCAGGGTTTATCTGAGAGAGGTG
AGGGAGTGTCAAGCGAAAGCGAGAGGAATTCCTTATAAAAAGAAGAAGAAGAAGCCGAACCAAAACAAGGGAAGCAATGAAGAATCAAGTAGTAGTTCAGTGATGCAGTT
TTCAACTCCTTAG
Protein sequenceShow/hide protein sequence
MSGERGKDYAEGSPSSPPSSQQPTTPSRYESQKRRDWNTFGQYLKNQRPPVPLSQCNFNHVLDFLRYLDQFGKTKVHVQGRLRAAYEENGGSPETNPFASGAIRVYLREV
RECQAKARGIPYKKKKKKPNQNKGSNEESSSSSVMQFSTP