; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034383 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034383
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr3:6849652..6850612
RNA-Seq ExpressionLag0034383
SyntenyLag0034383
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592918.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]4.7e-4760.69Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCADPS+GN R  ERMFDY+Q  SLFVYNVMVKAY KR                                        GEK+H FVVK GM  D YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMYAEL ++G+AKKLFDEMP  D VSWNV+I+GYVRCRRFEDA+N FKEMQQESNEKP EA VVST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

KAG7025324.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]4.7e-4760.69Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCADPS+GN R  ERMFDY+Q  SLFVYNVMVKAY KR                                        GEK+H FVVK GM  D YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMYAEL ++G+AKKLFDEMP  D VSWNV+I+GYVRCRRFEDA+N FKEMQQESNEKP EA VVST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

XP_022960435.1 pentatricopeptide repeat-containing protein At1g31430 [Cucurbita moschata]4.7e-4760.69Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCADPS+GN R  ERMFDY+Q  SLFVYNVMVKAY KR                                        GEK+H FVVK GM  D YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMYAEL ++G+AKKLFDEMP  D VSWNV+I+GYVRCRRFEDA+N FKEMQQESNEKP EA VVST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

XP_023513900.1 pentatricopeptide repeat-containing protein At1g31430 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-4660.12Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCADPS+GN R  E+MFDY+Q  SLFVYNVMVKAY KR                                        GEK+H FVVK GM  D YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMYAEL ++G+AKKLFDEMP  D VSWNV+I+GYVRCRRFEDA+N FKEMQQESNEKP EA VVST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

XP_023513901.1 pentatricopeptide repeat-containing protein At1g31430 isoform X2 [Cucurbita pepo subsp. pepo]1.1e-4660.12Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCADPS+GN R  E+MFDY+Q  SLFVYNVMVKAY KR                                        GEK+H FVVK GM  D YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMYAEL ++G+AKKLFDEMP  D VSWNV+I+GYVRCRRFEDA+N FKEMQQESNEKP EA VVST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

TrEMBL top hitse value%identityAlignment
A0A1S3CPD2 pentatricopeptide repeat-containing protein At1g314304.8e-4558.96Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCAD SLGN R AE++FDYVQD SLFVYNVMVK Y KR                                        GEK+H FVVK GM+ D+YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMY+EL  + +AKKLFDEM T D VSWNVMI+GYV CRRFEDA+NTF+EMQQE NEKP EA VVST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

A0A5A7TXP7 Pentatricopeptide repeat-containing protein4.8e-4558.96Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCAD SLGN R AE++FDYVQD SLFVYNVMVK Y KR                                        GEK+H FVVK GM+ D+YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMY+EL  + +AKKLFDEM T D VSWNVMI+GYV CRRFEDA+NTF+EMQQE NEKP EA VVST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

A0A6J1CTE5 pentatricopeptide repeat-containing protein At1g314308.7e-4760.12Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCADPSLGN R AER+FDY+Q+  LFVYNVMVKAY KR                                        GEKIH FVVK GMD+D+YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMYAEL +I  A+KLFDEM T D VSWNV+I+GYVRCRRFEDA+NTFKEMQ+ESN KP EA +VST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

A0A6J1H7E9 pentatricopeptide repeat-containing protein At1g314302.3e-4760.69Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCADPS+GN R  ERMFDY+Q  SLFVYNVMVKAY KR                                        GEK+H FVVK GM  D YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMYAEL ++G+AKKLFDEMP  D VSWNV+I+GYVRCRRFEDA+N FKEMQQESNEKP EA VVST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

A0A6J1KW17 pentatricopeptide repeat-containing protein At1g314308.7e-4760.12Show/hide
Query:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC
        MAFCADPS+GN    ERMFDY+Q  SLFVYNVMVKAY KR                                        GEK+H FVVK GM  D YVC
Subjt:  MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKR----------------------------------------GEKIHVFVVKAGMDYDDYVC

Query:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA
        NSLMDMYAEL ++G+AKKLFDEMP  D VSWNV+I+GYVRCRRFEDA+N FKEMQQESNEKP EA VVST SA
Subjt:  NSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVSTHSA

SwissProt top hitse value%identityAlignment
O64705 Pentatricopeptide repeat-containing protein At2g344008.5e-1533.33Show/hide
Query:  FVYNVMVKAYNKR-----GEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKP
        F YN +  A  K      G  +H  + K G++ D ++ +SL+ MYA+  Q+G A+KLFDE+   D VSWN MI+GY      +DA++ F++M++E  E P
Subjt:  FVYNVMVKAYNKR-----GEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKP

Query:  SEAAVVS-----THSAFLHDHHDFLLVLWTTVINGYVHGG---CSPYRLCSVGSSRAREMDSWMTMDAV-------VVSGNGKTGETPRLFTEME
         E  +VS     +H   L        +  T  I      G    S Y  C    S  R  +  +  D V       V S NGK+ E  +LF EME
Subjt:  SEAAVVS-----THSAFLHDHHDFLLVLWTTVINGYVHGG---CSPYRLCSVGSSRAREMDSWMTMDAV-------VVSGNGKTGETPRLFTEME

Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic2.1e-1345.56Show/hide
Query:  RGEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVS
        R E IH FVVK G+D D +V N+LMDMY+ L +I  A ++F +M   D V+WN MI GYV     EDAL    +MQ    +    A+ VS
Subjt:  RGEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVS

Q9C866 Pentatricopeptide repeat-containing protein At1g314305.0e-2335.12Show/hide
Query:  MFDYVQDQSLFVYN----VMVKAYNK-----RGEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDAL
        +F  ++ Q L+  N    V++K+  +      GEK+H + VKAG+++D YV NSLM MYA L +I    K+FDEMP  D VSWN +I+ YV   RFEDA+
Subjt:  MFDYVQDQSLFVYN----VMVKAYNK-----RGEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDAL

Query:  NTFKEMQQESNEKPSEAAVVSTHSA---------------FLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVVVSGNGKTGE
          FK M QESN K  E  +VST SA               F+    +  + +   +++ +   GC         S R + +  W +M    VS  G+  E
Subjt:  NTFKEMQQESNEKPSEAAVVSTHSA---------------FLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVVVSGNGKTGE

Query:  TPRLF
           LF
Subjt:  TPRLF

Q9SJG6 Pentatricopeptide repeat-containing protein At2g42920, chloroplastic3.2e-1430.3Show/hide
Query:  VQDQSLFVYNVMVKAYNKRGEKIHVFVVKAGM-DYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNE
        ++D S F+ N M+  Y   G  I  + +  GM  +D    NS++  +A+   I  A+ LFDEMP  + VSWN MI+G+VR  RF+DAL+ F+EM QE + 
Subjt:  VQDQSLFVYNVMVKAYNKRGEKIHVFVVKAGM-DYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNE

Query:  KPSEAAVVS----------------THSAFLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVVVSGNGKTGETPRLFTEME
        KP    +VS                 H   + +  +   ++ T +I+ Y   GC    L     +  +++  W +M  + ++ NG       LF+E+E
Subjt:  KPSEAAVVS----------------THSAFLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVVVSGNGKTGETPRLFTEME

Q9ZUT5 Pentatricopeptide repeat-containing protein At2g373101.1e-1431.67Show/hide
Query:  KIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVS-------------
        ++H FV++ G D D +V N ++  Y +   I SA+K+FDEM   D VSWN MI+GY +   FED    +K M   S+ KP+   V+S             
Subjt:  KIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVS-------------

Query:  ---THSAFLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVV--VSGNGKTGETPRLFTEMELV
            H   + +H    L L   VI  Y   G   Y           E DS +T  A++     +G   E   LF+EME +
Subjt:  ---THSAFLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVV--VSGNGKTGETPRLFTEMELV

Arabidopsis top hitse value%identityAlignment
AT1G31430.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.5e-2435.12Show/hide
Query:  MFDYVQDQSLFVYN----VMVKAYNK-----RGEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDAL
        +F  ++ Q L+  N    V++K+  +      GEK+H + VKAG+++D YV NSLM MYA L +I    K+FDEMP  D VSWN +I+ YV   RFEDA+
Subjt:  MFDYVQDQSLFVYN----VMVKAYNK-----RGEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDAL

Query:  NTFKEMQQESNEKPSEAAVVSTHSA---------------FLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVVVSGNGKTGE
          FK M QESN K  E  +VST SA               F+    +  + +   +++ +   GC         S R + +  W +M    VS  G+  E
Subjt:  NTFKEMQQESNEKPSEAAVVSTHSA---------------FLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVVVSGNGKTGE

Query:  TPRLF
           LF
Subjt:  TPRLF

AT2G34400.1 Pentatricopeptide repeat (PPR-like) superfamily protein6.0e-1633.33Show/hide
Query:  FVYNVMVKAYNKR-----GEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKP
        F YN +  A  K      G  +H  + K G++ D ++ +SL+ MYA+  Q+G A+KLFDE+   D VSWN MI+GY      +DA++ F++M++E  E P
Subjt:  FVYNVMVKAYNKR-----GEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKP

Query:  SEAAVVS-----THSAFLHDHHDFLLVLWTTVINGYVHGG---CSPYRLCSVGSSRAREMDSWMTMDAV-------VVSGNGKTGETPRLFTEME
         E  +VS     +H   L        +  T  I      G    S Y  C    S  R  +  +  D V       V S NGK+ E  +LF EME
Subjt:  SEAAVVS-----THSAFLHDHHDFLLVLWTTVINGYVHGG---CSPYRLCSVGSSRAREMDSWMTMDAV-------VVSGNGKTGETPRLFTEME

AT2G37310.1 Pentatricopeptide repeat (PPR) superfamily protein7.9e-1631.67Show/hide
Query:  KIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVS-------------
        ++H FV++ G D D +V N ++  Y +   I SA+K+FDEM   D VSWN MI+GY +   FED    +K M   S+ KP+   V+S             
Subjt:  KIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVS-------------

Query:  ---THSAFLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVV--VSGNGKTGETPRLFTEMELV
            H   + +H    L L   VI  Y   G   Y           E DS +T  A++     +G   E   LF+EME +
Subjt:  ---THSAFLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVV--VSGNGKTGETPRLFTEMELV

AT2G42920.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.3e-1530.3Show/hide
Query:  VQDQSLFVYNVMVKAYNKRGEKIHVFVVKAGM-DYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNE
        ++D S F+ N M+  Y   G  I  + +  GM  +D    NS++  +A+   I  A+ LFDEMP  + VSWN MI+G+VR  RF+DAL+ F+EM QE + 
Subjt:  VQDQSLFVYNVMVKAYNKRGEKIHVFVVKAGM-DYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNE

Query:  KPSEAAVVS----------------THSAFLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVVVSGNGKTGETPRLFTEME
        KP    +VS                 H   + +  +   ++ T +I+ Y   GC    L     +  +++  W +M  + ++ NG       LF+E+E
Subjt:  KPSEAAVVS----------------THSAFLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVVVSGNGKTGETPRLFTEME

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-1445.56Show/hide
Query:  RGEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVS
        R E IH FVVK G+D D +V N+LMDMY+ L +I  A ++F +M   D V+WN MI GYV     EDAL    +MQ    +    A+ VS
Subjt:  RGEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNTFKEMQQESNEKPSEAAVVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATTTTGTGCAGACCCGTCTCTTGGAAACTGGCGCTGTGCGGAGAGGATGTTCGATTACGTACAAGACCAATCTCTGTTTGTTTATAATGTGATGGTTAAAGCGTA
TAACAAAAGGGGTGAAAAGATTCATGTGTTTGTGGTGAAGGCGGGGATGGATTATGATGATTATGTTTGTAATTCACTTATGGATATGTATGCTGAATTGAGCCAGATTG
GGAGTGCTAAGAAGTTGTTCGACGAAATGCCGACTACAGATTATGTTTCTTGGAATGTTATGATTGCTGGGTATGTTAGGTGTCGGAGATTTGAGGATGCTTTGAATACA
TTTAAGGAAATGCAGCAAGAGAGCAATGAGAAACCCAGTGAAGCTGCTGTAGTTAGCACTCATTCTGCCTTTCTTCACGACCATCATGATTTCTTGCTTGTTCTGTGGAC
AACTGTTATTAATGGGTATGTTCATGGTGGTTGCTCTCCTTACAGGTTGTGCTCAGTTGGTAGCTCTAGAGCAAGGGAAATGGACTCATGGATGACAATGGATGCTGTGG
TTGTCTCGGGGAATGGGAAGACTGGTGAAACACCAAGGCTGTTCACAGAAATGGAACTTGTACGAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCATTTTGTGCAGACCCGTCTCTTGGAAACTGGCGCTGTGCGGAGAGGATGTTCGATTACGTACAAGACCAATCTCTGTTTGTTTATAATGTGATGGTTAAAGCGTA
TAACAAAAGGGGTGAAAAGATTCATGTGTTTGTGGTGAAGGCGGGGATGGATTATGATGATTATGTTTGTAATTCACTTATGGATATGTATGCTGAATTGAGCCAGATTG
GGAGTGCTAAGAAGTTGTTCGACGAAATGCCGACTACAGATTATGTTTCTTGGAATGTTATGATTGCTGGGTATGTTAGGTGTCGGAGATTTGAGGATGCTTTGAATACA
TTTAAGGAAATGCAGCAAGAGAGCAATGAGAAACCCAGTGAAGCTGCTGTAGTTAGCACTCATTCTGCCTTTCTTCACGACCATCATGATTTCTTGCTTGTTCTGTGGAC
AACTGTTATTAATGGGTATGTTCATGGTGGTTGCTCTCCTTACAGGTTGTGCTCAGTTGGTAGCTCTAGAGCAAGGGAAATGGACTCATGGATGACAATGGATGCTGTGG
TTGTCTCGGGGAATGGGAAGACTGGTGAAACACCAAGGCTGTTCACAGAAATGGAACTTGTACGAGCTTGA
Protein sequenceShow/hide protein sequence
MAFCADPSLGNWRCAERMFDYVQDQSLFVYNVMVKAYNKRGEKIHVFVVKAGMDYDDYVCNSLMDMYAELSQIGSAKKLFDEMPTTDYVSWNVMIAGYVRCRRFEDALNT
FKEMQQESNEKPSEAAVVSTHSAFLHDHHDFLLVLWTTVINGYVHGGCSPYRLCSVGSSRAREMDSWMTMDAVVVSGNGKTGETPRLFTEMELVRA