; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10005161 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10005161
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr07:137622..138110
RNA-Seq ExpressionHG10005161
SyntenyHG10005161
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011986.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-5982.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MPIR+VVAWNT+IAGKAQ+GCSEEVLN YNMMK AGFRPD+ITFVSVIS   ELATL QGQQIHAEVIKAGAGSVVAVVSSLISMYSR GC EDSVK F+
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED+D VLWSAM A +GFHGRGEEAI L  FH ME+LKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

XP_022147886.1 pentatricopeptide repeat-containing protein At2g41080 isoform X1 [Momordica charantia]2.1e-5880.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MP R+VVAWNT+IAGKAQ+GCSEEVLN YNMMK AGFRPD+ITFVSVIS   ELATL QGQQIHAE IKAGAGSVVAV+SSLISMYSR GC +DSVKAFL
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED+D VLWSAM A +GFHGRGEE I L  FH ME+LKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

XP_022953052.1 pentatricopeptide repeat-containing protein At2g41080 [Cucurbita moschata]1.5e-5982.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MPIR+VVAWNT+IAGKAQ+GCSEEVLN YNMMK AGFRPD+ITFVSVIS   ELATL QGQQIHAEVIKAGAGSVVAVVSSLISMYSR GC EDSVK F+
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED+D VLWSAM A +GFHGRGEEAI L  FH ME+LKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

XP_022969026.1 pentatricopeptide repeat-containing protein At2g41080 [Cucurbita maxima]6.6e-6083.33Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MPIR+VVAWNT+IAGKAQ+GCSEEVLN YNMMK AGFRPD+ITFVSVIS   ELATL QGQQIHAEVIKAGAGSVVAVVSSLISMYSR GC EDSVK FL
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED+D VLWSAM A +GFHGRGEEAI L  FH ME+LKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

XP_023553764.1 pentatricopeptide repeat-containing protein At2g41080 [Cucurbita pepo subsp. pepo]1.5e-5982.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MPIR+VVAWNT+IAGKAQ+GCSEEVLN YNMMK AGFRPD+ITFVSVIS   ELATL QGQQIHAEVIKAGAGSVVAVVSSLISMYSR GC EDSVK F+
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED+D VLWSAM A +GFHGRGEEAI L  FH ME+LKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

TrEMBL top hitse value%identityAlignment
A0A5A7SLF2 Pentatricopeptide repeat-containing protein1.1e-5778.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MPIR VVAWNT+IAGKAQ+GC EEVLN YNMMK AGFRPD+ITFVSV+S   ELATL QGQQIHAEVIKAGA SV+AV+SSLISMYSR GC EDS+KAF+
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED D VLWS+M A +GFHGRGEEA+ L  FH MEDLKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

A0A6J1D3K7 pentatricopeptide repeat-containing protein At2g41080 isoform X21.0e-5880.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MP R+VVAWNT+IAGKAQ+GCSEEVLN YNMMK AGFRPD+ITFVSVIS   ELATL QGQQIHAE IKAGAGSVVAV+SSLISMYSR GC +DSVKAFL
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED+D VLWSAM A +GFHGRGEE I L  FH ME+LKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

A0A6J1D3P8 pentatricopeptide repeat-containing protein At2g41080 isoform X11.0e-5880.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MP R+VVAWNT+IAGKAQ+GCSEEVLN YNMMK AGFRPD+ITFVSVIS   ELATL QGQQIHAE IKAGAGSVVAV+SSLISMYSR GC +DSVKAFL
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED+D VLWSAM A +GFHGRGEE I L  FH ME+LKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

A0A6J1GM53 pentatricopeptide repeat-containing protein At2g410807.1e-6082.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MPIR+VVAWNT+IAGKAQ+GCSEEVLN YNMMK AGFRPD+ITFVSVIS   ELATL QGQQIHAEVIKAGAGSVVAVVSSLISMYSR GC EDSVK F+
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED+D VLWSAM A +GFHGRGEEAI L  FH ME+LKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

A0A6J1HWJ8 pentatricopeptide repeat-containing protein At2g410803.2e-6083.33Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MPIR+VVAWNT+IAGKAQ+GCSEEVLN YNMMK AGFRPD+ITFVSVIS   ELATL QGQQIHAEVIKAGAGSVVAVVSSLISMYSR GC EDSVK FL
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        DRED+D VLWSAM A +GFHGRGEEAI L  FH ME+LKME N+VTFLSL
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

SwissProt top hitse value%identityAlignment
O81767 Pentatricopeptide repeat-containing protein At4g339901.3e-2137.09Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAG-FRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAF
        +P  DV++WNT+I+G AQ+G + E + +YN+M+  G    ++ T+VSV+    +   L QG ++H  ++K G    V VV+SL  MY + G  ED++  F
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAG-FRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAF

Query:  LDREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
              + V W+ + A HGFHG GE+A+ LF   L E +K +   +TF++L
Subjt:  LDREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

Q8S9M4 Pentatricopeptide repeat-containing protein At2g410801.6e-4560.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MP+R++VAWNT+I G AQ+GC E VL LY MMK +G RP++ITFV+V+S+  +LA   QGQQIHAE IK GA SVVAVVSSLISMYS+ GC  D+ KAF 
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        +RED D V+WS+M + +GFHG+G+EAI LF+  + E   ME+N+V FL+L
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

Q9FWA6 Pentatricopeptide repeat-containing protein At3g02330, mitochondrial3.7e-2139.58Show/hide
Query:  VAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFLDREDSD
        V+WN++I+G      SE+   L+  M   G  PD+ T+ +V+ T   LA+   G+QIHA+VIK    S V + S+L+ MYS+ G   DS   F      D
Subjt:  VAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFLDREDSD

Query:  GVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
         V W+AM   +  HG+GEEAI LF   ++E++K   N VTF+S+
Subjt:  GVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233306.2e-2137.41Show/hide
Query:  RDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFLDRE
        RD ++WN+++AG  Q+G   E L L+  M TA  +P  + F SVI     LATL  G+Q+H  V++ G GS + + S+L+ MYS+ G  + + K F    
Subjt:  RDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFLDRE

Query:  DSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
          D V W+A+   H  HG G EA+ L  F  M+   ++ NQV F+++
Subjt:  DSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220705.3e-2040.15Show/hide
Query:  RDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAF-LDR
        RDVVAW  +I G  QHG   E +NL+  M   G RP+  T  +++S    LA+L+ G+QIH   +K+G    V+V ++LI+MY++ G    + +AF L R
Subjt:  RDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAF-LDR

Query:  EDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLK
         + D V W++M      HG  EEA+ LF   LME L+
Subjt:  EDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLK

Arabidopsis top hitse value%identityAlignment
AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein3.7e-2140.15Show/hide
Query:  RDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAF-LDR
        RDVVAW  +I G  QHG   E +NL+  M   G RP+  T  +++S    LA+L+ G+QIH   +K+G    V+V ++LI+MY++ G    + +AF L R
Subjt:  RDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAF-LDR

Query:  EDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLK
         + D V W++M      HG  EEA+ LF   LME L+
Subjt:  EDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLK

AT2G41080.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-4660.67Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL
        MP+R++VAWNT+I G AQ+GC E VL LY MMK +G RP++ITFV+V+S+  +LA   QGQQIHAE IK GA SVVAVVSSLISMYS+ GC  D+ KAF 
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFL

Query:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
        +RED D V+WS+M + +GFHG+G+EAI LF+  + E   ME+N+V FL+L
Subjt:  DREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein2.6e-2239.58Show/hide
Query:  VAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFLDREDSD
        V+WN++I+G      SE+   L+  M   G  PD+ T+ +V+ T   LA+   G+QIHA+VIK    S V + S+L+ MYS+ G   DS   F      D
Subjt:  VAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFLDREDSD

Query:  GVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
         V W+AM   +  HG+GEEAI LF   ++E++K   N VTF+S+
Subjt:  GVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.4e-2237.41Show/hide
Query:  RDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFLDRE
        RD ++WN+++AG  Q+G   E L L+  M TA  +P  + F SVI     LATL  G+Q+H  V++ G GS + + S+L+ MYS+ G  + + K F    
Subjt:  RDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFLDRE

Query:  DSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
          D V W+A+   H  HG G EA+ L  F  M+   ++ NQV F+++
Subjt:  DSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.9e-2337.09Show/hide
Query:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAG-FRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAF
        +P  DV++WNT+I+G AQ+G + E + +YN+M+  G    ++ T+VSV+    +   L QG ++H  ++K G    V VV+SL  MY + G  ED++  F
Subjt:  MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAG-FRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAF

Query:  LDREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL
              + V W+ + A HGFHG GE+A+ LF   L E +K +   +TF++L
Subjt:  LDREDSDGVLWSAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAATTCGTGATGTGGTTGCTTGGAATACTGTCATTGCTGGAAAAGCTCAACATGGGTGTTCAGAAGAAGTGTTGAACCTGTATAATATGATGAAAACGGCAGGCTT
TCGACCGGATAGAATAACATTTGTGAGTGTAATAAGTACGTATTTGGAACTGGCGACGTTAGCACAAGGCCAGCAGATCCATGCTGAAGTGATCAAAGCTGGAGCTGGTT
CAGTTGTAGCAGTTGTTAGTTCATTGATTAGCATGTATTCACGGTTTGGGTGTCCAGAGGACTCTGTGAAAGCCTTTTTGGATCGTGAAGATTCTGATGGTGTGTTATGG
AGTGCTATGACTGCTACTCATGGATTCCATGGGAGAGGAGAGGAAGCTATTGGGTTGTTTCACTTTCACCTTATGGAAGATTTGAAAATGGAGGTAAATCAAGTGACCTT
CTTGAGTCTGCAGCATGAATATAAGTTGAGAGATGCAATATCTACATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAATTCGTGATGTGGTTGCTTGGAATACTGTCATTGCTGGAAAAGCTCAACATGGGTGTTCAGAAGAAGTGTTGAACCTGTATAATATGATGAAAACGGCAGGCTT
TCGACCGGATAGAATAACATTTGTGAGTGTAATAAGTACGTATTTGGAACTGGCGACGTTAGCACAAGGCCAGCAGATCCATGCTGAAGTGATCAAAGCTGGAGCTGGTT
CAGTTGTAGCAGTTGTTAGTTCATTGATTAGCATGTATTCACGGTTTGGGTGTCCAGAGGACTCTGTGAAAGCCTTTTTGGATCGTGAAGATTCTGATGGTGTGTTATGG
AGTGCTATGACTGCTACTCATGGATTCCATGGGAGAGGAGAGGAAGCTATTGGGTTGTTTCACTTTCACCTTATGGAAGATTTGAAAATGGAGGTAAATCAAGTGACCTT
CTTGAGTCTGCAGCATGAATATAAGTTGAGAGATGCAATATCTACATGA
Protein sequenceShow/hide protein sequence
MPIRDVVAWNTVIAGKAQHGCSEEVLNLYNMMKTAGFRPDRITFVSVISTYLELATLAQGQQIHAEVIKAGAGSVVAVVSSLISMYSRFGCPEDSVKAFLDREDSDGVLW
SAMTATHGFHGRGEEAIGLFHFHLMEDLKMEVNQVTFLSLQHEYKLRDAIST