; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013483 (gene) of Chayote v1 genome

Gene IDSed0013483
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG13:25937473..25938513
RNA-Seq ExpressionSed0013483
SyntenySed0013483
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4274760.1 unnamed protein product [Prunus armeniaca]7.4e-2327.27Show/hide
Query:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--
        S++S  + +W S W V+  P +K  +W    N +  + N+  + V     C  CG  +E   H+  EC  A   W      F        S  +GE F  
Subjt:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--

Query:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVD----MPTSLPRSCDRWTPPWIETWKLNVDG
         W  + T +   E+    I      +W IWK RN  + + ++A+   +  L+   ++    +      +      P S P     W  P +   K+N D 
Subjt:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVD----MPTSLPRSCDRWTPPWIETWKLNVDG

Query:  SWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFR
        +W      GGVGW+     G+ +     G+L    A +ME L IR  L    +C      +I+VESDS   I +LNG +   +++  IV DIR+++    
Subjt:  SWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFR

Query:  SISFSFCRRECNRAADLLA
         +SF F  R CN+AA  +A
Subjt:  SISFSFCRRECNRAADLLA

CAB4274761.1 unnamed protein product [Prunus armeniaca]7.4e-2327.27Show/hide
Query:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--
        S++S  + +W S W V+  P +K  +W    N +  + N+  + V     C  CG  +E   H+  EC  A   W      F        S  +GE F  
Subjt:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--

Query:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVD----MPTSLPRSCDRWTPPWIETWKLNVDG
         W  + T +   E+    I      +W IWK RN  + + ++A+   +  L+   ++    +      +      P S P     W  P +   K+N D 
Subjt:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVD----MPTSLPRSCDRWTPPWIETWKLNVDG

Query:  SWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFR
        +W      GGVGW+     G+ +     G+L    A +ME L IR  L    +C      +I+VESDS   I +LNG +   +++  IV DIR+++    
Subjt:  SWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFR

Query:  SISFSFCRRECNRAADLLA
         +SF F  R CN+AA  +A
Subjt:  SISFSFCRRECNRAADLLA

CAB4303756.1 unnamed protein product [Prunus armeniaca]5.7e-2328.35Show/hide
Query:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--
        S++S  + +W S W V+  P +K  +W    N +  + N++ + V     C  CG  +E   H+  EC  A   W      F        S  +GE F  
Subjt:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--

Query:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDM------PTSLPRSCDRWTPPWIETWKLNV
         W  + T +   E+    I      +W IWK RN  + + ++A+      L+     L  VS    +  D+      P S P     W  P     K+N 
Subjt:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDM------PTSLPRSCDRWTPPWIETWKLNV

Query:  DGSWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPR
        D +W      GGVGW+     G+ +     G+L    A +ME L IR  L    +C      +I+VESDS   I +LNG +   +++  IV DIR+++  
Subjt:  DGSWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPR

Query:  FRSISFSFCRRECNRAADLLA
           +SF F  R CN+AA  +A
Subjt:  FRSISFSFCRRECNRAADLLA

CAB4321714.1 unnamed protein product [Prunus armeniaca]1.7e-2228.08Show/hide
Query:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--
        S++S  + +W S W V+  P +K  +W    N +  + N+  + V     C  CG  +E   H+  EC  A   W      F        S  +GE F  
Subjt:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--

Query:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDM------PTSLPRSCDRWTPPWIETWKLNV
         W  + T +   E+    I      +W IWK RN  + + ++A+      L+     L  VS    +  D+      P S P     W  P     K+N 
Subjt:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDM------PTSLPRSCDRWTPPWIETWKLNV

Query:  DGSWKEDSGNGGVGWL----------GMQMGELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPR
        D +W      GGVGW+              G+L    A +ME L IR  L    +C      +I+VESDS   I +LNG +   +++  IV DIR+++  
Subjt:  DGSWKEDSGNGGVGWL----------GMQMGELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPR

Query:  FRSISFSFCRRECNRAA
           +SF F  R CN+AA
Subjt:  FRSISFSFCRRECNRAA

XP_030479022.1 uncharacterized protein LOC115696252 [Cannabis sativa]1.7e-2225.73Show/hide
Query:  MAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESFW-HGICT
        ++  WNS W++ + P +K  +W    N +P+K  +Q + V I+S+C LC  + E +LHL   C  A +  +++     D      S +S  + W H + +
Subjt:  MAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESFW-HGICT

Query:  KIPRDEYFKAIIIVWNIWKCRNSVLHNKIKANKETLENLISNSINL---CSVSSVSGSSVDMPTSLPRSCDRWTPPWIETWKLNVDGSWKEDSGNGGVGW
          P  E     ++ W IW+CRN ++ NK + + + +  + S ++ L    S  S+S +  +  TS   S +RWT P     K+NVD +   +  + G G+
Subjt:  KIPRDEYFKAIIIVWNIWKCRNSVLHNKIKANKETLENLISNSINL---CSVSSVSGSSVDMPTSLPRSCDRWTPPWIETWKLNVDGSWKEDSGNGGVGW

Query:  ----------LGMQMGELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFRSISFSFCRRECNR
                  L  Q  +L    A + E + I++ L  I     +    + VESD   ++  +N      +   +I+ D ++++    ++S SF +R  N 
Subjt:  ----------LGMQMGELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFRSISFSFCRRECNR

Query:  AADLLAK
         A  LA+
Subjt:  AADLLAK

TrEMBL top hitse value%identityAlignment
A0A6J5UAY2 Reverse transcriptase domain-containing protein4.4e-2125.16Show/hide
Query:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--
        S      +IW S W +++ P I+  +W   +N +  K N+  + +D+ +SC LCG+  E+ +H+  +C+ A   W +  P   D          GE F  
Subjt:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--

Query:  -WHGICTKIPRDEYFKAIIIV-----WNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDMPTSLPR-------SCDRWTPPWIETWKLN
         W  +  K  + E+   I+ V     W IWK RN  +                + ++    + + G    +   LPR       +   W  P     K+N
Subjt:  -WHGICTKIPRDEYFKAIIIV-----WNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDMPTSLPR-------SCDRWTPPWIETWKLN

Query:  VDGSWKEDSGNGGVGWLGMQ----------MGELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIP
         D +W   +  GGVGW+              G++R   A  ME   IR+ L   +  G+   +++ VESDS  +I ++ G       +  IV DI++++ 
Subjt:  VDGSWKEDSGNGGVGWLGMQ----------MGELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIP

Query:  RFRSISFSFCRRECNRAADLLA
        +F+   F +  R CN+AA  +A
Subjt:  RFRSISFSFCRRECNRAADLLA

A0A6J5UE59 Reverse transcriptase domain-containing protein3.6e-2327.27Show/hide
Query:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--
        S++S  + +W S W V+  P +K  +W    N +  + N+  + V     C  CG  +E   H+  EC  A   W      F        S  +GE F  
Subjt:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--

Query:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVD----MPTSLPRSCDRWTPPWIETWKLNVDG
         W  + T +   E+    I      +W IWK RN  + + ++A+   +  L+   ++    +      +      P S P     W  P +   K+N D 
Subjt:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVD----MPTSLPRSCDRWTPPWIETWKLNVDG

Query:  SWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFR
        +W      GGVGW+     G+ +     G+L    A +ME L IR  L    +C      +I+VESDS   I +LNG +   +++  IV DIR+++    
Subjt:  SWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFR

Query:  SISFSFCRRECNRAADLLA
         +SF F  R CN+AA  +A
Subjt:  SISFSFCRRECNRAADLLA

A0A6J5UF50 Uncharacterized protein3.6e-2327.27Show/hide
Query:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--
        S++S  + +W S W V+  P +K  +W    N +  + N+  + V     C  CG  +E   H+  EC  A   W      F        S  +GE F  
Subjt:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--

Query:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVD----MPTSLPRSCDRWTPPWIETWKLNVDG
         W  + T +   E+    I      +W IWK RN  + + ++A+   +  L+   ++    +      +      P S P     W  P +   K+N D 
Subjt:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVD----MPTSLPRSCDRWTPPWIETWKLNVDG

Query:  SWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFR
        +W      GGVGW+     G+ +     G+L    A +ME L IR  L    +C      +I+VESDS   I +LNG +   +++  IV DIR+++    
Subjt:  SWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFR

Query:  SISFSFCRRECNRAADLLA
         +SF F  R CN+AA  +A
Subjt:  SISFSFCRRECNRAADLLA

A0A6J5WPU6 Reverse transcriptase domain-containing protein2.8e-2328.35Show/hide
Query:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--
        S++S  + +W S W V+  P +K  +W    N +  + N++ + V     C  CG  +E   H+  EC  A   W      F        S  +GE F  
Subjt:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--

Query:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDM------PTSLPRSCDRWTPPWIETWKLNV
         W  + T +   E+    I      +W IWK RN  + + ++A+      L+     L  VS    +  D+      P S P     W  P     K+N 
Subjt:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDM------PTSLPRSCDRWTPPWIETWKLNV

Query:  DGSWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPR
        D +W      GGVGW+     G+ +     G+L    A +ME L IR  L    +C      +I+VESDS   I +LNG +   +++  IV DIR+++  
Subjt:  DGSWKEDSGNGGVGWL-----GMQM-----GELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPR

Query:  FRSISFSFCRRECNRAADLLA
           +SF F  R CN+AA  +A
Subjt:  FRSISFSFCRRECNRAADLLA

A0A6J5YDN0 Uncharacterized protein8.0e-2328.08Show/hide
Query:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--
        S++S  + +W S W V+  P +K  +W    N +  + N+  + V     C  CG  +E   H+  EC  A   W      F        S  +GE F  
Subjt:  SNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESF--

Query:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDM------PTSLPRSCDRWTPPWIETWKLNV
         W  + T +   E+    I      +W IWK RN  + + ++A+      L+     L  VS    +  D+      P S P     W  P     K+N 
Subjt:  -WHGICTKIPRDEYFKAIII-----VWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDM------PTSLPRSCDRWTPPWIETWKLNV

Query:  DGSWKEDSGNGGVGWL----------GMQMGELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPR
        D +W      GGVGW+              G+L    A +ME L IR  L    +C      +I+VESDS   I +LNG +   +++  IV DIR+++  
Subjt:  DGSWKEDSGNGGVGWL----------GMQMGELRRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPR

Query:  FRSISFSFCRRECNRAA
           +SF F  R CN+AA
Subjt:  FRSISFSFCRRECNRAA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein4.4e-1324.07Show/hide
Query:  PMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESFWHGICTKIPRDEYFKAII--I
        P IK+ +W      +P  A +  + +     C  CG A E + H++  C  A   W ++ P                +         P   +   +   I
Subjt:  PMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESFWHGICTKIPRDEYFKAII--I

Query:  VWNIWKCRNS-VLHNKIKANKETLENLISNSINLCSVSSVSGSSVDMPTSLPRSCDRWTPPWIETWKLNVDGSWKEDSGNGGVGWLGMQMGELRRKCAN-
         W+IWK RN  +  N   +  ET+   + +++   S + ++   V + T  P S    T  ++      VD +W++ S   G GW+        ++    
Subjt:  VWNIWKCRNS-VLHNKIKANKETLENLISNSINLCSVSSVSGSSVDMPTSLPRSCDRWTPPWIETWKLNVDGSWKEDSGNGGVGWLGMQMGELRRKCAN-

Query:  -----------VMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFRSISFSFCRRECNRAADLLAKL
                     E   I+  + H L       +++LV SDS +I+  LN  +  + EI  ++ +IR +  RFRSISF F  R  N  AD  AKL
Subjt:  -----------VMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFRSISFSFCRRECNRAADLLAKL

AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.2e-0423.44Show/hide
Query:  WKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFP-------FFTDFASFFRSADSGESFWHGICTKI
        W     P     +W    + +P++  + +     +  C LC    E+  HL+  C+ A   W+  F         F  +A       S  S    +  K+
Subjt:  WKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFP-------FFTDFASFFRSADSGESFWHGICTKI

Query:  PRDEYFKAIIIVWNIWKCRNSVLHNKIK
               A  I++NIW+ RN+VLHN ++
Subjt:  PRDEYFKAIIIVWNIWKCRNSVLHNKIK

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.7e-0428.03Show/hide
Query:  PWIETWKLNVDGSWKEDSGNGGVGWLGMQMGEL-----RRKCANV-----MECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNI
        P ++   +  D +WK ++ + G GW+     EL     +    NV      E + +   L++    G+    ++ + SDS  +I  +   S   TE   I
Subjt:  PWIETWKLNVDGSWKEDSGNGGVGWLGMQMGEL-----RRKCANV-----MECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNI

Query:  VADIREMIPRFRSISFSFCRRECNRAADLLAK
        + DI  +   F  +SFSF  R  NR AD LAK
Subjt:  VADIREMIPRFRSISFSFCRRECNRAADLLAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTTATTAAATTTTGATAAGGCTTCTCCCTCTAACAATTCCATGATGGCCAAGATATGGAATTCCTTTTGGAAGGTTGAGATTGCTCCTATGATAAAAATGGGGGT
CTGGAATGTGTTAAAGAATTTTATTCCATCCAAAGCGAACATTCAAGCGAAGGAAGTGGATATCAACAGTAGCTGTATATTGTGTGGGAGGGCAAATGAAGCAAATTTGC
ACTTGGTCAAAGAGTGCAAGCTCGCAATTCATGCTTGGAAATCTATTTTTCCTTTCTTTACTGATTTTGCTTCTTTTTTCAGGTCAGCTGATTCAGGGGAAAGCTTTTGG
CATGGGATCTGTACAAAGATCCCTAGGGATGAATATTTTAAAGCAATTATTATTGTGTGGAATATTTGGAAATGTCGTAATTCGGTGTTGCATAACAAAATCAAGGCTAA
CAAAGAGACTTTGGAAAATTTAATTTCAAATTCTATAAATTTATGTAGTGTTTCTTCAGTCTCAGGTTCTAGTGTGGATATGCCTACCTCTTTGCCACGCTCGTGTGATA
GATGGACGCCGCCGTGGATCGAAACTTGGAAATTGAATGTTGATGGTTCGTGGAAGGAGGACAGTGGTAATGGCGGAGTGGGGTGGTTGGGGATGCAAATGGGGGAATTA
AGGCGCAAATGCGCAAATGTGATGGAGTGTTTGGTCATTCGAGATGGGCTTAGACACATTCTGGATTGTGGCGTTGATCTTCCCAACGAGATCTTGGTGGAATCAGATTC
AAGCGCGATCATTTGTTTATTAAATGGGGTTTCTCAAGATGTTACTGAAATCAGCAATATCGTAGCTGATATTAGGGAGATGATCCCTCGGTTTAGGAGTATATCCTTTT
CTTTTTGTCGTAGGGAGTGCAATAGAGCTGCTGATTTGTTAGCAAAGCTCGGTCCTCCTCTGGATTTGTTGTTTTGTTTTTCTTGA
mRNA sequenceShow/hide mRNA sequence
GTTCAAATGGGGTCTTTAGTGTTAAAAGTGCTTATCACCTTGCTATGAACTTATTAAATTTTGATAAGGCTTCTCCCTCTAACAATTCCATGATGGCCAAGATATGGAAT
TCCTTTTGGAAGGTTGAGATTGCTCCTATGATAAAAATGGGGGTCTGGAATGTGTTAAAGAATTTTATTCCATCCAAAGCGAACATTCAAGCGAAGGAAGTGGATATCAA
CAGTAGCTGTATATTGTGTGGGAGGGCAAATGAAGCAAATTTGCACTTGGTCAAAGAGTGCAAGCTCGCAATTCATGCTTGGAAATCTATTTTTCCTTTCTTTACTGATT
TTGCTTCTTTTTTCAGGTCAGCTGATTCAGGGGAAAGCTTTTGGCATGGGATCTGTACAAAGATCCCTAGGGATGAATATTTTAAAGCAATTATTATTGTGTGGAATATT
TGGAAATGTCGTAATTCGGTGTTGCATAACAAAATCAAGGCTAACAAAGAGACTTTGGAAAATTTAATTTCAAATTCTATAAATTTATGTAGTGTTTCTTCAGTCTCAGG
TTCTAGTGTGGATATGCCTACCTCTTTGCCACGCTCGTGTGATAGATGGACGCCGCCGTGGATCGAAACTTGGAAATTGAATGTTGATGGTTCGTGGAAGGAGGACAGTG
GTAATGGCGGAGTGGGGTGGTTGGGGATGCAAATGGGGGAATTAAGGCGCAAATGCGCAAATGTGATGGAGTGTTTGGTCATTCGAGATGGGCTTAGACACATTCTGGAT
TGTGGCGTTGATCTTCCCAACGAGATCTTGGTGGAATCAGATTCAAGCGCGATCATTTGTTTATTAAATGGGGTTTCTCAAGATGTTACTGAAATCAGCAATATCGTAGC
TGATATTAGGGAGATGATCCCTCGGTTTAGGAGTATATCCTTTTCTTTTTGTCGTAGGGAGTGCAATAGAGCTGCTGATTTGTTAGCAAAGCTCGGTCCTCCTCTGGATT
TGTTGTTTTGTTTTTCTTGA
Protein sequenceShow/hide protein sequence
MNLLNFDKASPSNNSMMAKIWNSFWKVEIAPMIKMGVWNVLKNFIPSKANIQAKEVDINSSCILCGRANEANLHLVKECKLAIHAWKSIFPFFTDFASFFRSADSGESFW
HGICTKIPRDEYFKAIIIVWNIWKCRNSVLHNKIKANKETLENLISNSINLCSVSSVSGSSVDMPTSLPRSCDRWTPPWIETWKLNVDGSWKEDSGNGGVGWLGMQMGEL
RRKCANVMECLVIRDGLRHILDCGVDLPNEILVESDSSAIICLLNGVSQDVTEISNIVADIREMIPRFRSISFSFCRRECNRAADLLAKLGPPLDLLFCFS