; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G004545 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G004545
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase
Genome locationCG_Chr08:14329106..14330314
RNA-Seq ExpressionClCG08G004545
SyntenyClCG08G004545
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN21773.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]5.3e-7151.88Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G RTAAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVS----NFLQKNIFTRFRT---PRALISDE-----------------------
        VELFDVWGIDFMGPF+PS G+ YILVAVDYVSKWVEA +   ND+  V+      LQ N    FR      A I  E                       
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVS----NFLQKNIFTRFRT---PRALISDE-----------------------

Query:  ---------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
                 GKLK+RWSGPF I E+FPHGAVE  NE+  N FKVN QR+K Y+G  ++    ++ L
Subjt:  ---------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.0e-6640.35Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     I  QCH SPY GHF   RTAAKILQSG+FWP LF+D   +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-----------------------------
        VELFDVWGIDFMGPF+PS G+ YILVAVDY+SKWVEA++   ND+  V NF++KNIFTRF TPRA+ISD G                             
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-----------------------------

Query:  --------------------------------------------------------------------------------------KLKTRWSGPFVIKE
                                                                                              KLK+RWS PF I E
Subjt:  --------------------------------------------------------------------------------------KLKTRWSGPFVIKE

Query:  IFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
        + PHGAVE  N++  N FKVN QR+K Y+   ++    ++ L
Subjt:  IFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

WP_217833161.1 DDE-type integrase/transposase/recombinase, partial [Synechococcus sp. PCC 7002]4.3e-7374.71Show/hide
Query:  HDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEV
        H++KFY WDEP LY+ G DHI R C+PE     IL  CH++PY GHFGGQRTAAK+LQSGYFWPTLF+DAR YA+ CDRCQR GNIS+RNEMPL S+LEV
Subjt:  HDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEV

Query:  ELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
        ELFDVWGIDFMGPF PS G+ YILVAVDYVSKWVEA +CA+NDA TVS FL+K IF+RF TPRA+ISDEG
Subjt:  ELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]2.3e-6655.66Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        M DAKFY+WDEP L++   D ++R CIPE   ++IL +CH+S Y GHF G++TA ++L SG++WP+LF+DA ++  RCDRCQR GNI  R EMPL++I+E
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-KLKTRWSGPFVIKEIFPHGAVEWMNEDD
        VELFDVWGIDFMGPF+PS+G  YILVAVDYVSKWVEA++C RNDA TV NFL+K IF+RF TPRA+ISDEG     R  G  + K    H      +   
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-KLKTRWSGPFVIKEIFPHGAVEWMNEDD

Query:  TNAFKVNGQRVK
            + N +++K
Subjt:  TNAFKVNGQRVK

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]8.3e-0861.82Show/hide
Query:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAV
        GKLK+RWSGPF I  + P GA+E +NE D   F VNGQRVK YFGE  E + VAV
Subjt:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAV

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]1.5e-6545.87Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + DA+ Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G RTAAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDA----------------------------------------VTVSNFLQKNIF--T
        VELFDVW IDFMGPF+PS G+ YILVAVDY+SKWVEA++   ND+                                        V VSN   K I   T
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDA----------------------------------------VTVSNFLQKNIF--T

Query:  RFRTP--RALISDE--------------------------------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVA
           TP   A I  E                                GKLK+RWSGPF I E+FPHGAVE  NE+  N FKVN QR+K Y+   +     +
Subjt:  RFRTP--RALISDE--------------------------------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVA

Query:  VDL
        + L
Subjt:  VDL

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase1.6e-6567.25Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G RTAAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
        VELFDVWGIDFMGPF+PS G+ YILVAVDYVSKWVEA +   ND+  V NF++KNIFTRF TPRA+ISD G
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

A0A2G9FWY3 Reverse transcriptase6.6e-1156.14Show/hide
Query:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
        GKLK+RWSGPF I E+FPHGAVE  N++  N FKVN QR+K Y+GE ++    ++ L
Subjt:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

A0A2G9FWY3 Reverse transcriptase3.6e-6568.42Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        +HD KFY WDEP LYKRG D + R C+PE   +++L  CHDS Y GHF G RTAAK+LQSG FWPTLF+DA  Y  RCDRCQR GNIS RNEMP   +LE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
        VE+FDVWGIDFMGPF  S    YILVAVDYVSKWVEAI+   NDA  V +FL+KNIF+RF  PRALISDEG
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

A0A2G9HK33 Reverse transcriptase7.2e-6645.87Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + DA+ Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G RTAAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDA----------------------------------------VTVSNFLQKNIF--T
        VELFDVW IDFMGPF+PS G+ YILVAVDY+SKWVEA++   ND+                                        V VSN   K I   T
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDA----------------------------------------VTVSNFLQKNIF--T

Query:  RFRTP--RALISDE--------------------------------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVA
           TP   A I  E                                GKLK+RWSGPF I E+FPHGAVE  NE+  N FKVN QR+K Y+   +     +
Subjt:  RFRTP--RALISDE--------------------------------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVA

Query:  VDL
        + L
Subjt:  VDL

A0A2G9HWC5 DNA-directed DNA polymerase2.6e-7151.88Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G RTAAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVS----NFLQKNIFTRFRT---PRALISDE-----------------------
        VELFDVWGIDFMGPF+PS G+ YILVAVDYVSKWVEA +   ND+  V+      LQ N    FR      A I  E                       
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVS----NFLQKNIFTRFRT---PRALISDE-----------------------

Query:  ---------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
                 GKLK+RWSGPF I E+FPHGAVE  NE+  N FKVN QR+K Y+G  ++    ++ L
Subjt:  ---------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

A0A2G9HWF8 Reverse transcriptase5.0e-6740.35Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     I  QCH SPY GHF   RTAAKILQSG+FWP LF+D   +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-----------------------------
        VELFDVWGIDFMGPF+PS G+ YILVAVDY+SKWVEA++   ND+  V NF++KNIFTRF TPRA+ISD G                             
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-----------------------------

Query:  --------------------------------------------------------------------------------------KLKTRWSGPFVIKE
                                                                                              KLK+RWS PF I E
Subjt:  --------------------------------------------------------------------------------------KLKTRWSGPFVIKE

Query:  IFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
        + PHGAVE  N++  N FKVN QR+K Y+   ++    ++ L
Subjt:  IFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

A0A2K3NJZ5 Integrase catalytic domain-containing protein (Fragment)1.1e-1069.57Show/hide
Query:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGE
        GKLK+RWSGPF IK++FPHGAVE  + D    FKVNGQR+KPYFG+
Subjt:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGE

SwissProt top hitse value%identityAlignment
P10394 Retrovirus-related Pol polyprotein from transposon 4124.9e-1128.08Show/hide
Query:  ETSYQRILSQCHDSPYE-GHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNIS-SRNEMPLTSILEVELFDVWGIDFMGPFLPS-NGHNYIL
        E   + ILS  HD P + GH G  +T AK+ +  Y+W  + +  ++Y  +C +CQ+      ++  M +T   E   FD   +D +GP   S NG+ Y +
Subjt:  ETSYQRILSQCHDSPYE-GHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNIS-SRNEMPLTSILEVELFDVWGIDFMGPFLPS-NGHNYIL

Query:  VAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
          +  ++K++ AI  A   A TV+  + ++   ++   +  I+D G
Subjt:  VAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

P14350 Pro-Pol polyprotein1.1e-1027.98Show/hide
Query:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD
        Y+ ++ ++    P+ + ++  P++  Q+I+ Q H+     H G + T  KI    Y+WP + +D      RC +C  I N S++   P L      + FD
Subjt:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD

Query:  VWGIDFMGPFLPSNGHNYILVAVDYVS--KWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
         + ID++GP  PS G+ Y+LV VD ++   W+         A   S     N+ T    P+ + SD+G
Subjt:  VWGIDFMGPFLPSNGHNYILVAVDYVS--KWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

P23074 Pro-Pol polyprotein2.9e-1129.94Show/hide
Query:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD
        Y  +E +L    P+ I R+  P+   ++I+S  H+     H G   T  K+  S Y+WP L +D      +C +C  + N ++    P L  +  ++ FD
Subjt:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD

Query:  VWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAI-SCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
         + ID++GP  PSNG+ ++LV VD ++ +V    + A + + TV      N+ T    P+ L SD+G
Subjt:  VWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAI-SCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

P92516 Uncharacterized mitochondrial protein AtMg007504.3e-1560.71Show/hide
Query:  ILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFM
        +LQ+G++WPT F+DA  +   CD CQR GN + RNEMP   ILEVE+FDVWGI FM
Subjt:  ILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFM

Q87040 Pro-Pol polyprotein3.7e-1128.57Show/hide
Query:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD
        YY ++ ++    P+ + ++  P++  Q+I+ Q H+     H G + T  KI    Y+WP + +D      RC +C  I N S++   P L      + FD
Subjt:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD

Query:  VWGIDFMGPFLPSNGHNYILVAVDYVS--KWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
         + ID++GP  PS G+ Y+LV VD ++   W+         A   S     N+ T    P+ + SD+G
Subjt:  VWGIDFMGPFLPSNGHNYILVAVDYVS--KWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein3.0e-1660.71Show/hide
Query:  ILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFM
        +LQ+G++WPT F+DA  +   CD CQR GN + RNEMP   ILEVE+FDVWGI FM
Subjt:  ILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGACGCGAAGTTTTATTATTGGGACGAGCCCCAGCTCTACAAAAGGGGGCCGGATCACATCTTCAGACTCTGCATCCCAGAGACTTCGTATCAACGCATCCTATC
TCAATGTCATGACTCCCCCTATGAAGGGCATTTTGGAGGACAACGAACTGCAGCCAAGATATTGCAAAGCGGATACTTCTGGCCAACTCTTTTTAGAGATGCCAGGGACT
ATGCGATCAGATGCGACAGATGTCAACGCATCGGAAATATCTCAAGTCGGAACGAAATGCCACTTACCTCTATTTTAGAAGTCGAGCTCTTTGACGTTTGGGGTATTGAT
TTCATGGGCCCATTCCTGCCATCAAACGGCCACAACTACATACTGGTAGCGGTGGACTATGTATCAAAATGGGTTGAAGCAATCTCATGCGCCAGGAATGACGCGGTAAC
AGTCTCAAACTTTCTTCAGAAGAATATATTTACGAGATTCAGAACGCCGAGAGCCCTTATTAGCGATGAAGGGAAATTAAAAACAAGATGGTCTGGTCCTTTTGTGATCA
AGGAAATCTTTCCTCATGGTGCCGTAGAATGGATGAATGAAGATGACACCAACGCATTCAAAGTTAATGGTCAGCGTGTGAAACCATATTTTGGAGAATGCATTGAACAC
GACAAAGTAGCGGTTGACCTAGCAAAAATCGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCATGACGCGAAGTTTTATTATTGGGACGAGCCCCAGCTCTACAAAAGGGGGCCGGATCACATCTTCAGACTCTGCATCCCAGAGACTTCGTATCAACGCATCCTATC
TCAATGTCATGACTCCCCCTATGAAGGGCATTTTGGAGGACAACGAACTGCAGCCAAGATATTGCAAAGCGGATACTTCTGGCCAACTCTTTTTAGAGATGCCAGGGACT
ATGCGATCAGATGCGACAGATGTCAACGCATCGGAAATATCTCAAGTCGGAACGAAATGCCACTTACCTCTATTTTAGAAGTCGAGCTCTTTGACGTTTGGGGTATTGAT
TTCATGGGCCCATTCCTGCCATCAAACGGCCACAACTACATACTGGTAGCGGTGGACTATGTATCAAAATGGGTTGAAGCAATCTCATGCGCCAGGAATGACGCGGTAAC
AGTCTCAAACTTTCTTCAGAAGAATATATTTACGAGATTCAGAACGCCGAGAGCCCTTATTAGCGATGAAGGGAAATTAAAAACAAGATGGTCTGGTCCTTTTGTGATCA
AGGAAATCTTTCCTCATGGTGCCGTAGAATGGATGAATGAAGATGACACCAACGCATTCAAAGTTAATGGTCAGCGTGTGAAACCATATTTTGGAGAATGCATTGAACAC
GACAAAGTAGCGGTTGACCTAGCAAAAATCGAATGA
Protein sequenceShow/hide protein sequence
MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRTAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGID
FMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEGKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEH
DKVAVDLAKIE