; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G04450 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G04450
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationClcChr08:13437977..13439185
RNA-Seq ExpressionClc08G04450
SyntenyClc08G04450
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN21773.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.0e-7051.5Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G R AAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVS----NFLQKNIFTRFRT---PRALISDE-----------------------
        VELFDVWGIDFMGPF+PS G+ YILVAVDYVSKWVEA +   ND+  V+      LQ N    FR      A I  E                       
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVS----NFLQKNIFTRFRT---PRALISDE-----------------------

Query:  ---------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
                 GKLK+RWSGPF I E+FPHGAVE  NE+  N FKVN QR+K Y+G  ++    ++ L
Subjt:  ---------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.9e-6640.06Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     I  QCH SPY GHF   R AAKILQSG+FWP LF+D   +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-----------------------------
        VELFDVWGIDFMGPF+PS G+ YILVAVDY+SKWVEA++   ND+  V NF++KNIFTRF TPRA+ISD G                             
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-----------------------------

Query:  --------------------------------------------------------------------------------------KLKTRWSGPFVIKE
                                                                                              KLK+RWS PF I E
Subjt:  --------------------------------------------------------------------------------------KLKTRWSGPFVIKE

Query:  IFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
        + PHGAVE  N++  N FKVN QR+K Y+   ++    ++ L
Subjt:  IFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

WP_217833161.1 DDE-type integrase/transposase/recombinase, partial [Synechococcus sp. PCC 7002]2.1e-7274.12Show/hide
Query:  HDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEV
        H++KFY WDEP LY+ G DHI R C+PE     IL  CH++PY GHFGGQR AAK+LQSGYFWPTLF+DAR YA+ CDRCQR GNIS+RNEMPL S+LEV
Subjt:  HDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEV

Query:  ELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
        ELFDVWGIDFMGPF PS G+ YILVAVDYVSKWVEA +CA+NDA TVS FL+K IF+RF TPRA+ISDEG
Subjt:  ELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]8.7e-6655.19Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        M DAKFY+WDEP L++   D ++R CIPE   ++IL +CH+S Y GHF G++ A ++L SG++WP+LF+DA ++  RCDRCQR GNI  R EMPL++I+E
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-KLKTRWSGPFVIKEIFPHGAVEWMNEDD
        VELFDVWGIDFMGPF+PS+G  YILVAVDYVSKWVEA++C RNDA TV NFL+K IF+RF TPRA+ISDEG     R  G  + K    H      +   
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-KLKTRWSGPFVIKEIFPHGAVEWMNEDD

Query:  TNAFKVNGQRVK
            + N +++K
Subjt:  TNAFKVNGQRVK

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]8.3e-0861.82Show/hide
Query:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAV
        GKLK+RWSGPF I  + P GA+E +NE D   F VNGQRVK YFGE  E + VAV
Subjt:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAV

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]5.7e-6545.54Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + DA+ Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G R AAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDA----------------------------------------VTVSNFLQKNIF--T
        VELFDVW IDFMGPF+PS G+ YILVAVDY+SKWVEA++   ND+                                        V VSN   K I   T
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDA----------------------------------------VTVSNFLQKNIF--T

Query:  RFRTP--RALISDE--------------------------------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVA
           TP   A I  E                                GKLK+RWSGPF I E+FPHGAVE  NE+  N FKVN QR+K Y+   +     +
Subjt:  RFRTP--RALISDE--------------------------------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVA

Query:  VDL
        + L
Subjt:  VDL

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase6.1e-6566.67Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G R AAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
        VELFDVWGIDFMGPF+PS G+ YILVAVDYVSKWVEA +   ND+  V NF++KNIFTRF TPRA+ISD G
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

A0A2G9FWY3 Reverse transcriptase6.6e-1156.14Show/hide
Query:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
        GKLK+RWSGPF I E+FPHGAVE  N++  N FKVN QR+K Y+GE ++    ++ L
Subjt:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

A0A2G9FWY3 Reverse transcriptase1.4e-6467.84Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        +HD KFY WDEP LYKRG D + R C+PE   +++L  CHDS Y GHF G R AAK+LQSG FWPTLF+DA  Y  RCDRCQR GNIS RNEMP   +LE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
        VE+FDVWGIDFMGPF  S    YILVAVDYVSKWVEAI+   NDA  V +FL+KNIF+RF  PRALISDEG
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

A0A2G9HK33 Reverse transcriptase2.7e-6545.54Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + DA+ Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G R AAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDA----------------------------------------VTVSNFLQKNIF--T
        VELFDVW IDFMGPF+PS G+ YILVAVDY+SKWVEA++   ND+                                        V VSN   K I   T
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDA----------------------------------------VTVSNFLQKNIF--T

Query:  RFRTP--RALISDE--------------------------------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVA
           TP   A I  E                                GKLK+RWSGPF I E+FPHGAVE  NE+  N FKVN QR+K Y+   +     +
Subjt:  RFRTP--RALISDE--------------------------------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVA

Query:  VDL
        + L
Subjt:  VDL

A0A2G9HWC5 DNA-directed DNA polymerase9.7e-7151.5Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     IL QCH SPY GHF G R AAKILQSG+FWP LF+DA  +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVS----NFLQKNIFTRFRT---PRALISDE-----------------------
        VELFDVWGIDFMGPF+PS G+ YILVAVDYVSKWVEA +   ND+  V+      LQ N    FR      A I  E                       
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVS----NFLQKNIFTRFRT---PRALISDE-----------------------

Query:  ---------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
                 GKLK+RWSGPF I E+FPHGAVE  NE+  N FKVN QR+K Y+G  ++    ++ L
Subjt:  ---------GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

A0A2G9HWF8 Reverse transcriptase1.9e-6640.06Show/hide
Query:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE
        + D + Y+WD+P L+K+GPD+I R C+PE     I  QCH SPY GHF   R AAKILQSG+FWP LF+D   +   CDRCQR GNIS R+EMPL +ILE
Subjt:  MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILE

Query:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-----------------------------
        VELFDVWGIDFMGPF+PS G+ YILVAVDY+SKWVEA++   ND+  V NF++KNIFTRF TPRA+ISD G                             
Subjt:  VELFDVWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG-----------------------------

Query:  --------------------------------------------------------------------------------------KLKTRWSGPFVIKE
                                                                                              KLK+RWS PF I E
Subjt:  --------------------------------------------------------------------------------------KLKTRWSGPFVIKE

Query:  IFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL
        + PHGAVE  N++  N FKVN QR+K Y+   ++    ++ L
Subjt:  IFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEHDKVAVDL

A0A2K3NJZ5 Integrase catalytic domain-containing protein (Fragment)1.1e-1069.57Show/hide
Query:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGE
        GKLK+RWSGPF IK++FPHGAVE  + D    FKVNGQR+KPYFG+
Subjt:  GKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGE

SwissProt top hitse value%identityAlignment
P10394 Retrovirus-related Pol polyprotein from transposon 4121.9e-1027.4Show/hide
Query:  ETSYQRILSQCHDSPYE-GHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNIS-SRNEMPLTSILEVELFDVWGIDFMGPFLPS-NGHNYIL
        E   + ILS  HD P + GH G  +  AK+ +  Y+W  + +  ++Y  +C +CQ+      ++  M +T   E   FD   +D +GP   S NG+ Y +
Subjt:  ETSYQRILSQCHDSPYE-GHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNIS-SRNEMPLTSILEVELFDVWGIDFMGPFLPS-NGHNYIL

Query:  VAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
          +  ++K++ AI  A   A TV+  + ++   ++   +  I+D G
Subjt:  VAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

P14350 Pro-Pol polyprotein4.1e-1027.38Show/hide
Query:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD
        Y+ ++ ++    P+ + ++  P++  Q+I+ Q H+     H G +    KI    Y+WP + +D      RC +C  I N S++   P L      + FD
Subjt:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD

Query:  VWGIDFMGPFLPSNGHNYILVAVDYVS--KWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
         + ID++GP  PS G+ Y+LV VD ++   W+         A   S     N+ T    P+ + SD+G
Subjt:  VWGIDFMGPFLPSNGHNYILVAVDYVS--KWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

P23074 Pro-Pol polyprotein1.1e-1029.34Show/hide
Query:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD
        Y  +E +L    P+ I R+  P+   ++I+S  H+     H G      K+  S Y+WP L +D      +C +C  + N ++    P L  +  ++ FD
Subjt:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD

Query:  VWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAI-SCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
         + ID++GP  PSNG+ ++LV VD ++ +V    + A + + TV      N+ T    P+ L SD+G
Subjt:  VWGIDFMGPFLPSNGHNYILVAVDYVSKWVEAI-SCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

P92516 Uncharacterized mitochondrial protein AtMg007504.3e-1560.71Show/hide
Query:  ILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFM
        +LQ+G++WPT F+DA  +   CD CQR GN + RNEMP   ILEVE+FDVWGI FM
Subjt:  ILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFM

Q87040 Pro-Pol polyprotein1.4e-1027.98Show/hide
Query:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD
        YY ++ ++    P+ + ++  P++  Q+I+ Q H+     H G +    KI    Y+WP + +D      RC +C  I N S++   P L      + FD
Subjt:  YYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFD

Query:  VWGIDFMGPFLPSNGHNYILVAVDYVS--KWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG
         + ID++GP  PS G+ Y+LV VD ++   W+         A   S     N+ T    P+ + SD+G
Subjt:  VWGIDFMGPFLPSNGHNYILVAVDYVS--KWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEG

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein3.0e-1660.71Show/hide
Query:  ILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFM
        +LQ+G++WPT F+DA  +   CD CQR GN + RNEMP   ILEVE+FDVWGI FM
Subjt:  ILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGACGCGAAGTTTTATTATTGGGACGAGCCCCAGCTCTACAAAAGGGGGCCGGATCACATCTTCAGACTCTGCATCCCAGAGACTTCGTATCAACGCATCCTATC
TCAATGTCATGACTCCCCCTATGAAGGGCATTTTGGAGGACAACGAATTGCAGCCAAGATATTGCAAAGCGGATACTTCTGGCCAACTCTTTTTAGAGATGCCAGGGACT
ATGCGATCAGATGCGACAGATGTCAACGCATCGGAAATATCTCAAGTCGGAACGAAATGCCACTTACCTCTATTTTAGAAGTCGAGCTCTTTGACGTTTGGGGTATTGAT
TTCATGGGCCCATTCCTGCCATCAAACGGCCACAACTACATACTGGTAGCGGTGGACTATGTATCAAAATGGGTTGAAGCAATCTCATGCGCCAGGAATGACGCGGTGAC
AGTCTCAAACTTTCTTCAGAAGAATATATTTACGAGATTCAGAACGCCGAGAGCCCTTATTAGCGATGAAGGGAAATTAAAAACAAGATGGTCTGGTCCTTTTGTGATCA
AGGAAATCTTTCCTCATGGTGCCGTAGAATGGATGAATGAAGATGACACCAACGCATTCAAAGTTAATGGTCAGCGTGTGAAACCATATTTTGGAGAATGCATTGAACAC
GACAAAGTAGCGGTTGACCTAGCAAAAATCGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCATGACGCGAAGTTTTATTATTGGGACGAGCCCCAGCTCTACAAAAGGGGGCCGGATCACATCTTCAGACTCTGCATCCCAGAGACTTCGTATCAACGCATCCTATC
TCAATGTCATGACTCCCCCTATGAAGGGCATTTTGGAGGACAACGAATTGCAGCCAAGATATTGCAAAGCGGATACTTCTGGCCAACTCTTTTTAGAGATGCCAGGGACT
ATGCGATCAGATGCGACAGATGTCAACGCATCGGAAATATCTCAAGTCGGAACGAAATGCCACTTACCTCTATTTTAGAAGTCGAGCTCTTTGACGTTTGGGGTATTGAT
TTCATGGGCCCATTCCTGCCATCAAACGGCCACAACTACATACTGGTAGCGGTGGACTATGTATCAAAATGGGTTGAAGCAATCTCATGCGCCAGGAATGACGCGGTGAC
AGTCTCAAACTTTCTTCAGAAGAATATATTTACGAGATTCAGAACGCCGAGAGCCCTTATTAGCGATGAAGGGAAATTAAAAACAAGATGGTCTGGTCCTTTTGTGATCA
AGGAAATCTTTCCTCATGGTGCCGTAGAATGGATGAATGAAGATGACACCAACGCATTCAAAGTTAATGGTCAGCGTGTGAAACCATATTTTGGAGAATGCATTGAACAC
GACAAAGTAGCGGTTGACCTAGCAAAAATCGAATGA
Protein sequenceShow/hide protein sequence
MHDAKFYYWDEPQLYKRGPDHIFRLCIPETSYQRILSQCHDSPYEGHFGGQRIAAKILQSGYFWPTLFRDARDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGID
FMGPFLPSNGHNYILVAVDYVSKWVEAISCARNDAVTVSNFLQKNIFTRFRTPRALISDEGKLKTRWSGPFVIKEIFPHGAVEWMNEDDTNAFKVNGQRVKPYFGECIEH
DKVAVDLAKIE