; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020890 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020890
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold9:2990621..2996546
RNA-Seq ExpressionSpg020890
SyntenySpg020890
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.4e-2426.87Show/hide
Query:  EKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQT
        E ++LW+K I++ Y  +H G    + +    N  W  I      +E    ++ N+G S+ FW  +W  + PL      L+A+S  + A++ E W   S  
Subjt:  EKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQT

Query:  WNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC--FLSMTSASPKLNQPSSSL--IWKHKSPKKVKMLLWSLVYRSLNTDEM
        WN+  +R L ERE   W ++   +  +    G    +W    S  ++  S        S+ PK       L  +W+   P+K K  +W++V++ LNT + 
Subjt:  WNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC--FLSMTSASPKLNQPSSSL--IWKHKSPKKVKMLLWSLVYRSLNTDEM

Query:  LQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVANLLGISFCKPKKIEDWLLEGAHLCR
        +Q++     L+PS C  C  + + ++HLF+ C FA + WN  ++  G        ++D  L+   LCR
Subjt:  LQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVANLLGISFCKPKKIEDWLLEGAHLCR

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.3e-5126.07Show/hide
Query:  LVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNRGWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLLLHLWRLN
        +V+ +R  HDDW +IL  +++Q +     N FH +KAL+   S   A LL +N+GW + G + ++ E+W+   H+    +P  GGW   R + LHLW + 
Subjt:  LVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNRGWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLLLHLWRLN

Query:  VFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAF--HRGPNDPFF---
         F+ IG    G I+  E        ++ ++K+R NY GF+PA VRI D   N + +Q+VT  +G  LI R   +HG+F   AA +F      ++ FF   
Subjt:  VFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAF--HRGPNDPFF---

Query:  ----CPVDIWRIEDG---------------IVYPMVNIQERS----ESLNKENRQLDGGTELFEISRQQSFLGFSEKAGPSKVH-----KSQAQVEESQQ
             P  +    DG               I+ P  N    S    E +N  N          EI    S  G  +K G  KV       S   +++S++
Subjt:  ----CPVDIWRIEDG---------------IVYPMVNIQERS----ESLNKENRQLDGGTELFEISRQQSFLGFSEKAGPSKVH-----KSQAQVEESQQ

Query:  AQS-------------DKNDSGESP----PPSNNKKGKESRLRKRPS--RPTLKGEPSSVRLAKPPISGKRKDSSPQLQEDEIGINLEDGMVNQQIDSPR
          S             D   +  SP    P    K  +E  ++K+ S  +P  K   +       PI     D     +   + ++L D        S  
Subjt:  AQS-------------DKNDSGESP----PPSNNKKGKESRLRKRPS--RPTLKGEPSSVRLAKPPISGKRKDSSPQLQEDEIGINLEDGMVNQQIDSPR

Query:  KEILEKPCMEYDIENS---PKALACIQPIEEASNSNSQC------------------------IEGFAISKEVVMTLSKN-----------------NLC
                   DI N+   P+      P+ E SNS+S+                          +  A  K++V  L KN                 N+ 
Subjt:  KEILEKPCMEYDIENS---PKALACIQPIEEASNSNSQC------------------------IEGFAISKEVVMTLSKN-----------------NLC

Query:  IRPIVGANNKKDRRLVKSIWSSRYIAWIALDAINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLC
        +  +       ++R++KS+W S  I WIA +A  S+GGIL +W      +L    G+FS++        +  W++G+YGP   ++R  F  EL +L  L 
Subjt:  IRPIVGANNKKDRRLVKSIWSSRYIAWIALDAINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLC

Query:  QGIWCIVGDFNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM
           W + GD N++R  +E  +   S+ +    N FI ++ L++PP+ N  F+WS +
Subjt:  QGIWCIVGDFNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]8.7e-1927.27Show/hide
Query:  LAKWKEILLLVMEGS-GL-RLNLLKYAL--------IGEKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFW
        L  W ++     EG  G+ RLN+   AL        + E +ALWR++I+  Y     G   + +        W  I  +   F+    + +NNG  I FW
Subjt:  LAKWKEILLLVMEGS-GL-RLNLLKYAL--------IGEKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFW

Query:  EDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQTWNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC------FLSMT
           W ++  L + +  LFA++  K+ S+ + W      WN+ F+R L +RE  NW  ++E +       G    +W  + +  FS  S        L  T
Subjt:  EDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQTWNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC------FLSMT

Query:  SASPKLNQPSSSLIWKHKSPKKVKMLLWSLVYRSLNTDEMLQRKFGGWMLSPS
           P+       +IWK   P K+K  +W L+ R +NT E++Q+K    +L P+
Subjt:  SASPKLNQPSSSLIWKHKSPKKVKMLLWSLVYRSLNTDEMLQRKFGGWMLSPS

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]8.6e-5125.71Show/hide
Query:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL
        + + +++T+R  HDDW RI+  +++Q +      PF  DKA+L   S+  A+LL  N+   GW + G + +K E W++  HS  S +P  GGW++ R + 
Subjt:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL

Query:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----
        LHLW  N F+ IG   GGF++  +    + + +D K+K+R NY GF+PA + I D    ++I+  V   +   L+ R   +HGSF + AA  F +     
Subjt:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----

Query:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF
                                                        DPF   +   R E G    ++N Q      +K ++++         +R+ SF
Subjt:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF

Query:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------
        L       P  +  + +  E + + +S +    ND  E    P    K     R++K P   T   E  ++ L +     K+ + S  +           
Subjt:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------

Query:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS
         E+  G++  +       ++  DS   + L     E   +N   + +  +   + + + S+     A  +++V+ L +N L + P    N+         
Subjt:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS

Query:  IWSSRYIAWIALDAINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGDFNMVRWSKE
        I S + +       +   GGIL +W D+  +V D  +G +S+++       N  W++ VYGP  Y DR +   EL  L  LC   W I GDFN+VRW +E
Subjt:  IWSSRYIAWIALDAINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGDFNMVRWSKE

Query:  RLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM
            S   ++MANFN FI  +EL++PP++N  F+WS +
Subjt:  RLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.6e-5025.5Show/hide
Query:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL
        + + +++T+R  HDDW RI+  +++Q +      PF  DKA+L   + + A+LL  N+   GW + G + +K E W++  HS  S +P  GGW++ R + 
Subjt:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL

Query:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----
        LHLW  N F+ IG   GGF++  +    + + +D K+K+R NY GF+PA + I D    ++I+  V   +   L+ R   +HGSF + AA  F +     
Subjt:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----

Query:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF
                                                        DPF   +   R E G    ++N Q+     +K ++++         +R+ SF
Subjt:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF

Query:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------
        L       P  +  + +  E + + +S +    ND  E    P    K     R++K P   T   E   + L +     K+ + S  +           
Subjt:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------

Query:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS
         E+  G++  +       ++  DS   + L     E   +N   + +  +   + + + S+     A  +++V+ L +N L + P          +    
Subjt:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS

Query:  IWSSRYIAWIALD---------AINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGD
        + SS Y   I  D          +   GGIL +W D+  +V D  +G +S+++       N  W++ VYGP  Y DR +   EL  L  LC   W I GD
Subjt:  IWSSRYIAWIALD---------AINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGD

Query:  FNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM
        FN+VRW +E    S   ++MANFN FI  +EL++PP +N  F+WS +
Subjt:  FNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.4e-2426.87Show/hide
Query:  EKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQT
        E ++LW+K I++ Y  +H G    + +    N  W  I      +E    ++ N+G S+ FW  +W  + PL      L+A+S  + A++ E W   S  
Subjt:  EKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQT

Query:  WNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC--FLSMTSASPKLNQPSSSL--IWKHKSPKKVKMLLWSLVYRSLNTDEM
        WN+  +R L ERE   W ++   +  +    G    +W    S  ++  S        S+ PK       L  +W+   P+K K  +W++V++ LNT + 
Subjt:  WNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC--FLSMTSASPKLNQPSSSL--IWKHKSPKKVKMLLWSLVYRSLNTDEM

Query:  LQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVANLLGISFCKPKKIEDWLLEGAHLCR
        +Q++     L+PS C  C  + + ++HLF+ C FA + WN  ++  G        ++D  L+   LCR
Subjt:  LQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVANLLGISFCKPKKIEDWLLEGAHLCR

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.6e-5025.5Show/hide
Query:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL
        + + +++T+R  HDDW RI+  +++Q +      PF  DKA+L   + + A+LL  N+   GW + G + +K E W++  HS  S +P  GGW++ R + 
Subjt:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL

Query:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----
        LHLW  N F+ IG   GGF++  +    + + +D K+K+R NY GF+PA + I D    ++I+  V   +   L+ R   +HGSF + AA  F +     
Subjt:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----

Query:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF
                                                        DPF   +   R E G    ++N Q+     +K ++++         +R+ SF
Subjt:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF

Query:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------
        L       P  +  + +  E + + +S +    ND  E    P    K     R++K P   T   E   + L +     K+ + S  +           
Subjt:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------

Query:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS
         E+  G++  +       ++  DS   + L     E   +N   + +  +   + + + S+     A  +++V+ L +N L + P          +    
Subjt:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS

Query:  IWSSRYIAWIALD---------AINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGD
        + SS Y   I  D          +   GGIL +W D+  +V D  +G +S+++       N  W++ VYGP  Y DR +   EL  L  LC   W I GD
Subjt:  IWSSRYIAWIALD---------AINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGD

Query:  FNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM
        FN+VRW +E    S   ++MANFN FI  +EL++PP +N  F+WS +
Subjt:  FNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia]9.5e-6647.76Show/hide
Query:  DEVRKIKWSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNRGWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQI
        +EVR++ W E +V+T+RD HDDW RIL  +++Q ++  +INPF  DKAL+KCPS++LA LL  N+GWV+FGP  +K+E WN   H R    P  G W++I
Subjt:  DEVRKIKWSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNRGWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQI

Query:  RNLLLHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDGNNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHRGP
        RN+ LHLW L  FKAIG+ LGGFI+Y+++NS  IEC D+ +K++ NYCGFIPAE+  +DG   +  ++V+F+D   L  +   IHG FSS AA +FH+G 
Subjt:  RNLLLHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDGNNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHRGP

Query:  NDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELF
         +     +D WR+E+G  YP VNIQ  +    K  R  +GG++L+
Subjt:  NDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELF

TrEMBL top hitse value%identityAlignment
A0A5A7US62 LINE-1 retrotransposable element ORF2 protein2.7e-5025.5Show/hide
Query:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL
        + + +++T+R  HDDW RI+  +++Q +      PF  DKA+L   + + A+LL  N+   GW + G + +K E W++  HS  S +P  GGW++ R + 
Subjt:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL

Query:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----
        LHLW  N F+ IG   GGF++  +    + + +D K+K+R NY GF+PA + I D    ++I+  V   +   L+ R   +HGSF + AA  F +     
Subjt:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----

Query:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF
                                                        DPF   +   R E G    ++N Q+     +K ++++         +R+ SF
Subjt:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF

Query:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------
        L       P  +  + +  E + + +S +    ND  E    P    K     R++K P   T   E   + L +     K+ + S  +           
Subjt:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------

Query:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS
         E+  G++  +       ++  DS   + L     E   +N   + +  +   + + + S+     A  +++V+ L +N L + P          +    
Subjt:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS

Query:  IWSSRYIAWIALD---------AINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGD
        + SS Y   I  D          +   GGIL +W D+  +V D  +G +S+++       N  W++ VYGP  Y DR +   EL  L  LC   W I GD
Subjt:  IWSSRYIAWIALD---------AINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGD

Query:  FNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM
        FN+VRW +E    S   ++MANFN FI  +EL++PP +N  F+WS +
Subjt:  FNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein1.1e-2426.87Show/hide
Query:  EKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQT
        E ++LW+K I++ Y  +H G    + +    N  W  I      +E    ++ N+G S+ FW  +W  + PL      L+A+S  + A++ E W   S  
Subjt:  EKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQT

Query:  WNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC--FLSMTSASPKLNQPSSSL--IWKHKSPKKVKMLLWSLVYRSLNTDEM
        WN+  +R L ERE   W ++   +  +    G    +W    S  ++  S        S+ PK       L  +W+   P+K K  +W++V++ LNT + 
Subjt:  WNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC--FLSMTSASPKLNQPSSSL--IWKHKSPKKVKMLLWSLVYRSLNTDEM

Query:  LQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVANLLGISFCKPKKIEDWLLEGAHLCR
        +Q++     L+PS C  C  + + ++HLF+ C FA + WN  ++  G        ++D  L+   LCR
Subjt:  LQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVANLLGISFCKPKKIEDWLLEGAHLCR

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein2.7e-5025.5Show/hide
Query:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL
        + + +++T+R  HDDW RI+  +++Q +      PF  DKA+L   + + A+LL  N+   GW + G + +K E W++  HS  S +P  GGW++ R + 
Subjt:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL

Query:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----
        LHLW  N F+ IG   GGF++  +    + + +D K+K+R NY GF+PA + I D    ++I+  V   +   L+ R   +HGSF + AA  F +     
Subjt:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----

Query:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF
                                                        DPF   +   R E G    ++N Q+     +K ++++         +R+ SF
Subjt:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF

Query:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------
        L       P  +  + +  E + + +S +    ND  E    P    K     R++K P   T   E   + L +     K+ + S  +           
Subjt:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------

Query:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS
         E+  G++  +       ++  DS   + L     E   +N   + +  +   + + + S+     A  +++V+ L +N L + P          +    
Subjt:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS

Query:  IWSSRYIAWIALD---------AINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGD
        + SS Y   I  D          +   GGIL +W D+  +V D  +G +S+++       N  W++ VYGP  Y DR +   EL  L  LC   W I GD
Subjt:  IWSSRYIAWIALD---------AINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGD

Query:  FNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM
        FN+VRW +E    S   ++MANFN FI  +EL++PP +N  F+WS +
Subjt:  FNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein1.1e-5126.07Show/hide
Query:  LVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNRGWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLLLHLWRLN
        +V+ +R  HDDW +IL  +++Q +     N FH +KAL+   S   A LL +N+GW + G + ++ E+W+   H+    +P  GGW   R + LHLW + 
Subjt:  LVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNRGWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLLLHLWRLN

Query:  VFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAF--HRGPNDPFF---
         F+ IG    G I+  E        ++ ++K+R NY GF+PA VRI D   N + +Q+VT  +G  LI R   +HG+F   AA +F      ++ FF   
Subjt:  VFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAF--HRGPNDPFF---

Query:  ----CPVDIWRIEDG---------------IVYPMVNIQERS----ESLNKENRQLDGGTELFEISRQQSFLGFSEKAGPSKVH-----KSQAQVEESQQ
             P  +    DG               I+ P  N    S    E +N  N          EI    S  G  +K G  KV       S   +++S++
Subjt:  ----CPVDIWRIEDG---------------IVYPMVNIQERS----ESLNKENRQLDGGTELFEISRQQSFLGFSEKAGPSKVH-----KSQAQVEESQQ

Query:  AQS-------------DKNDSGESP----PPSNNKKGKESRLRKRPS--RPTLKGEPSSVRLAKPPISGKRKDSSPQLQEDEIGINLEDGMVNQQIDSPR
          S             D   +  SP    P    K  +E  ++K+ S  +P  K   +       PI     D     +   + ++L D        S  
Subjt:  AQS-------------DKNDSGESP----PPSNNKKGKESRLRKRPS--RPTLKGEPSSVRLAKPPISGKRKDSSPQLQEDEIGINLEDGMVNQQIDSPR

Query:  KEILEKPCMEYDIENS---PKALACIQPIEEASNSNSQC------------------------IEGFAISKEVVMTLSKN-----------------NLC
                   DI N+   P+      P+ E SNS+S+                          +  A  K++V  L KN                 N+ 
Subjt:  KEILEKPCMEYDIENS---PKALACIQPIEEASNSNSQC------------------------IEGFAISKEVVMTLSKN-----------------NLC

Query:  IRPIVGANNKKDRRLVKSIWSSRYIAWIALDAINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLC
        +  +       ++R++KS+W S  I WIA +A  S+GGIL +W      +L    G+FS++        +  W++G+YGP   ++R  F  EL +L  L 
Subjt:  IRPIVGANNKKDRRLVKSIWSSRYIAWIALDAINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLC

Query:  QGIWCIVGDFNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM
           W + GD N++R  +E  +   S+ +    N FI ++ L++PP+ N  F+WS +
Subjt:  QGIWCIVGDFNMVRWSKERLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein4.2e-1927.27Show/hide
Query:  LAKWKEILLLVMEGS-GL-RLNLLKYAL--------IGEKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFW
        L  W ++     EG  G+ RLN+   AL        + E +ALWR++I+  Y     G   + +        W  I  +   F+    + +NNG  I FW
Subjt:  LAKWKEILLLVMEGS-GL-RLNLLKYAL--------IGEKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFW

Query:  EDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQTWNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC------FLSMT
           W ++  L + +  LFA++  K+ S+ + W      WN+ F+R L +RE  NW  ++E +       G    +W  + +  FS  S        L  T
Subjt:  EDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQTWNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC------FLSMT

Query:  SASPKLNQPSSSLIWKHKSPKKVKMLLWSLVYRSLNTDEMLQRKFGGWMLSPS
           P+       +IWK   P K+K  +W L+ R +NT E++Q+K    +L P+
Subjt:  SASPKLNQPSSSLIWKHKSPKKVKMLLWSLVYRSLNTDEMLQRKFGGWMLSPS

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein4.2e-5125.71Show/hide
Query:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL
        + + +++T+R  HDDW RI+  +++Q +      PF  DKA+L   S+  A+LL  N+   GW + G + +K E W++  HS  S +P  GGW++ R + 
Subjt:  WSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNR---GWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQIRNLL

Query:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----
        LHLW  N F+ IG   GGF++  +    + + +D K+K+R NY GF+PA + I D    ++I+  V   +   L+ R   +HGSF + AA  F +     
Subjt:  LHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDG-NNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHR-----

Query:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF
                                                        DPF   +   R E G    ++N Q      +K ++++         +R+ SF
Subjt:  ---------------------------------------------GPNDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSF

Query:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------
        L       P  +  + +  E + + +S +    ND  E    P    K     R++K P   T   E  ++ L +     K+ + S  +           
Subjt:  LGFSEKAGPSKVHKSQAQVEESQQAQSDK----NDSGES--PPPSNNKKGKESRLRKRPSRPTLKGEPSSVRLAKPPISGKRKDSSPQL-----------

Query:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS
         E+  G++  +       ++  DS   + L     E   +N   + +  +   + + + S+     A  +++V+ L +N L + P    N+         
Subjt:  QEDEIGINLEDGMV----NQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLVKS

Query:  IWSSRYIAWIALDAINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGDFNMVRWSKE
        I S + +       +   GGIL +W D+  +V D  +G +S+++       N  W++ VYGP  Y DR +   EL  L  LC   W I GDFN+VRW +E
Subjt:  IWSSRYIAWIALDAINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGDFNMVRWSKE

Query:  RLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM
            S   ++MANFN FI  +EL++PP++N  F+WS +
Subjt:  RLNASRSTKSMANFNRFIDSSELLEPPMMNGAFSWSRM

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein1.1e-2426.87Show/hide
Query:  EKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQT
        E ++LW+K I++ Y  +H G    + +    N  W  I      +E    ++ N+G S+ FW  +W  + PL      L+A+S  + A++ E W   S  
Subjt:  EKSALWRKIIESIYGTSHCGWKANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQT

Query:  WNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC--FLSMTSASPKLNQPSSSL--IWKHKSPKKVKMLLWSLVYRSLNTDEM
        WN+  +R L ERE   W ++   +  +    G    +W    S  ++  S        S+ PK       L  +W+   P+K K  +W++V++ LNT + 
Subjt:  WNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEGSGVFSTKSC--FLSMTSASPKLNQPSSSL--IWKHKSPKKVKMLLWSLVYRSLNTDEM

Query:  LQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVANLLGISFCKPKKIEDWLLEGAHLCR
        +Q++     L+PS C  C  + + ++HLF+ C FA + WN  ++  G        ++D  L+   LCR
Subjt:  LQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVANLLGISFCKPKKIEDWLLEGAHLCR

A0A6J1D6X4 uncharacterized protein LOC1110181864.6e-6647.76Show/hide
Query:  DEVRKIKWSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNRGWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQI
        +EVR++ W E +V+T+RD HDDW RIL  +++Q ++  +INPF  DKAL+KCPS++LA LL  N+GWV+FGP  +K+E WN   H R    P  G W++I
Subjt:  DEVRKIKWSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNRGWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQI

Query:  RNLLLHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDGNNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHRGP
        RN+ LHLW L  FKAIG+ LGGFI+Y+++NS  IEC D+ +K++ NYCGFIPAE+  +DG   +  ++V+F+D   L  +   IHG FSS AA +FH+G 
Subjt:  RNLLLHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDGNNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHRGP

Query:  NDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELF
         +     +D WR+E+G  YP VNIQ  +    K  R  +GG++L+
Subjt:  NDPFFCPVDIWRIEDGIVYPMVNIQERSESLNKENRQLDGGTELF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-0721.79Show/hide
Query:  PKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQTWNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEG
        P  EQF +  + NG+   FW D W    PL  +  D  + S +   +           W L   R    + I + ++ +       +   +D+  W + G
Subjt:  PKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQTWNLGFKRGLFEREISNWLALVEKIKNVELINGQDTISWKLEG

Query:  --SGVFSTKSCFLSMTSASPKLNQPSSSLIWKHKSPKKVKMLLWSLVYRSLNTDEMLQRKFGGW-MLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVA
             FS+   + ++   +P+L+   +  +W   +  K    +W      L T    +++   W  +    C LC    ++ DHL   CEFA   W    
Subjt:  --SGVFSTKSCFLSMTSASPKLNQPSSSLIWKHKSPKKVKMLLWSLVYRSLNTDEMLQRKFGGW-MLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVA

Query:  NLLGISFCKPKKIEDWLLEGAHLCRWQRHSEASA
        + L    C  +++       A L  W R S +SA
Subjt:  NLLGISFCKPKKIEDWLLEGAHLCRWQRHSEASA

AT3G25270.1 Ribonuclease H-like superfamily protein8.4e-0433.33Show/hide
Query:  IWKHKSPKKVKMLLWSLVYRSLNTDEMLQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAW
        IWK K+  K+K  LW L+  +L T + L+R+    + +   C  C +  +T  HLF  C +A   W
Subjt:  IWKHKSPKKVKMLLWSLVYRSLNTDEMLQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAGTCTGCTTTCAAAAGCTGCAAAGACGAAGTCAGGAAGATCAAATGGAGTGAAGTCCTGGTGGTAACAAAACGAGATCTTCATGACGATTGGGGCCGCATTCT
TGATATTATGCAGCAACAATTGGATGCCCCTTTAGTCATCAACCCATTCCACCCTGACAAAGCTTTGCTGAAATGTCCCTCAGAAGAGTTGGCGGAGCTTTTATCAAAGA
ATAGAGGCTGGGTTAGTTTCGGGCCGTTTATATTGAAGATAGAGAGATGGAATACGGAAAAGCACAGCCGGATGTCCTGTGTTCCGTGCGATGGGGGGTGGATTCAAATT
AGAAACCTTCTTTTACATTTATGGCGCCTCAACGTTTTTAAAGCCATAGGGGATTGCCTTGGTGGCTTCATTGAATACGAAGAGTCTAACTCTCTCCTCATTGAATGCGT
GGATTTAAAGTTGAAGATTAGAGACAATTATTGCGGTTTTATCCCTGCGGAAGTTCGAATTGTCGATGGAAATAACCACTATATTATTCAGATTGTCACGTTTCAGGATG
GCAATATGCTTATTGTCAGGGTCGCCGAAATTCATGGCAGCTTCTCGTCGGCGGCGGCCCATGCCTTTCATCGAGGTCCAAACGATCCTTTCTTTTGCCCTGTGGACATA
TGGAGGATTGAAGACGGTATAGTTTATCCGATGGTTAATATCCAAGAGAGGTCAGAAAGTCTGAATAAGGAGAATAGACAGCTGGACGGCGGTACAGAGCTTTTTGAAAT
ATCCCGCCAACAATCCTTTCTAGGGTTCTCTGAGAAAGCTGGGCCGTCCAAAGTGCATAAAAGTCAGGCCCAAGTCGAAGAATCACAACAAGCCCAATCGGATAAAAATG
ATTCTGGAGAAAGCCCACCGCCCAGCAACAATAAAAAAGGAAAGGAGTCACGTTTGCGAAAAAGACCCAGTCGACCAACTTTAAAAGGGGAACCATCCAGTGTACGACTA
GCAAAACCTCCGATATCTGGGAAGAGGAAAGATAGTAGTCCGCAGTTGCAAGAGGATGAGATAGGGATTAACCTTGAGGATGGCATGGTCAACCAACAAATAGATTCTCC
TCGAAAGGAAATCCTTGAGAAACCTTGTATGGAGTATGATATTGAGAATTCCCCGAAGGCCTTGGCCTGCATACAGCCGATAGAGGAAGCCTCAAATAGTAATTCTCAGT
GCATTGAAGGATTTGCTATCAGCAAAGAGGTGGTGATGACTCTTAGTAAAAATAATTTGTGTATTAGACCTATAGTGGGGGCAAACAACAAGAAAGATAGGAGGCTTGTG
AAATCCATTTGGAGCTCTAGGTATATAGCTTGGATTGCCCTGGATGCTATTAACTCGGCTGGAGGTATCCTTTTTATGTGGAAAGATTCAATTGTTGAGGTTTTGGACTC
TGTGTTGGGGGTCTTTTCTGTCACTGTGCAATGTTCTTTTCAGGGTCAGAATGTAGGGTGGATTTCTGGGGTATATGGGCCATGTAATTACAAGGATAGAAGGCAGTTTA
TGCAAGAGCTGTTTGATTTGAATGGCTTATGCCAGGGGATCTGGTGCATCGTGGGTGATTTTAATATGGTTAGATGGTCCAAGGAGAGACTTAATGCTAGCAGATCGACG
AAAAGTATGGCTAACTTTAATCGGTTCATAGATTCCTCTGAGCTTCTAGAGCCTCCCATGATGAACGGGGCGTTTTCCTGGTCTAGAATGGGTGAACATATTGGTGCCAA
TGATGTGGAAATCTCCATGTTGAAGTATGCTGATGATACCCTTGTTTTTTGTCCGAATAGTGAAGAGGAATTGGCTAAATGGAAGGAGATCCTCTTGTTGGTTATGGAAG
GATCGGGCCTTAGACTAAACTTACTTAAATATGCTCTTATAGGTGAGAAGTCAGCTCTTTGGAGGAAGATTATTGAGAGTATTTATGGGACCTCTCATTGCGGTTGGAAA
GCCAATTTGTTGAAAGGAAAAAAAGGTAATAGGCTCTGGGTTGACATAGCCTTGAACTATCCCAAATTTGAGCAGTTTACGAGGTTCATTGTTAATAATGGTAAAAGCAT
TAAGTTTTGGGAGGATAGATGGTGTGAAGATCAGCCCCTTAAATCCATCTTTTTAGATTTATTTGCCATTTCTGGCAAAAAAGATGCTTCTATAGCGGAGTGTTGGTGCC
ATGATTCTCAAACTTGGAATCTGGGATTCAAAAGAGGCCTCTTTGAAAGGGAAATTAGTAATTGGTTGGCCTTGGTGGAGAAAATAAAAAATGTTGAATTGATAAATGGT
CAAGACACCATCAGCTGGAAATTAGAAGGGTCTGGAGTTTTTTCGACTAAATCTTGTTTCCTTTCAATGACTTCTGCTTCTCCAAAGTTAAACCAGCCTAGTAGCAGTCT
TATCTGGAAACATAAAAGCCCAAAGAAGGTGAAAATGCTTCTTTGGTCCCTTGTATACAGAAGCTTGAATACGGATGAGATGCTGCAAAGAAAGTTTGGTGGCTGGATGC
TCTCTCCTTCGGCCTGCAGGTTGTGCATTAAGGCAGCTAAAACCTTAGATCATTTATTCCTACATTGTGAGTTTGCAGGGGCTGCTTGGAATTTTGTGGCCAATTTGCTG
GGCATTTCGTTTTGCAAGCCAAAGAAGATTGAGGATTGGCTGTTAGAAGGCGCTCACCTTTGTAGATGGCAAAGGCATAGTGAGGCGAGCGCCTGGTTGATGGCACTCGC
CTTAGCTGCCTTCAAGGCGCAACAACCCAAGGTCGCCTTGCCTCTTGCTTTAGGCGACGCCTTGGGCTGCCACGAAAATGATTCGAAATCAAACAAGTGTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACAGTCTGCTTTCAAAAGCTGCAAAGACGAAGTCAGGAAGATCAAATGGAGTGAAGTCCTGGTGGTAACAAAACGAGATCTTCATGACGATTGGGGCCGCATTCT
TGATATTATGCAGCAACAATTGGATGCCCCTTTAGTCATCAACCCATTCCACCCTGACAAAGCTTTGCTGAAATGTCCCTCAGAAGAGTTGGCGGAGCTTTTATCAAAGA
ATAGAGGCTGGGTTAGTTTCGGGCCGTTTATATTGAAGATAGAGAGATGGAATACGGAAAAGCACAGCCGGATGTCCTGTGTTCCGTGCGATGGGGGGTGGATTCAAATT
AGAAACCTTCTTTTACATTTATGGCGCCTCAACGTTTTTAAAGCCATAGGGGATTGCCTTGGTGGCTTCATTGAATACGAAGAGTCTAACTCTCTCCTCATTGAATGCGT
GGATTTAAAGTTGAAGATTAGAGACAATTATTGCGGTTTTATCCCTGCGGAAGTTCGAATTGTCGATGGAAATAACCACTATATTATTCAGATTGTCACGTTTCAGGATG
GCAATATGCTTATTGTCAGGGTCGCCGAAATTCATGGCAGCTTCTCGTCGGCGGCGGCCCATGCCTTTCATCGAGGTCCAAACGATCCTTTCTTTTGCCCTGTGGACATA
TGGAGGATTGAAGACGGTATAGTTTATCCGATGGTTAATATCCAAGAGAGGTCAGAAAGTCTGAATAAGGAGAATAGACAGCTGGACGGCGGTACAGAGCTTTTTGAAAT
ATCCCGCCAACAATCCTTTCTAGGGTTCTCTGAGAAAGCTGGGCCGTCCAAAGTGCATAAAAGTCAGGCCCAAGTCGAAGAATCACAACAAGCCCAATCGGATAAAAATG
ATTCTGGAGAAAGCCCACCGCCCAGCAACAATAAAAAAGGAAAGGAGTCACGTTTGCGAAAAAGACCCAGTCGACCAACTTTAAAAGGGGAACCATCCAGTGTACGACTA
GCAAAACCTCCGATATCTGGGAAGAGGAAAGATAGTAGTCCGCAGTTGCAAGAGGATGAGATAGGGATTAACCTTGAGGATGGCATGGTCAACCAACAAATAGATTCTCC
TCGAAAGGAAATCCTTGAGAAACCTTGTATGGAGTATGATATTGAGAATTCCCCGAAGGCCTTGGCCTGCATACAGCCGATAGAGGAAGCCTCAAATAGTAATTCTCAGT
GCATTGAAGGATTTGCTATCAGCAAAGAGGTGGTGATGACTCTTAGTAAAAATAATTTGTGTATTAGACCTATAGTGGGGGCAAACAACAAGAAAGATAGGAGGCTTGTG
AAATCCATTTGGAGCTCTAGGTATATAGCTTGGATTGCCCTGGATGCTATTAACTCGGCTGGAGGTATCCTTTTTATGTGGAAAGATTCAATTGTTGAGGTTTTGGACTC
TGTGTTGGGGGTCTTTTCTGTCACTGTGCAATGTTCTTTTCAGGGTCAGAATGTAGGGTGGATTTCTGGGGTATATGGGCCATGTAATTACAAGGATAGAAGGCAGTTTA
TGCAAGAGCTGTTTGATTTGAATGGCTTATGCCAGGGGATCTGGTGCATCGTGGGTGATTTTAATATGGTTAGATGGTCCAAGGAGAGACTTAATGCTAGCAGATCGACG
AAAAGTATGGCTAACTTTAATCGGTTCATAGATTCCTCTGAGCTTCTAGAGCCTCCCATGATGAACGGGGCGTTTTCCTGGTCTAGAATGGGTGAACATATTGGTGCCAA
TGATGTGGAAATCTCCATGTTGAAGTATGCTGATGATACCCTTGTTTTTTGTCCGAATAGTGAAGAGGAATTGGCTAAATGGAAGGAGATCCTCTTGTTGGTTATGGAAG
GATCGGGCCTTAGACTAAACTTACTTAAATATGCTCTTATAGGTGAGAAGTCAGCTCTTTGGAGGAAGATTATTGAGAGTATTTATGGGACCTCTCATTGCGGTTGGAAA
GCCAATTTGTTGAAAGGAAAAAAAGGTAATAGGCTCTGGGTTGACATAGCCTTGAACTATCCCAAATTTGAGCAGTTTACGAGGTTCATTGTTAATAATGGTAAAAGCAT
TAAGTTTTGGGAGGATAGATGGTGTGAAGATCAGCCCCTTAAATCCATCTTTTTAGATTTATTTGCCATTTCTGGCAAAAAAGATGCTTCTATAGCGGAGTGTTGGTGCC
ATGATTCTCAAACTTGGAATCTGGGATTCAAAAGAGGCCTCTTTGAAAGGGAAATTAGTAATTGGTTGGCCTTGGTGGAGAAAATAAAAAATGTTGAATTGATAAATGGT
CAAGACACCATCAGCTGGAAATTAGAAGGGTCTGGAGTTTTTTCGACTAAATCTTGTTTCCTTTCAATGACTTCTGCTTCTCCAAAGTTAAACCAGCCTAGTAGCAGTCT
TATCTGGAAACATAAAAGCCCAAAGAAGGTGAAAATGCTTCTTTGGTCCCTTGTATACAGAAGCTTGAATACGGATGAGATGCTGCAAAGAAAGTTTGGTGGCTGGATGC
TCTCTCCTTCGGCCTGCAGGTTGTGCATTAAGGCAGCTAAAACCTTAGATCATTTATTCCTACATTGTGAGTTTGCAGGGGCTGCTTGGAATTTTGTGGCCAATTTGCTG
GGCATTTCGTTTTGCAAGCCAAAGAAGATTGAGGATTGGCTGTTAGAAGGCGCTCACCTTTGTAGATGGCAAAGGCATAGTGAGGCGAGCGCCTGGTTGATGGCACTCGC
CTTAGCTGCCTTCAAGGCGCAACAACCCAAGGTCGCCTTGCCTCTTGCTTTAGGCGACGCCTTGGGCTGCCACGAAAATGATTCGAAATCAAACAAGTGTTTTTGA
Protein sequenceShow/hide protein sequence
MEQSAFKSCKDEVRKIKWSEVLVVTKRDLHDDWGRILDIMQQQLDAPLVINPFHPDKALLKCPSEELAELLSKNRGWVSFGPFILKIERWNTEKHSRMSCVPCDGGWIQI
RNLLLHLWRLNVFKAIGDCLGGFIEYEESNSLLIECVDLKLKIRDNYCGFIPAEVRIVDGNNHYIIQIVTFQDGNMLIVRVAEIHGSFSSAAAHAFHRGPNDPFFCPVDI
WRIEDGIVYPMVNIQERSESLNKENRQLDGGTELFEISRQQSFLGFSEKAGPSKVHKSQAQVEESQQAQSDKNDSGESPPPSNNKKGKESRLRKRPSRPTLKGEPSSVRL
AKPPISGKRKDSSPQLQEDEIGINLEDGMVNQQIDSPRKEILEKPCMEYDIENSPKALACIQPIEEASNSNSQCIEGFAISKEVVMTLSKNNLCIRPIVGANNKKDRRLV
KSIWSSRYIAWIALDAINSAGGILFMWKDSIVEVLDSVLGVFSVTVQCSFQGQNVGWISGVYGPCNYKDRRQFMQELFDLNGLCQGIWCIVGDFNMVRWSKERLNASRST
KSMANFNRFIDSSELLEPPMMNGAFSWSRMGEHIGANDVEISMLKYADDTLVFCPNSEEELAKWKEILLLVMEGSGLRLNLLKYALIGEKSALWRKIIESIYGTSHCGWK
ANLLKGKKGNRLWVDIALNYPKFEQFTRFIVNNGKSIKFWEDRWCEDQPLKSIFLDLFAISGKKDASIAECWCHDSQTWNLGFKRGLFEREISNWLALVEKIKNVELING
QDTISWKLEGSGVFSTKSCFLSMTSASPKLNQPSSSLIWKHKSPKKVKMLLWSLVYRSLNTDEMLQRKFGGWMLSPSACRLCIKAAKTLDHLFLHCEFAGAAWNFVANLL
GISFCKPKKIEDWLLEGAHLCRWQRHSEASAWLMALALAAFKAQQPKVALPLALGDALGCHENDSKSNKCF