; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000393 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000393
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr4:5795957..5806338
RNA-Seq ExpressionLag0000393
SyntenyLag0000393
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]3.8e-22362.14Show/hide
Query:  CPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMILDNVYIVP
        CP+     K       K DLLV+ETCLVE D S WILDS                              GEV+SA AVGD+ LFF ++RY+IL +V  VP
Subjt:  CPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMILDNVYIVP

Query:  KIKRNLISLSCLLEQGYSISFSVNEAFI-----------------------------------------------------TKRGHINLNRIGRLVKSGL
         +KRNLIS++C+LE  Y+ISF VNE FI                                                      + GHINLNRI RLVKSG+
Subjt:  KIKRNLISLSCLLEQGYSISFSVNEAFI-----------------------------------------------------TKRGHINLNRIGRLVKSGL

Query:  LNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIK
        LN+LED+SLPPCESCLEGKMTKR FTGKG  AK PLEL+HSDLCGPMN++ARGGYEYFISFIDD+SRYG++YL+HHKSE+FEKFKEYKAEVEN +GKTIK
Subjt:  LNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIK

Query:  TLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----
        TLRSDRGGEYMD  FQDYLIE GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYA+ETA++ILN VPSKSV ETP+ELWKGR     
Subjt:  TLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----

Query:  ------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATE---TSTRVVDDTGTSSQS
                                      YPKE+RGG F+ P+ENKV VSTNA FLEEDH R+H+PRSK+VL E+   AT+   +ST+VVD    S QS
Subjt:  ------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATE---TSTRVVDDTGTSSQS

Query:  RPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKV
          SQE+R PRRSGRVV QP+RYLGL ET ++IPDDG+EDPLT+KQAM+DVD+++WIKAM+LE+ESM+FNSVW LVD P  V+PIGCKWIYKRKRD AGKV
Subjt:  RPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKV

Query:  QTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK
        QTFKARLVAKG+TQ+EG+DYEETFSP+AMLKSIRILLSIATFY+YEIWQMDVKT FLNGNL++
Subjt:  QTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]5.4e-23865.63Show/hide
Query:  STEECPQTNC----VAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMI
        + +E  +TNC    V KK    + K DLLVLETCLVEND +AWILDS                              G+VISA AVGD KLFF   ++M 
Subjt:  STEECPQTNC----VAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMI

Query:  LDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR-------------------------------------------------------GHINLN
        L+N+YIVPKIKRNL+S+SCL+E  YSI+FS+NEAFI K                                                        GHINL+
Subjt:  LDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR-------------------------------------------------------GHINLN

Query:  RIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAE
        RIGRLVK+GLLNKL+D SLPPCESCLEGKMTKRPFTGKGY AKEPLELIHSDLCGPMN++ARGG+EYFISFIDDYSRYGYLYLM HKSEA EKFKEYK E
Subjt:  RIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAE

Query:  VENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFE
        VENLL K IK LRSDRGGEYMDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYAVETAV+ILN VPSKSVSETPFE
Subjt:  VENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFE

Query:  LWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATETSTRV---
        LW+GR                                   YPKETRGG FFDP+EN+V VSTNA FLEEDH+R+HKPRSK+VLSE ++E+T     V   
Subjt:  LWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATETSTRV---

Query:  --VDDTGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKW
          VD+T TS QS PSQ +R PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKW
Subjt:  --VDDTGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKW

Query:  IYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK
        IYKRKRD+AGKVQTFKARLVAKG+TQREG+DYEETFSP+AMLKSIRILLSIATFYDYEIWQMDVKT FLNGNL++
Subjt:  IYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-23564.99Show/hide
Query:  STEECPQTNC---VAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMIL
        + +E  +TNC   + KKN   +SK DLLVLETCLVEND +AWILDS                              G+VISA AVGD KLFF   ++M L
Subjt:  STEECPQTNC---VAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMIL

Query:  DNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR-------------------------------------------------------GHINLNR
        +N+YIVPKIKRNL+S+SCL+E  YSI+FS+NEAFI K                                                        GHINL+R
Subjt:  DNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR-------------------------------------------------------GHINLNR

Query:  IGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEV
        IGRLVK GLLNKL+D SLPPCESCLEGKMTKRPFTGKGY AKEPLELIHSDLCGPMN++ARG +EYFISFIDDYSRYGYLYLM HKSEA EKFKEYK EV
Subjt:  IGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEV

Query:  ENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFEL
        ENLL K IK  RSDRGGEYMDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYAVETAV+ILN VPSKSVSETPFEL
Subjt:  ENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFEL

Query:  WKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATETSTRV----
        W+GR                                   YPKETRGG FFDP+EN+V VSTNA FLEEDH+R+HKPRSK+VLSE ++E+T     V    
Subjt:  WKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATETSTRV----

Query:  -VDDTGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWI
         VD+T TS QS PSQ +R PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKWI
Subjt:  -VDDTGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWI

Query:  YKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK
        YKRKRD+AGKVQTFKARLVAKG+T++EG+DYEETFS +AMLKSIRILLSIA FYDYEIWQMDVKT FLNGNL++
Subjt:  YKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-22258.99Show/hide
Query:  NVCALSNAKLNGLVHSTKNTCLNAK---HVAKNLGLVSTEECPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS--------------------
        N   L+ AK      + K  C +     H  +N        CP+   +A+K      K DLLVLETCLVENDDSAWI+DS                    
Subjt:  NVCALSNAKLNGLVHSTKNTCLNAK---HVAKNLGLVSTEECPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS--------------------

Query:  ----------GEVISAIAVGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR----------------------------
                  G V+SAIAVG ++L   K  +++L+NVY+VP +KRNLIS+ CLLEQ YS++F+VN+ FI K                             
Subjt:  ----------GEVISAIAVGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR----------------------------

Query:  ---------------------------GHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEY
                                   GHINLNRI RLVK+GLL++LE++SLP CESCLEGKMTKRPFTGKG+ AKEPLEL+HSDLCGPMN++ARGG+EY
Subjt:  ---------------------------GHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEY

Query:  FISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY
        FI+F DDYSRYGY+YLM HKSEA EKFKEYKAEVEN L KTIKT RSDRGGEYMDL FQ+YL+E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY
Subjt:  FISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY

Query:  AQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFL
        A LPNSFWGYAV+TAVYILN VPSKSVSETP +LW GR                                   YPK TRGGYF+DP++NKV VSTNA FL
Subjt:  AQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFL

Query:  EEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVD
        EEDH+R+HKPRSK+VL+ELS E TE STRVV++          G+S+++   Q +REPRRSGRV   P RY+ LTET  VI D  IEDPLTFK+AM+DVD
Subjt:  EEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVD

Query:  KNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMD
        K++WIKAM+LE+ESM+FNSVWDLVDQP+GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EG+DYEETFSP+AMLKSIRILLSIA ++DYEIWQMD
Subjt:  KNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMD

Query:  VKTVFLNGNLKK
        VKT FLNGNL++
Subjt:  VKTVFLNGNLKK

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-22258.99Show/hide
Query:  NVCALSNAKLNGLVHSTKNTCLNAK---HVAKNLGLVSTEECPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS--------------------
        N   L+ AK      + K  C +     H  +N        CP+   +A+K      K DLLVLETCLVENDDSAWI+DS                    
Subjt:  NVCALSNAKLNGLVHSTKNTCLNAK---HVAKNLGLVSTEECPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS--------------------

Query:  ----------GEVISAIAVGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR----------------------------
                  G V+SAIAVG ++L   K  +++L+NVY+VP +KRNLIS+ CLLEQ YS++F+VN+ FI K                             
Subjt:  ----------GEVISAIAVGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR----------------------------

Query:  ---------------------------GHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEY
                                   GHINLNRI RLVK+GLL++LE++SLP CESCLEGKMTKRPFTGKG+ AKEPLEL+HSDLCGPMN++ARGG+EY
Subjt:  ---------------------------GHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEY

Query:  FISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY
        FI+F DDYSRYGY+YLM HKSEA EKFKEYKAEVEN L KTIKT RSDRGGEYMDL FQ+YL+E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY
Subjt:  FISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY

Query:  AQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFL
        A LPNSFWGYAV+TAVYILN VPSKSVSETP +LW GR                                   YPK TRGGYF+DP++NKV VSTNA FL
Subjt:  AQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFL

Query:  EEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVD
        EEDH+R+HKPRSK+VL+ELS E TE STRVV++          G+S+++   Q +REPRRSGRV   P RY+ LTET  VI D  IEDPLTFK+AM+DVD
Subjt:  EEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVD

Query:  KNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMD
        K++WIKAM+LE+ESM+FNSVWDLVDQP+GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EG+DYEETFSP+AMLKSIRILLSIA ++DYEIWQMD
Subjt:  KNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMD

Query:  VKTVFLNGNLKK
        VKT FLNGNL++
Subjt:  VKTVFLNGNLKK

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein9.0e-22358.99Show/hide
Query:  NVCALSNAKLNGLVHSTKNTCLNAK---HVAKNLGLVSTEECPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS--------------------
        N   L+ AK      + K  C +     H  +N        CP+   +A+K      K DLLVLETCLVENDDSAWI+DS                    
Subjt:  NVCALSNAKLNGLVHSTKNTCLNAK---HVAKNLGLVSTEECPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS--------------------

Query:  ----------GEVISAIAVGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR----------------------------
                  G V+SAIAVG ++L   K  +++L+NVY+VP +KRNLIS+ CLLEQ YS++F+VN+ FI K                             
Subjt:  ----------GEVISAIAVGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR----------------------------

Query:  ---------------------------GHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEY
                                   GHINLNRI RLVK+GLL++LE++SLP CESCLEGKMTKRPFTGKG+ AKEPLEL+HSDLCGPMN++ARGG+EY
Subjt:  ---------------------------GHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEY

Query:  FISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY
        FI+F DDYSRYGY+YLM HKSEA EKFKEYKAEVEN L KTIKT RSDRGGEYMDL FQ+YL+E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY
Subjt:  FISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY

Query:  AQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFL
        A LPNSFWGYAV+TAVYILN VPSKSVSETP +LW GR                                   YPK TRGGYF+DP++NKV VSTNA FL
Subjt:  AQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFL

Query:  EEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVD
        EEDH+R+HKPRSK+VL+ELS E TE STRVV++          G+S+++   Q +REPRRSGRV   P RY+ LTET  VI D  IEDPLTFK+AM+DVD
Subjt:  EEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVD

Query:  KNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMD
        K++WIKAM+LE+ESM+FNSVWDLVDQP+GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EG+DYEETFSP+AMLKSIRILLSIA ++DYEIWQMD
Subjt:  KNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMD

Query:  VKTVFLNGNLKK
        VKT FLNGNL++
Subjt:  VKTVFLNGNLKK

A0A5A7T2V9 Gag/pol protein2.7e-23564.99Show/hide
Query:  STEECPQTNC---VAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMIL
        + +E  +TNC   + KKN   +SK DLLVLETCLVEND +AWILDS                              G+VISA AVGD KLFF   ++M L
Subjt:  STEECPQTNC---VAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMIL

Query:  DNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR-------------------------------------------------------GHINLNR
        +N+YIVPKIKRNL+S+SCL+E  YSI+FS+NEAFI K                                                        GHINL+R
Subjt:  DNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR-------------------------------------------------------GHINLNR

Query:  IGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEV
        IGRLVK GLLNKL+D SLPPCESCLEGKMTKRPFTGKGY AKEPLELIHSDLCGPMN++ARG +EYFISFIDDYSRYGYLYLM HKSEA EKFKEYK EV
Subjt:  IGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEV

Query:  ENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFEL
        ENLL K IK  RSDRGGEYMDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYAVETAV+ILN VPSKSVSETPFEL
Subjt:  ENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFEL

Query:  WKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATETSTRV----
        W+GR                                   YPKETRGG FFDP+EN+V VSTNA FLEEDH+R+HKPRSK+VLSE ++E+T     V    
Subjt:  WKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATETSTRV----

Query:  -VDDTGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWI
         VD+T TS QS PSQ +R PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKWI
Subjt:  -VDDTGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWI

Query:  YKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK
        YKRKRD+AGKVQTFKARLVAKG+T++EG+DYEETFS +AMLKSIRILLSIA FYDYEIWQMDVKT FLNGNL++
Subjt:  YKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK

A0A5A7TZD0 Gag/pol protein2.6e-23865.63Show/hide
Query:  STEECPQTNC----VAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMI
        + +E  +TNC    V KK    + K DLLVLETCLVEND +AWILDS                              G+VISA AVGD KLFF   ++M 
Subjt:  STEECPQTNC----VAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMI

Query:  LDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR-------------------------------------------------------GHINLN
        L+N+YIVPKIKRNL+S+SCL+E  YSI+FS+NEAFI K                                                        GHINL+
Subjt:  LDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR-------------------------------------------------------GHINLN

Query:  RIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAE
        RIGRLVK+GLLNKL+D SLPPCESCLEGKMTKRPFTGKGY AKEPLELIHSDLCGPMN++ARGG+EYFISFIDDYSRYGYLYLM HKSEA EKFKEYK E
Subjt:  RIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAE

Query:  VENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFE
        VENLL K IK LRSDRGGEYMDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYAVETAV+ILN VPSKSVSETPFE
Subjt:  VENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFE

Query:  LWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATETSTRV---
        LW+GR                                   YPKETRGG FFDP+EN+V VSTNA FLEEDH+R+HKPRSK+VLSE ++E+T     V   
Subjt:  LWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATETSTRV---

Query:  --VDDTGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKW
          VD+T TS QS PSQ +R PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKW
Subjt:  --VDDTGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKW

Query:  IYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK
        IYKRKRD+AGKVQTFKARLVAKG+TQREG+DYEETFSP+AMLKSIRILLSIATFYDYEIWQMDVKT FLNGNL++
Subjt:  IYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK

A0A5D3CPJ6 Gag/pol protein9.0e-22358.99Show/hide
Query:  NVCALSNAKLNGLVHSTKNTCLNAK---HVAKNLGLVSTEECPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS--------------------
        N   L+ AK      + K  C +     H  +N        CP+   +A+K      K DLLVLETCLVENDDSAWI+DS                    
Subjt:  NVCALSNAKLNGLVHSTKNTCLNAK---HVAKNLGLVSTEECPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS--------------------

Query:  ----------GEVISAIAVGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR----------------------------
                  G V+SAIAVG ++L   K  +++L+NVY+VP +KRNLIS+ CLLEQ YS++F+VN+ FI K                             
Subjt:  ----------GEVISAIAVGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITKR----------------------------

Query:  ---------------------------GHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEY
                                   GHINLNRI RLVK+GLL++LE++SLP CESCLEGKMTKRPFTGKG+ AKEPLEL+HSDLCGPMN++ARGG+EY
Subjt:  ---------------------------GHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEY

Query:  FISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY
        FI+F DDYSRYGY+YLM HKSEA EKFKEYKAEVEN L KTIKT RSDRGGEYMDL FQ+YL+E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY
Subjt:  FISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSY

Query:  AQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFL
        A LPNSFWGYAV+TAVYILN VPSKSVSETP +LW GR                                   YPK TRGGYF+DP++NKV VSTNA FL
Subjt:  AQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----------------------------------YPKETRGGYFFDPEENKVLVSTNAPFL

Query:  EEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVD
        EEDH+R+HKPRSK+VL+ELS E TE STRVV++          G+S+++   Q +REPRRSGRV   P RY+ LTET  VI D  IEDPLTFK+AM+DVD
Subjt:  EEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVD

Query:  KNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMD
        K++WIKAM+LE+ESM+FNSVWDLVDQP+GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EG+DYEETFSP+AMLKSIRILLSIA ++DYEIWQMD
Subjt:  KNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMD

Query:  VKTVFLNGNLKK
        VKT FLNGNL++
Subjt:  VKTVFLNGNLKK

E2GK51 Gag/pol protein (Fragment)1.8e-22362.14Show/hide
Query:  CPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMILDNVYIVP
        CP+     K       K DLLV+ETCLVE D S WILDS                              GEV+SA AVGD+ LFF ++RY+IL +V  VP
Subjt:  CPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDS------------------------------GEVISAIAVGDIKLFFTKERYMILDNVYIVP

Query:  KIKRNLISLSCLLEQGYSISFSVNEAFI-----------------------------------------------------TKRGHINLNRIGRLVKSGL
         +KRNLIS++C+LE  Y+ISF VNE FI                                                      + GHINLNRI RLVKSG+
Subjt:  KIKRNLISLSCLLEQGYSISFSVNEAFI-----------------------------------------------------TKRGHINLNRIGRLVKSGL

Query:  LNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIK
        LN+LED+SLPPCESCLEGKMTKR FTGKG  AK PLEL+HSDLCGPMN++ARGGYEYFISFIDD+SRYG++YL+HHKSE+FEKFKEYKAEVEN +GKTIK
Subjt:  LNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIK

Query:  TLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----
        TLRSDRGGEYMD  FQDYLIE GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYA+ETA++ILN VPSKSV ETP+ELWKGR     
Subjt:  TLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGR-----

Query:  ------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATE---TSTRVVDDTGTSSQS
                                      YPKE+RGG F+ P+ENKV VSTNA FLEEDH R+H+PRSK+VL E+   AT+   +ST+VVD    S QS
Subjt:  ------------------------------YPKETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATE---TSTRVVDDTGTSSQS

Query:  RPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKV
          SQE+R PRRSGRVV QP+RYLGL ET ++IPDDG+EDPLT+KQAM+DVD+++WIKAM+LE+ESM+FNSVW LVD P  V+PIGCKWIYKRKRD AGKV
Subjt:  RPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKV

Query:  QTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK
        QTFKARLVAKG+TQ+EG+DYEETFSP+AMLKSIRILLSIATFY+YEIWQMDVKT FLNGNL++
Subjt:  QTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.7e-5427.48Show/hide
Query:  FTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAF---ITKRGHIN------LNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPF---
        F K    I  N  +V K    L ++  +  Q YSI+      F     + GHI+      + R        LLN LE  S   CE CL GK  + PF   
Subjt:  FTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAF---ITKRGHIN------LNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPF---

Query:  TGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGIT
          K +  K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+ +KS+ F  F+++ A+ E      +  L  D G EY+    + + ++ GI+
Subjt:  TGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGIT

Query:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSV---SETPFELWKGRYPK---------------ETRGGYFFD
          L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG AV TA Y++N +PS+++   S+TP+E+W  + P                + + G F D
Subjt:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSV---SETPFELWKGRYPK---------------ETRGGYFFD

Query:  --------------------------------------------------------------PEENKVLVSTNAP----------FLEEDHLRDHK----
                                                                      P +++ ++ T  P          FL++    ++K    
Subjt:  --------------------------------------------------------------PEENKVLVSTNAP----------FLEEDHLRDHK----

Query:  PRSKVVLSELSNEA--------------------TETSTRVVDD-------TGTSSQSRPSQE--------IREP----------RRSGRVVRQPDRYLG
           K++ +E  NE+                     E+  R  DD       +G  ++SR S+         I  P          RRS R+  +P     
Subjt:  PRSKVVLSELSNEA--------------------TETSTRVVDD-------TGTSSQSRPSQE--------IREP----------RRSGRVVRQPDRYLG

Query:  LTE---TPVVIPDDGI--EDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGID
          +     VV+    I  + P +F +     DK+ W +A++ E+ +   N+ W +  +PE    +  +W++  K +  G    +KARLVA+GFTQ+  ID
Subjt:  LTE---TPVVIPDDGI--EDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGID

Query:  YEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK
        YEETF+P+A + S R +LS+   Y+ ++ QMDVKT FLNG LK+
Subjt:  YEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNLKK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.1e-7932.84Show/hide
Query:  VGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITK-----------------------------------------RGHINL
        +GDI +       ++L +V  VP ++ NLIS   L   GY   F+  +  +TK                                          GH++ 
Subjt:  VGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITK-----------------------------------------RGHINL

Query:  NRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKA
          +  L K  L++  +  ++ PC+ CL GK  +  F          L+L++SD+CGPM I + GG +YF++FIDD SR  ++Y++  K + F+ F+++ A
Subjt:  NRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKA

Query:  EVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVS-ETP
         VE   G+ +K LRSD GGEY    F++Y   HGI  + + PGTPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+TA Y++N  PS  ++ E P
Subjt:  EVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVS-ETP

Query:  FELWKGR-----------------YPKETR--------------------GGYFFDPEENKVLVSTNAPFLEED--HLRDHKPRSK-------VVLSELS
          +W  +                  PKE R                    G   +DP + KV+ S +  F E +     D   + K       V +   S
Subjt:  FELWKGR-----------------YPKETR--------------------GGYFFDPEENKVLVSTNAPFLEED--HLRDHKPRSK-------VVLSELS

Query:  NEAT--ETSTRVVD----------------DTGTSSQSRPSQ--EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMD
        N  T  E++T  V                 D G      P+Q  E  +P R     R   R    TE  V+I DD   +P + K+ +   +KN+ +KAM 
Subjt:  NEAT--ETSTRVVD----------------DTGTSSQSRPSQ--EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNKWIKAMD

Query:  LEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGN
         E+ES+  N  + LV+ P+G RP+ CKW++K K+D   K+  +KARLV KGF Q++GID++E FSP+  + SIR +LS+A   D E+ Q+DVKT FL+G+
Subjt:  LEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGN

Query:  LKK
        L++
Subjt:  LKK

Q12490 Transposon Ty1-BL Gag-Pol polyprotein5.1e-2125.37Show/hide
Query:  ISAIAVGDIKLFFTKERYMILDNVYIVPKIKRN---LISLSCLLEQGYSISF--SVNEAFITKR----------GHINLNRIGRLVKSGLLNKLEDDSLP
        ++ +A  DI   FTK      D   + P +K      +S   LL    S+    +V+ +  T++           H N   I   +K+  +    +  + 
Subjt:  ISAIAVGDIKLFFTKERYMILDNVYIVPKIKRN---LISLSCLLEQGYSISF--SVNEAFITKR----------GHINLNRIGRLVKSGLLNKLEDDSLP

Query:  -------PCESCLEGKMTK-RPFTG---KGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSE--AFEKFKEYKAEVENLLGK
                C  CL GK TK R   G   K   + EP + +H+D+ GP++   +    YFISF D+ +++ ++Y +H + E    + F    A ++N    
Subjt:  -------PCESCLEGKMTK-RPFTG---KGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSE--AFEKFKEYKAEVENLLGK

Query:  TIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGRYP
        ++  ++ DRG EY + +   +L ++GIT   +     + +GV+ER NRTLLD  R+ +  + LPN  W  A+E +  + N + S    ++          
Subjt:  TIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGRYP

Query:  KETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKV
        ++  G    D       +ST  PF +   + DH P SK+
Subjt:  KETRGGYFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-3925.56Show/hide
Query:  CESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYM
        C  CL  K  K PF+     +  PLE I+SD+     I +   Y Y++ F+D ++RY +LY +  KS+  E F  +K  +EN     I T  SD GGE++
Subjt:  CESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYM

Query:  DLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVS-ETPFELWKGRYPKE-----------
         L   +Y  +HGI+   S P TP+ NG+SER++R +++   +++S+A +P ++W YA   AVY++N +P+  +  E+PF+   G  P             
Subjt:  DLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVS-ETPFELWKGRYPKE-----------

Query:  -------------------------TRGGYF----------------FDP-------------------EENKVLVSTN-----------APFLEEDHLR
                                 T+  Y                 FD                     E+  + S +           AP   + H  
Subjt:  -------------------------TRGGYF----------------FDP-------------------EENKVLVSTN-----------APFLEEDHLR

Query:  DHKP--------RSKVVLSEL------------------------SNEATETSTRVVDDTGTS-------SQSRPSQEIREPRRSGRVVRQPDRYLGLTE
           P         S+V  S L                        + + T+T T+      TS       S S+ +Q +  P +S      P      + 
Subjt:  DHKP--------RSKVVLSEL------------------------SNEATETSTRVVDDTGTS-------SQSRPSQEIREPRRSGRVVRQPDRYLGLTE

Query:  TPVVIPDDGIEDPLTFKQ--------------------------------------------AMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEG-VR
        T    P   I  P    Q                                            A+  +   +W  AM  EI +   N  WDLV  P   V 
Subjt:  TPVVIPDDGIEDPLTFKQ--------------------------------------------AMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEG-VR

Query:  PIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNL
         +GC+WI+ +K ++ G +  +KARLVAKG+ QR G+DY ETFSP+    SIRI+L +A    + I Q+DV   FL G L
Subjt:  PIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.0e-4325.84Show/hide
Query:  SFSVNEAFITKRGHINLNRIGRLVKSGLLNKLE-DDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYL
        S + + ++ ++ GH +L  +  ++ +  L  L     L  C  C   K  K PF+     + +PLE I+SD+     I +   Y Y++ F+D ++RY +L
Subjt:  SFSVNEAFITKRGHINLNRIGRLVKSGLLNKLE-DDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYL

Query:  YLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVET
        Y +  KS+  + F  +K+ VEN     I TL SD GGE++ L  +DYL +HGI+   S P TP+ NG+SER++R +++M  +++S+A +P ++W YA   
Subjt:  YLMHHKSEAFEKFKEYKAEVENLLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVET

Query:  AVYILNVVPSKSVS-ETPFELWKGRYPKE------------------------------------TRGGYF---------------------FDPEENKV
        AVY++N +P+  +  ++PF+   G+ P                                      T+  Y                      F       
Subjt:  AVYILNVVPSKSVS-ETPFELWKGRYPKE------------------------------------TRGGYF---------------------FDPEENKV

Query:  LVSTN-------------------------APFLEEDHLRDHKPR-------------------SKVVLSELSNEATETSTRVVDDTGTSSQSRPSQE--
         VST+                         AP     HL D  PR                   S  + S  S+E T  S      T    Q++ S    
Subjt:  LVSTN-------------------------APFLEEDHLRDHKPR-------------------SKVVLSELSNEATETSTRVVDDTGTSSQSRPSQE--

Query:  --IREPRRSGRVVRQPDRYLGLTETPVVIP----------------------------------------------------DDGI--------------
          +  P  +      P++   L ++P+  P                                                     DGI              
Subjt:  --IREPRRSGRVVRQPDRYLGLTETPVVIP----------------------------------------------------DDGI--------------

Query:  --EDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLV-DQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIR
           +P T  QAM D   ++W +AM  EI +   N  WDLV   P  V  +GC+WI+ +K ++ G +  +KARLVAKG+ QR G+DY ETFSP+    SIR
Subjt:  --EDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLV-DQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIR

Query:  ILLSIATFYDYEIWQMDVKTVFLNGNL
        I+L +A    + I Q+DV   FL G L
Subjt:  ILLSIATFYDYEIWQMDVKTVFLNGNL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.2e-2742.74Show/hide
Query:  EDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILL
        ++P T+ +A + +    W  AMD EI +M     W++   P   +PIGCKW+YK K ++ G ++ +KARLVAKG+TQ+EGID+ ETFSP+  L S++++L
Subjt:  EDPLTFKQAMDDVDKNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTVFLNGNL
        +I+  Y++ + Q+D+   FLNG+L
Subjt:  SIATFYDYEIWQMDVKTVFLNGNL

ATMG00300.1 Gag-Pol-related retrotransposon family protein6.8e-0532.35Show/hide
Query:  TKRGHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNI
        ++  H++   +  LVK G L+  +  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  ++
Subjt:  TKRGHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.6e-1643.02Show/hide
Query:  WIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIA
        W +AM  E++++  N  W LV  P     +GCKW++K K  + G +   KARLVAKGF Q EGI + ET+SP+    +IR +L++A
Subjt:  WIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAACAAATTCAAGCTTCCACCTCAACATCAGGCACATCCTTCGATGAGGCACTTAAGAAGCTTACTGACAGCTCTTTACAGTTCCAGAAAGAGACCAGGAACGC
CATTCAGAATTTGGGAAACCAAATCACTCAGCTGGCAACCCAAGTCAGTAAGATGAACAACAAAAGCTCTGGTAAGCTACTTGCCCAACCTAAGGTAAATCCTAAGGCAA
ATGTGAATGTAGTGTGGAGTGCAGTGGCCACACCTGACTGTCCCCCTGGATTCCCCACTGTCTCTGATAAGTTTTCTATTTCTTCTGGAAAATTTTCTTCTTTAAATTTA
AATGATAAAAATTCTTTTTCACCTGTGAACTTCATTGACCCTGCTATTCCAAGTGTGCAGGATGATTTAGGTGTTTTTCGAAAAGTCAAGGTGAACATTCCATTGCTTGA
GGCAATCCTGAAAATCCCAGCCTATGCTGAGTTCCTAAAGACTTGGTATGAGGGTAAGAAAAAGTCATCAGGTAAAGAAGTTGTTAGTGAAAATGTAAGTGCCTTACTTT
CTAAGAACTTACCTGTTAAATGTTCTGATCCTCGCATGTTTACTTTACCGTATACTCCTAATCCTAATTACAATCCATCTTCTGCTACGATACTTCTTGGTTGTCCCTTC
ATGAAGACTGCCAAGATTGTCATTGATGTAGATGAAGGTGTTGTATCAGCTAGTCAATTAGGATTGACGCCATATGCGATTACCGCTTGTGTATTCTATCAATTATTTGA
GATAGCCACTAGGTTGAGGCAAGCCACTCTCTACGCGAGACGACCCTTGATAGGCCTAGGATACATGTTAAAGGGAAACGATATTAGATTTGAACCCAAAATTTCCAGAG
CGGAGACGAGATTTAGAAGGCAAGAAAGAAGGAGAAGAGTTAATAATCTCAGCGAGGTTCAAACACCAACCATGGCTGAAAGGACTTTAAGGCAGCTGGCAGCGCCAGAT
CTGAACCAGAAGCCACTCTGTATTACCTACCTCGAGACGACAGGAGAGGACCCGCACAAACATTTGCAGAAATTTCACATAATCTCCGATCAGTTGTTGATCCCGTATTT
TTACGAAGGGCTACTTCCTATGGACCGTAGTATGGTAGATGCAGCTAGTGGAGGAGCACTAGTCAACAAGACCCTTGTCGAGGCTAGACAACTACTTTCAAGTATGGCTG
AAAACTCTCAGCAATTTGGTACTAGAGGACCACCTGCAATTGTGCAAGCCTCCACAAAAAGTGAGGTTAATATCAAGGTGAATGGGGAAAGTGTTCATAGTAAGTGGGAG
AAGGAAACATGTCAAACGTATCCTGCGGTCTCCATCATTAGGTTGCACCGTGAGATTCCTATGCGCTGCCTGCGAGTCGCCCTGGGAGCGATCACTCTACGGAGGGCTTG
CGCATGGGATTCGGAACAACGCAAACTCCAGAAATGGATAGGGTTTCCTAGGGCCATTCCCAACATTAGCTCTTCCCTACTGTGGCATTGTTGGGGCCGCCCTCTGCAAT
CTGAAAATGATGGCAATATGTCGAAAAACGTATGTGCTTTGTCAAATGCCAAATTAAACGGTTTAGTACATTCGACTAAGAATACTTGCTTAAACGCTAAACACGTGGCA
AAGAATTTAGGTTTGGTCTCAACTGAGGAATGTCCTCAAACTAACTGCGTAGCCAAGAAAAACATGATCCACGACTCTAAATGTGATTTATTAGTCTTGGAAACGTGTTT
AGTGGAAAACGACGATTCTGCCTGGATACTTGATTCAGGAGAAGTCATCTCGGCTATTGCAGTGGGAGACATAAAATTATTCTTCACCAAGGAACGCTACATGATATTAG
ATAATGTGTACATAGTTCCGAAGATTAAAAGAAATTTGATTTCTTTATCTTGTTTATTAGAACAAGGTTACTCAATTTCTTTTTCTGTTAATGAAGCGTTCATAACTAAA
AGAGGCCATATCAACCTCAATAGGATTGGGAGATTGGTCAAGAGTGGACTTCTAAACAAGTTAGAAGACGACTCGTTGCCTCCTTGTGAATCATGCCTTGAGGGCAAGAT
GACAAAACGACCTTTTACCGGAAAAGGTTATTGTGCCAAAGAGCCCTTAGAGCTCATTCATTCGGACCTCTGTGGACCAATGAACATTAGAGCTCGAGGAGGGTACGAAT
ATTTCATCAGTTTCATTGATGATTATTCGAGGTATGGCTATCTCTACCTGATGCATCATAAGTCCGAAGCTTTTGAAAAGTTCAAGGAGTATAAGGCCGAGGTTGAGAAC
CTATTAGGTAAGACGATAAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGACTTAAGTTTTCAAGACTATTTGATAGAGCATGGAATTACGTCCCAACTCTCAGC
CCCTGGTACACCTCAGCAAAACGGTGTATCAGAAAGGAGAAACAGAACCTTGTTGGACATGGTTCGTTCGATGATGAGTTATGCTCAGTTGCCTAACTCGTTTTGGGGTT
ATGCAGTAGAGACTGCAGTTTATATCTTGAACGTAGTTCCCTCAAAGAGTGTTTCTGAAACACCATTTGAGCTCTGGAAAGGTCGCTATCCCAAAGAAACGAGAGGTGGA
TACTTCTTCGATCCAGAGGAAAATAAAGTACTTGTATCGACAAATGCTCCATTCCTAGAAGAAGACCACCTCAGAGATCACAAGCCACGTAGTAAAGTCGTTTTAAGTGA
ACTATCTAATGAAGCTACAGAAACATCAACAAGAGTTGTTGATGATACTGGCACCTCGAGTCAATCACGTCCTTCTCAAGAAATAAGAGAACCTCGACGTAGTGGGAGGG
TTGTTAGACAGCCTGATCGCTATTTGGGTTTAACTGAAACTCCAGTCGTCATACCTGATGACGGGATCGAGGATCCATTAACCTTTAAACAGGCAATGGATGACGTTGAC
AAGAACAAGTGGATTAAAGCCATGGACTTAGAGATCGAGTCCATGCATTTTAATTCAGTATGGGATCTTGTAGATCAACCTGAAGGGGTTAGACCCATAGGGTGCAAATG
GATCTACAAGAGAAAACGAGACGCTGCTGGAAAGGTACAGACCTTTAAGGCACGACTTGTGGCAAAGGGTTTTACCCAGCGAGAGGGAATTGACTATGAAGAAACCTTCT
CCCCTATTGCCATGTTAAAGTCGATCCGGATACTTTTATCCATTGCCACGTTTTACGATTATGAAATATGGCAAATGGATGTCAAGACTGTCTTTCTGAACGGTAATCTT
AAGAAAGTATCTACATGTCTCAGCCAGAGGGGTTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCAAACAAATTCAAGCTTCCACCTCAACATCAGGCACATCCTTCGATGAGGCACTTAAGAAGCTTACTGACAGCTCTTTACAGTTCCAGAAAGAGACCAGGAACGC
CATTCAGAATTTGGGAAACCAAATCACTCAGCTGGCAACCCAAGTCAGTAAGATGAACAACAAAAGCTCTGGTAAGCTACTTGCCCAACCTAAGGTAAATCCTAAGGCAA
ATGTGAATGTAGTGTGGAGTGCAGTGGCCACACCTGACTGTCCCCCTGGATTCCCCACTGTCTCTGATAAGTTTTCTATTTCTTCTGGAAAATTTTCTTCTTTAAATTTA
AATGATAAAAATTCTTTTTCACCTGTGAACTTCATTGACCCTGCTATTCCAAGTGTGCAGGATGATTTAGGTGTTTTTCGAAAAGTCAAGGTGAACATTCCATTGCTTGA
GGCAATCCTGAAAATCCCAGCCTATGCTGAGTTCCTAAAGACTTGGTATGAGGGTAAGAAAAAGTCATCAGGTAAAGAAGTTGTTAGTGAAAATGTAAGTGCCTTACTTT
CTAAGAACTTACCTGTTAAATGTTCTGATCCTCGCATGTTTACTTTACCGTATACTCCTAATCCTAATTACAATCCATCTTCTGCTACGATACTTCTTGGTTGTCCCTTC
ATGAAGACTGCCAAGATTGTCATTGATGTAGATGAAGGTGTTGTATCAGCTAGTCAATTAGGATTGACGCCATATGCGATTACCGCTTGTGTATTCTATCAATTATTTGA
GATAGCCACTAGGTTGAGGCAAGCCACTCTCTACGCGAGACGACCCTTGATAGGCCTAGGATACATGTTAAAGGGAAACGATATTAGATTTGAACCCAAAATTTCCAGAG
CGGAGACGAGATTTAGAAGGCAAGAAAGAAGGAGAAGAGTTAATAATCTCAGCGAGGTTCAAACACCAACCATGGCTGAAAGGACTTTAAGGCAGCTGGCAGCGCCAGAT
CTGAACCAGAAGCCACTCTGTATTACCTACCTCGAGACGACAGGAGAGGACCCGCACAAACATTTGCAGAAATTTCACATAATCTCCGATCAGTTGTTGATCCCGTATTT
TTACGAAGGGCTACTTCCTATGGACCGTAGTATGGTAGATGCAGCTAGTGGAGGAGCACTAGTCAACAAGACCCTTGTCGAGGCTAGACAACTACTTTCAAGTATGGCTG
AAAACTCTCAGCAATTTGGTACTAGAGGACCACCTGCAATTGTGCAAGCCTCCACAAAAAGTGAGGTTAATATCAAGGTGAATGGGGAAAGTGTTCATAGTAAGTGGGAG
AAGGAAACATGTCAAACGTATCCTGCGGTCTCCATCATTAGGTTGCACCGTGAGATTCCTATGCGCTGCCTGCGAGTCGCCCTGGGAGCGATCACTCTACGGAGGGCTTG
CGCATGGGATTCGGAACAACGCAAACTCCAGAAATGGATAGGGTTTCCTAGGGCCATTCCCAACATTAGCTCTTCCCTACTGTGGCATTGTTGGGGCCGCCCTCTGCAAT
CTGAAAATGATGGCAATATGTCGAAAAACGTATGTGCTTTGTCAAATGCCAAATTAAACGGTTTAGTACATTCGACTAAGAATACTTGCTTAAACGCTAAACACGTGGCA
AAGAATTTAGGTTTGGTCTCAACTGAGGAATGTCCTCAAACTAACTGCGTAGCCAAGAAAAACATGATCCACGACTCTAAATGTGATTTATTAGTCTTGGAAACGTGTTT
AGTGGAAAACGACGATTCTGCCTGGATACTTGATTCAGGAGAAGTCATCTCGGCTATTGCAGTGGGAGACATAAAATTATTCTTCACCAAGGAACGCTACATGATATTAG
ATAATGTGTACATAGTTCCGAAGATTAAAAGAAATTTGATTTCTTTATCTTGTTTATTAGAACAAGGTTACTCAATTTCTTTTTCTGTTAATGAAGCGTTCATAACTAAA
AGAGGCCATATCAACCTCAATAGGATTGGGAGATTGGTCAAGAGTGGACTTCTAAACAAGTTAGAAGACGACTCGTTGCCTCCTTGTGAATCATGCCTTGAGGGCAAGAT
GACAAAACGACCTTTTACCGGAAAAGGTTATTGTGCCAAAGAGCCCTTAGAGCTCATTCATTCGGACCTCTGTGGACCAATGAACATTAGAGCTCGAGGAGGGTACGAAT
ATTTCATCAGTTTCATTGATGATTATTCGAGGTATGGCTATCTCTACCTGATGCATCATAAGTCCGAAGCTTTTGAAAAGTTCAAGGAGTATAAGGCCGAGGTTGAGAAC
CTATTAGGTAAGACGATAAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGACTTAAGTTTTCAAGACTATTTGATAGAGCATGGAATTACGTCCCAACTCTCAGC
CCCTGGTACACCTCAGCAAAACGGTGTATCAGAAAGGAGAAACAGAACCTTGTTGGACATGGTTCGTTCGATGATGAGTTATGCTCAGTTGCCTAACTCGTTTTGGGGTT
ATGCAGTAGAGACTGCAGTTTATATCTTGAACGTAGTTCCCTCAAAGAGTGTTTCTGAAACACCATTTGAGCTCTGGAAAGGTCGCTATCCCAAAGAAACGAGAGGTGGA
TACTTCTTCGATCCAGAGGAAAATAAAGTACTTGTATCGACAAATGCTCCATTCCTAGAAGAAGACCACCTCAGAGATCACAAGCCACGTAGTAAAGTCGTTTTAAGTGA
ACTATCTAATGAAGCTACAGAAACATCAACAAGAGTTGTTGATGATACTGGCACCTCGAGTCAATCACGTCCTTCTCAAGAAATAAGAGAACCTCGACGTAGTGGGAGGG
TTGTTAGACAGCCTGATCGCTATTTGGGTTTAACTGAAACTCCAGTCGTCATACCTGATGACGGGATCGAGGATCCATTAACCTTTAAACAGGCAATGGATGACGTTGAC
AAGAACAAGTGGATTAAAGCCATGGACTTAGAGATCGAGTCCATGCATTTTAATTCAGTATGGGATCTTGTAGATCAACCTGAAGGGGTTAGACCCATAGGGTGCAAATG
GATCTACAAGAGAAAACGAGACGCTGCTGGAAAGGTACAGACCTTTAAGGCACGACTTGTGGCAAAGGGTTTTACCCAGCGAGAGGGAATTGACTATGAAGAAACCTTCT
CCCCTATTGCCATGTTAAAGTCGATCCGGATACTTTTATCCATTGCCACGTTTTACGATTATGAAATATGGCAAATGGATGTCAAGACTGTCTTTCTGAACGGTAATCTT
AAGAAAGTATCTACATGTCTCAGCCAGAGGGGTTCATAG
Protein sequenceShow/hide protein sequence
MFKQIQASTSTSGTSFDEALKKLTDSSLQFQKETRNAIQNLGNQITQLATQVSKMNNKSSGKLLAQPKVNPKANVNVVWSAVATPDCPPGFPTVSDKFSISSGKFSSLNL
NDKNSFSPVNFIDPAIPSVQDDLGVFRKVKVNIPLLEAILKIPAYAEFLKTWYEGKKKSSGKEVVSENVSALLSKNLPVKCSDPRMFTLPYTPNPNYNPSSATILLGCPF
MKTAKIVIDVDEGVVSASQLGLTPYAITACVFYQLFEIATRLRQATLYARRPLIGLGYMLKGNDIRFEPKISRAETRFRRQERRRRVNNLSEVQTPTMAERTLRQLAAPD
LNQKPLCITYLETTGEDPHKHLQKFHIISDQLLIPYFYEGLLPMDRSMVDAASGGALVNKTLVEARQLLSSMAENSQQFGTRGPPAIVQASTKSEVNIKVNGESVHSKWE
KETCQTYPAVSIIRLHREIPMRCLRVALGAITLRRACAWDSEQRKLQKWIGFPRAIPNISSSLLWHCWGRPLQSENDGNMSKNVCALSNAKLNGLVHSTKNTCLNAKHVA
KNLGLVSTEECPQTNCVAKKNMIHDSKCDLLVLETCLVENDDSAWILDSGEVISAIAVGDIKLFFTKERYMILDNVYIVPKIKRNLISLSCLLEQGYSISFSVNEAFITK
RGHINLNRIGRLVKSGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYCAKEPLELIHSDLCGPMNIRARGGYEYFISFIDDYSRYGYLYLMHHKSEAFEKFKEYKAEVEN
LLGKTIKTLRSDRGGEYMDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFWGYAVETAVYILNVVPSKSVSETPFELWKGRYPKETRGG
YFFDPEENKVLVSTNAPFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDDTGTSSQSRPSQEIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVD
KNKWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPIAMLKSIRILLSIATFYDYEIWQMDVKTVFLNGNL
KKVSTCLSQRGS