; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026858 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026858
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:42619519..42626390
RNA-Seq ExpressionLag0026858
SyntenyLag0026858
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KZV17946.1 hypothetical protein F511_10775 [Dorcoceras hygrometricum]2.1e-17837.81Show/hide
Query:  MIIRLTVKNKMGFVDGTLLQPTGN--LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYF
        M   LT KNK+ F+DG+ L+P  +  L  +W+ CN +V +WILNS+S E++ S+ +  +A EIW DL++ + + N PR+FQ+K  ++ L Q    + +Y+
Subjt:  MIIRLTVKNKMGFVDGTLLQPTGN--LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYF

Query:  AKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATALLVKT
         K+++LW+EL  ++P      C CG +KE + Y   E  M FLMGLNES+AQIR Q+LLM+P PTI + FSLV QE  QR S+            L++  
Subjt:  AKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATALLVKT

Query:  NSSSNTSNSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGY-------RNQRGSSTKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQSHL-
         ++      S NS  T   KV   C+HC++  HTVD+CYK+HGYPPG+        +++   T+S +    V  T+ND    L  E C+ ++  L S L 
Subjt:  NSSSNTSNSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGY-------RNQRGSSTKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQSHL-

Query:  --NKVKSGSDSVESSSTTHVAGTHSDLSSVDL--QNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFR
          N           SS +   GT+S  +S  +   + WI+D+GA+ HICCS   FVS +  ++  V+LPN+  + V H+G+V ++S+I LHNV+F+P F+
Subjt:  --NKVKSGSDSVESSSTTHVAGTHSDLSSVDL--QNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFR

Query:  FNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQ
        FNL+SIS+LT  +P ++ F  +SC IQ     + IG  +    L++L      +E  +C +          +   +WH RLGH+    L +L   L    
Subjt:  FNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQ

Query:  VKSN-LSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFR
        + ++ LS C +C L+KQ+RL F SNN++    FDL+H D W P++     G+KYFLTIV+DHSRYTWV L+++KS+ + I P F + I  QFG SIKS R
Subjt:  VKSN-LSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFR

Query:  SDNAPELWFHDFSSPKELI---------------------------TSFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTF
        SDNAPEL F +F   + ++                                  +P   W             TP+ +L+ +TPF  ++ K   Y  LR F
Subjt:  SDNAPELWFHDFSSPKELI---------------------------TSFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTF

Query:  GCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDA
        GCL + STL   R+KF PRA  ++F+GYPP  KGYKL +++  +V +SRDVIFHE VF F   +                           SP+  C D 
Subjt:  GCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDA

Query:  QINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSKFPLHKVLS
         IN+ ++   T      + +T+I    P +T++                              SR  ++PS+L DYHC      +   S +  P+  VLS
Subjt:  QINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSKFPLHKVLS

Query:  YDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIE
           LS  ++  V+++SS+ +P  Y+QAV    W +AM  EL+A+E NNTWS+VSLP G H+VGCRW+YK K++ DG++ERYKARLVAKGYTQQEG++Y E
Subjt:  YDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIE

Query:  TFSPVAKVVTVKVLLTLAVSH
        TFSPVAK+VTV+ L+ LA  H
Subjt:  TFSPVAKVVTVKVLLTLAVSH

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]1.0e-18539.28Show/hide
Query:  AMIIRLTVKNKMGFVDGTLLQPTGN--LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTY
        AMI+ LT KNK+GF+D ++ +P     L  SWI CN++V +WILNS++  ++ S+ + ++A EIW DL + +   N PRI+Q+K  +S L Q    V++Y
Subjt:  AMIIRLTVKNKMGFVDGTLLQPTGN--LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTY

Query:  FAKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATALLVK
        + KL++LW+EL  Y+P+ +   CTCG ++E   Y   E VM FLMGLN+S+AQ+R Q+L++EP PTI + F+LV QE  QR+            + +L  
Subjt:  FAKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATALLVK

Query:  TNSSSNTSNSSRNSANTT-KKKVRPFCTHCNIQGHTVDRCYKIHGYPPGY-----RNQRGSS---TKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQS
         NSS+NT+ S R S N+   +  R  C+HC+ + HTVD+CYK+HGYPPG+     +  +GS+     S +S T       D    L   QC+ ++  L S
Subjt:  TNSSSNTSNSSRNSANTT-KKKVRPFCTHCNIQGHTVDRCYKIHGYPPGY-----RNQRGSS---TKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQS

Query:  HL----NKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPS
         L    N +         S  T +    S + ++  ++ WI+D+GA+ HICCS  +F S + + +  V LPN   + V   G V + S+++L NV+++P 
Subjt:  HL----NKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPS

Query:  FRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSV
        F+FNL+S+S+LT N    + F+ DSC IQD   +RMIG  K    L++LQ  D  +   +CN        T  +N  +WH R+GH S   L  LK +L++
Subjt:  FRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSV

Query:  KQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSF
        +     ++ C  C L+KQRRL   S NN+SA +F+L+H DTW P+   +  G+++F TIV+DHSRYTWV+++++KSD L+I P F + + TQFG ++KS 
Subjt:  KQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSF

Query:  RSDNAPELWFHDFSSPKELITSF----------------------------LVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALR
        RSDNAPEL F DF + K  IT +                               ++P   W             TPS IL  +TPF  L+ K   Y  L+
Subjt:  RSDNAPELWFHDFSSPKELITSF----------------------------LVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALR

Query:  TFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACT
         FGCL +ASTL + R KF PRAI  VF+GYPP  KGYKL ++E  ++ +SRDVIFHE  F +         T P                          
Subjt:  TFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACT

Query:  DAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKS-KFPLHK
                 ++ +D    + PS+ I  S P D          A Q S                R SR    PS+LRDYHC    ++S P S S   P+H 
Subjt:  DAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKS-KFPLHK

Query:  VLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLD
        +++Y  LS   R FV ++SS+ EP  + QAV    W++AM  EL+A+E N+TWS+VSLP G  +VGCRW+YK K+  DG+++RYKARLVAKGYTQQEGLD
Subjt:  VLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLD

Query:  YIETFSPVAKVVTVKVLLTLA
        Y+ETFSPVAK+VTV+ LL LA
Subjt:  YIETFSPVAKVVTVKVLLTLA

KZV50756.1 hypothetical protein F511_19388 [Dorcoceras hygrometricum]1.7e-17736.97Show/hide
Query:  MIIRLTVKNKMGFVDGTLLQPTGN--LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYF
        M++ LT KNK+GFVD ++ QP  +  L  SW  CN++V +WILNS++ +++ S+ +  +ARE+W+DL   +   N PR++Q+K  ++ L Q    +++Y+
Subjt:  MIIRLTVKNKMGFVDGTLLQPTGN--LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYF

Query:  AKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATALLVKT
         KL+ LW+EL  Y+P+     C CG +KE + Y   E VM FL GLNES+AQIR Q+L+MEP P I   F+LV QE  QR S+    A +     + +  
Subjt:  AKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATALLVKT

Query:  NSSSNTSNSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQRGSSTKSKTSTTAVNVTLND----PLSGLNAEQCQDILTLLQS-----H
         +S+  ++++      + K  +  C+HC+ + HTVD+CYK+HGYPPG+   +    +S      ++  + D    P   L   QC+ ++  L S     H
Subjt:  NSSSNTSNSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQRGSSTKSKTSTTAVNVTLND----PLSGLNAEQCQDILTLLQS-----H

Query:  LNKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNL
         ++V+       +S  T +  T S  SS+     W+LD+GA+ HICCS  +F S K V++  + LPN   + V    +V + +D+ILH+V+++P F+FNL
Subjt:  LNKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNL

Query:  ISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQVKS
        +SIS+LT NL   + F+ DSC IQD    + IG  K    L++L    ++    +CN +SV K         + H R+GH S   L  L  +L       
Subjt:  ISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQVKS

Query:  NLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNA
        +++ C VC ++KQ+RL F+S+N  +AH F+L+H D W P+ + +  GY++FLTIV+DH+ +TWV+++R+KS+  +I+P+F + + TQFG  IKSFRSDNA
Subjt:  NLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNA

Query:  PELWFHDFSSPKELITSF---------------------------LVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLA
        PEL F +  S   ++ ++                              +VP   W             TPS  L+ +TPF  L+ K   Y  L+ FGCL 
Subjt:  PELWFHDFSSPKELITSF---------------------------LVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLA

Query:  FASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSF-HTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQIN
        +ASTL + R K  PRAI  VF GYPP  +GYKL +++  ++++SRDVIFHE  F F +T       +D F D +LP+                   +Q+N
Subjt:  FASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSF-HTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQIN

Query:  EPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSKFPLHKVLSYDA
                                  ++  +PD       P +  +  Q+        R  R ++ P +L+DYHC +    S P + +  PL   ++Y  
Subjt:  EPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSKFPLHKVLSYDA

Query:  LSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFS
        LS   RN V ++SS+ EP  + QAV    W++AM  EL+A+E N+TWS+VSLP+G   VGCRW+YK K+  DG+++RYKARLVAKGYTQQEGLDY+ETFS
Subjt:  LSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFS

Query:  PVAKVVTVKVLLTLAVS
        PVAK+VTV+ LL LA +
Subjt:  PVAKVVTVKVLLTLAVS

RVW82526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.0e-18140.18Show/hide
Query:  AMIIRLTVKNKMGFVDGTLLQP--TGNLRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTY
        +M+  L  KNK+GF+DGT+ +P  T  L   W  CN++V +W+ NS+  E++ S+ + E+A EIW DL + + + + PRIF+LK +I    Q    V TY
Subjt:  AMIIRLTVKNKMGFVDGTLLQP--TGNLRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTY

Query:  FAKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVT--PPPATLPAATALL
        + +LKSLW+EL  ++   +   C CGG++  +   Q E VM FL+GLNESFA I+ Q+LLMEP P + + FSLV QE  QR+  T   P  T P ++   
Subjt:  FAKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVT--PPPATLPAATALL

Query:  VKTNSSSNTSNSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQ-----RGSSTKS--KTSTTAVNVTLND---------PLSGLNAEQC
          + +SS T NSSR+      +K RP CTHCNI GHTVDRCYKIHGY PG+RN+      GS        S     +TL D         PL+     Q 
Subjt:  VKTNSSSNTSNSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQ-----RGSSTKS--KTSTTAVNVTLND---------PLSGLNAEQC

Query:  QDILTLLQSHLNKVKSGSDSVESSSTTHVAG--THSDLSSVDLQNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILH
          +L+L  S  +    G  +    S ++  G  + S  SS    +IWILDSGA+ H+C +  +F S+   S+ TV+LP   ++ +  +G +H++  ++L 
Subjt:  QDILTLLQSHLNKVKSGSDSVESSSTTHVAG--THSDLSSVDLQNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILH

Query:  NVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDV
        +V++IP+F+FNLISISALT        F    C IQD    ++IG  +    L+LL   D SV +++ +S+ V   +T +    +WH RL H S+  L V
Subjt:  NVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDV

Query:  LKGLLSVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQF
        LK  L ++   +    C +CPLAKQ+RL F  +NN+S+  FDLIHCD W P+HIPTH G++YFLTIV+D +R TWV L+R KSD  TI P FF  +KT+F
Subjt:  LKGLLSVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQF

Query:  GTSIKSFRSDNAPEL-------------WFHDFSSPKE--------------LITSFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNA
        G +IK+ RSDNAPEL             +F    +P++                  +   N+P   W              PS +LN +TPF  L+ K+ 
Subjt:  GTSIKSFRSDNAPEL-------------WFHDFSSPKE--------------LITSFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNA

Query:  DYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDV-TDPFPDLVLPISPNFSGIPVVE
         Y  L++FGCL ++STL + R KF PRA+P VF+GYP   KGYK+ D+E  ++ VSR+V F E VF F        V +D F   VLP+ P       V 
Subjt:  DYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDV-TDPFPDLVLPISPNFSGIPVVE

Query:  SPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCAL-SKTMSLPESK
        +P                         PS D   S P +    PD  F    P T S             R SR  + P YL DYHC L S T     S 
Subjt:  SPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCAL-SKTMSLPESK

Query:  S-KFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKG
        S  +PL  V+SY+ LS  FR F +S+S++ EP  Y +AV    WQ AM  ELQA+E+NNTWS+ +LP G  +VGC+W+Y+VKY VDG++ERYKARLVAKG
Subjt:  S-KFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKG

Query:  YTQQEGLDYIETF
        +TQQEG+D+   F
Subjt:  YTQQEGLDYIETF

XP_010526680.1 PREDICTED: uncharacterized protein LOC104804180 isoform X2 [Tarenaya hassleriana]1.2e-17036.91Show/hide
Query:  STAMIIRLTVKNKMGFVDGTLLQPTGNLR--RSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVT
        S A+   L  KNK+GF+ GT+ QP  +     SW+ CN +V  W+ NS+  ++   +++ E A EIW+ LQ  + + N  +++ ++H+I +L Q   ++ 
Subjt:  STAMIIRLTVKNKMGFVDGTLLQPTGNLR--RSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVT

Query:  TYFAKLKSLWNELSAYR--PSCSCGQCTCGGVK-----ELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATL
        +YF KL +LW EL  +   P CSC  CTCGG K     +    F+   V+ FLM LN+SF+  R Q+L+ +P P + RA++LVAQE +Q+ +V    + L
Subjt:  TYFAKLKSLWNELSAYR--PSCSCGQCTCGGVK-----ELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATL

Query:  PAATALLVKTNSSSN-----TSNSSRNS----ANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYR-----NQRGSSTKSKT-STTAVNVTLNDPLSG
        P A A    TNSS +     +SN    S     +T+  + RP CTHC + GH V RC+++HGYPPG++     N R    + K+ S       ++  LS 
Subjt:  PAATALLVKTNSSSN-----TSNSSRNS----ANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYR-----NQRGSSTKSKT-STTAVNVTLNDPLSG

Query:  LNAEQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSD
        L ++        L S+       S  +  S  T+   T S  +      +WILD+GAS H+CC+  LF  +  +  ++VSLPN   L V   G V ++S 
Subjt:  LNAEQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSD

Query:  IILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ-----TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLG
        I L +V+FIP+F +NL+S+S LT      + F  DS +IQD     MIGK K    L++L+     +  +S+    C +LS       +T   +WH RLG
Subjt:  IILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ-----TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLG

Query:  HLSDKHLDVLKGLLSVKQVKSNLS--PCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIV
        H SD  +  +     +K   S  S   C VCPLAKQRRL+F  +++VS   F+L+H D W P    +  G+++FL+IV+D+SR TWV+L+++KSD L   
Subjt:  HLSDKHLDVLKGLLSVKQVKSNLS--PCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIV

Query:  PIFFQYIKTQFGTSIKSFRSDNAPELWF----------HDFSSPKELITSFLV-----------------WNVPSKIW-------------TPSRILNWQ
        P F  +++ QF  SIK  RSDNAPEL F          H FS P     + +V                  NVP   W             TPS +L  +
Subjt:  PIFFQYIKTQFGTSIKSFRSDNAPELWF----------HDFSSPKELITSFLV-----------------WNVPSKIW-------------TPSRILNWQ

Query:  TPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPIS
        TPF  L   +  Y  LR FGCL + STL   R KF+PRA+  VF+GYP  +KGYK+ D+ +  V++SR+V+FHE  F F +        DPFP  V P  
Subjt:  TPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPIS

Query:  PNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALS
          +  I                 P +++ + A       + +    PTD +             T S+ F T  S A   R  R  K P+YL DYHC L 
Subjt:  PNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALS

Query:  KTMSLPESK--SKFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTME
           S P     + +PL   L+YD LS  +R F L++++  EPQ Y QA     W++AM  EL+A+   NTWS+ +LP G ++VGC+W++K KY  DG++E
Subjt:  KTMSLPESK--SKFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTME

Query:  RYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN
        R+KARLVAKGYTQ EG+D+ ETFSPVAK+ TV+VLL LA  +N
Subjt:  RYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN

TrEMBL top hitse value%identityAlignment
A0A2N9EHN7 Integrase catalytic domain-containing protein5.3e-18838.41Show/hide
Query:  AMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTT
        +M   L+ KNK+GFV+G++LQP      L   W  CN +V +WI N LS ++ A+V +  +A+E+W DLQQ Y + N  R+  LK  I++L QD   V+ 
Subjt:  AMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTT

Query:  YFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQR-ASVTPPPATLP--A
        YF +LK LW+E   YR  P C+CG +C CG  + L+ Y   ++V +FLMGLN+SFA +R Q+LLMEP P I + FSL+  + +QR A + P P   P   
Subjt:  YFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQR-ASVTPPPATLP--A

Query:  ATALLVKTNSSSNTS----------------NSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQ-RGSSTKSKTSTTAV------NVTL
        +TALL +  +  NT+                NS +      K K    C+HC  +GHT D+CYK+HGYPPG+R++ R  +  S+ S++AV      N   
Subjt:  ATALLVKTNSSSNTS----------------NSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQ-RGSSTKSKTSTTAV------NVTL

Query:  NDPLSGLNAEQCQDILTLLQSHLNKVKSGSD--------SVESSSTT----HVAGTHSDLSSVDLQNI-------------------WILDSGASAHICC
        + P     + QCQ +L +L +   +  S SD        S+ S S T    ++AG  + LS+    N+                   W++D+GA+ H+  
Subjt:  NDPLSGLNAEQCQDILTLLQSHLNKVKSGSD--------SVESSSTT----HVAGTHSDLSSVDLQNI-------------------WILDSGASAHICC

Query:  SKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ--
        + + + ++  V  ++V+LPN   + V H+G+V I   ++L +V+ +PSF FNLIS+S LT++L   I F+   C IQD    RMIG  K   GL+LL   
Subjt:  SKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ--

Query:  -----------TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQVKS-NLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIH
                   + D  + ++L +  S+   + D   I VWH R GH S   +  L  ++    + S + S C VCPLAKQ+RL F + N++S + FDL+H
Subjt:  -----------TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQVKS-NLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIH

Query:  CDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELITSFL--------------
         D W PYH+PT  GY+YFLT+V+D +R TW++LMR+KSD   ++  F   I+TQF T IK  RSDN  E    +F + K +I                  
Subjt:  CDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELITSFL--------------

Query:  ---VWNV----------PSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKL
           + NV          P + W              P  IL+ ++PF  L  K   Y  L+ FGCL FASTL +HR+KF PRA   VF+GYP  +KGYKL
Subjt:  ---VWNV----------PSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKL

Query:  YDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDV
         D+   KV +SRDV+FHE +F F T T   D T        PIS     IP                    +C+   + I P + I  S P  ++    +
Subjt:  YDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDV

Query:  QFYAAQPSTQSTAFQTQPSL-------------ADPRRFSRAVKQPSYLRDYHCALS----KTMSLP--ESKSKFPLHKVLSYDALSKQFRNFVLSVSSV
         F    P    T   + PSL             +  RR +R  K P+YL+DYHC L+     T S P   S + +PL   LSYD LS   RNF LSV+++
Subjt:  QFYAAQPSTQSTAFQTQPSL-------------ADPRRFSRAVKQPSYLRDYHCALS----KTMSLP--ESKSKFPLHKVLSYDALSKQFRNFVLSVSSV

Query:  YEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA
         EP  +HQA  + HWQEAM  EL A+EANNTW++  LP G H +GC+W+YKVK K DG++ERYKARLVAKGYTQQEGLDY ETFSPVAK  TV+ LL +A
Subjt:  YEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA

Query:  VSHN
         + N
Subjt:  VSHN

A0A2N9ETL8 Uncharacterized protein5.6e-19038.46Show/hide
Query:  TAMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVT
        T+M   L+ KNK+GFV+GT+LQP      +   W  CN +V +WI N LS ++ A+V +A +A+E+W DLQQ Y + N  R+  LK  I++L Q+  SV+
Subjt:  TAMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVT

Query:  TYFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQR-ASVTPPPATLPA-
         YF  LK LW+E   YR  PSC+CG +C CG  K L+ Y   ++V +FLMGLNE+FA +R Q+LLMEP P I + FSL+    +Q+ A + P P   P+ 
Subjt:  TYFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQR-ASVTPPPATLPA-

Query:  -ATALLVKTNSSSNTSNSSRNSANTT-------------KKKVRPFCTHCNIQGHTVDRCYKIHGYPPGY----RNQRGSSTKSKTSTTAVNVTLNDPLS
         +TAL  + ++  N + +S NS +                +K +P C+HC  +GH  ++CYK+HGYPPG+    RN   ++  S   T A N   N    
Subjt:  -ATALLVKTNSSSNTSNSSRNSANTT-------------KKKVRPFCTHCNIQGHTVDRCYKIHGYPPGY----RNQRGSSTKSKTSTTAVNVTLNDPLS

Query:  GLNAEQCQDILTLLQSHLNKVKSGSDSVES-----------------------SSTTHVAGTHSDLSSVDLQNI-------------------WILDSGA
           A QCQ  L +L +   K  S SDS  S                          +++AG    LS+    N+                   W++D+GA
Subjt:  GLNAEQCQDILTLLQSHLNKVKSGSDSVES-----------------------SSTTHVAGTHSDLSSVDLQNI-------------------WILDSGA

Query:  SAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGL
        + H+  +   F ++K V  +TV+LPN   ++V H+G++ + + ++L +V+ +PSF FNLIS+S LT++L   I F+   C IQD    RMIG  +   GL
Subjt:  SAHICCSKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGL

Query:  HLLQTGDVSVEQNLCNSLSV------------NKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLL-SVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHM
        ++L   D+S    L  +++V              KH+ S +   WH RLGH S   ++ L  ++  +     +   C VCPLAKQ+RL F +NN+VS+  
Subjt:  HLLQTGDVSVEQNLCNSLSV------------NKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLL-SVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHM

Query:  FDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELI-------------
        FD++H D W PYH+PT  GYKYFLT+V+D +R TWV+LM++KS+   ++  F   I+TQFG+ +K  RSDN  E    DF + + +I             
Subjt:  FDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELI-------------

Query:  --------------TSFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDM
                      +     N+P K W              PS IL+ ++P+ KL  K   Y  LR FGCL FASTL  HR+KF PRA P VF+GYP  +
Subjt:  --------------TSFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDM

Query:  KGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTV
        KGYKL D+ N  VI+SRDVIFHE VF F   T   D + PF + +    PNFS IP+         D+ I+ P +   + ++     ST I  S   ++ 
Subjt:  KGYKLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTV

Query:  VLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSK------FPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQ
         +P +       S  S            RR +R  K P+YL+DYHC ++++     S S       +PL   LSYD LS   R F LS++++ EP  + Q
Subjt:  VLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSK------FPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQ

Query:  AVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA
        A  H HW++AM  EL+A+EANNTWS+  LP G H +GC+W+YKVK K DG++ERYKARLVAKGYTQQEGLDY ETFSPVAK  TV+ LL +A
Subjt:  AVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA

A0A2N9G1Y1 Integrase catalytic domain-containing protein4.4e-18738.41Show/hide
Query:  AMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTT
        +M   L+ KNK+GFV+G++LQP      L   W  CN +V +WI N LS ++ A+V +  +A+E+W DLQQ Y + N  R+  LK  I++L QD   V+ 
Subjt:  AMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTT

Query:  YFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQR-ASVTPPPATLP--A
        YF +LK LW+E   YR  P C+CG +C CG  + L+ Y   ++V +FLMGLN+SFA +R Q+LLMEP P I + FSL+  + +QR A + P P   P   
Subjt:  YFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQR-ASVTPPPATLP--A

Query:  ATALLVKTNSSSNTS----------------NSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQ-RGSSTKSKTSTTAV------NVTL
        +TALL +  +  NT+                NS +      K K    C+HC  +GHT D+CYK+HGYPPG+R++ R  +  S+ S++AV      N   
Subjt:  ATALLVKTNSSSNTS----------------NSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQ-RGSSTKSKTSTTAV------NVTL

Query:  NDPLSGLNAEQCQDILTLLQSHLNKVKSGSD--------SVESSSTT----HVAGTHSDLSSVDLQNI-------------------WILDSGASAHICC
        + P     + QCQ +L +L +   +  S SD        S+ S S T    ++AG  + LS+    N+                   W++D+GA+ H+  
Subjt:  NDPLSGLNAEQCQDILTLLQSHLNKVKSGSD--------SVESSSTT----HVAGTHSDLSSVDLQNI-------------------WILDSGASAHICC

Query:  SKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ--
        + + + ++  V  ++V+LPN   + V H+G+V I   ++L +V+ +PSF FNLIS+S LT++L   I F+   C IQD    RMIG  K   GL+LL   
Subjt:  SKELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ--

Query:  -----------TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQVKS-NLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIH
                   + D  + ++L +  S+   + D   I VWH R GH S   +  L  ++    + S + S C VCPLAKQ+RL F + N++S + FDL+H
Subjt:  -----------TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQVKS-NLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIH

Query:  CDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELITSFL--------------
         D W PYH+PT  GY+YFLT+V+D +R TW++LMR+KSD   ++  F   I+TQF T IK  RSDN  E    +F + K +I                  
Subjt:  CDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELITSFL--------------

Query:  ---VWNV----------PSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKL
           + NV          P + W              P  IL+ ++PF  L  K   Y  L+ FGCL FASTL +HR+KF PRA   VF+GYP  +KGYKL
Subjt:  ---VWNV----------PSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKL

Query:  YDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDV
         D+   KV +SRDV+FHE +F F T T   D T        PIS     IP                    +C+   + I P + I  S P  ++    +
Subjt:  YDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDV

Query:  QFYAAQPSTQSTAFQTQPSL------------ADPRRFSRAVKQP-SYLRDYHCALS----KTMSLPESKS--KFPLHKVLSYDALSKQFRNFVLSVSSV
         F    P    T   + PSL            + P R S  V +P +YL+DYHC L+     T S P + S   +PL   LSYD LS   RNF LSV+++
Subjt:  QFYAAQPSTQSTAFQTQPSL------------ADPRRFSRAVKQP-SYLRDYHCALS----KTMSLPESKS--KFPLHKVLSYDALSKQFRNFVLSVSSV

Query:  YEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA
         EP  +HQA  + HWQEAM  EL A+EANNTW++  LP G H +GC+W+YKVK K DG++ERYKARLVAKGYTQQEGLDY ETFSPVAK  TV+ LL +A
Subjt:  YEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA

Query:  VSHN
         + N
Subjt:  VSHN

A0A2N9H2Y3 Integrase catalytic domain-containing protein9.0e-18838.4Show/hide
Query:  AMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTT
        +M   L+ KNK+GFV+G +LQP      L   W  CN +V +WI N LS ++ A+V +  +A+E+W DLQQ Y + N  R+  LK  I++L QD   V+ 
Subjt:  AMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTT

Query:  YFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATA
        YF +LK LW+E   YR  P C+CG +C CG  + L+ Y   ++V +FLMGLN+SFA +R Q+LLMEP P I + FSL+  + +QR +   P  T+  +TA
Subjt:  YFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATA

Query:  LLVKTNSSSNTS----------------NSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQ-RGSSTKSKTSTTAV-------NVTLND
        LL +  +  NT+                N  ++     K K    C+HC  +GHT D+CYK+HGYPPG+R++ R  +  ++ S++AV       N     
Subjt:  LLVKTNSSSNTS----------------NSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQ-RGSSTKSKTSTTAV-------NVTLND

Query:  PLSGLNAEQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSD---------LSSVDLQNI-------------------WILDSGASAHICCSKELF
         L+ ++  QCQ +L +L +   +    SDS    + T ++ T S          LS+    N+                   W++D+GA  H+  + + +
Subjt:  PLSGLNAEQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSD---------LSSVDLQNI-------------------WILDSGASAHICCSKELF

Query:  VSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ-------
         +   V  ++V+LPN   + V H+G+V +   ++L NV+ +PSF FNLIS+S LT++L   I F+   C IQD    RMIG  +   GL+LL        
Subjt:  VSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ-------

Query:  -----TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQVKSN-LSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDP
             T D S+ ++L +  S+   + D   I VWH RLGH S   +  L  ++      SN  S C VCPLAKQR+L F +NN++S   FDL+H D W P
Subjt:  -----TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLSVKQVKSN-LSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDP

Query:  YHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELITSFL-----------------VWN
        YHIPT  GY+YFLT+V+D +R TW++LMR+KSD  T++  F   I TQF T IK  RSDN  E    DF + K +I                     + N
Subjt:  YHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELITSFL-----------------VWN

Query:  V----------PSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENK
        V          P K W              P  IL+ ++PF  L  K   Y  L+ FGCL FASTL  HR+KF PRA    F+GYP  +KGYKL ++   
Subjt:  V----------PSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENK

Query:  KVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIP----VVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQF
        KV++SRDV+FHE +F F   T   D +        P+SP    IP    + + P      A    P   +                 +P DT  L D   
Subjt:  KVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIP----VVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQF

Query:  YAAQPSTQSTAFQTQPSLADP-RRFSRAVKQPSYLRDYHCALSKTMS------LPESKSKFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLH
         ++             S++ P RR +R  K P+YL+DYHC L+  +       L  S   +PL   LSYD LS   RNF LSV+++ EP F+HQA    H
Subjt:  YAAQPSTQSTAFQTQPSLADP-RRFSRAVKQPSYLRDYHCALSKTMS------LPESKSKFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLH

Query:  WQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA
        WQEAM  EL A+EANNTW++  LP+G H +GC+W+YKVK K DG++ERYKARLVAKGYTQQEGLDY ETFSPVAK  TV+ LL +A
Subjt:  WQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA

A0A2N9IZK3 Uncharacterized protein3.6e-18938.72Show/hide
Query:  TAMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVT
        T+M   L+VKNK+GFV+GT+LQP      +   W  CN +V +WI N LS ++ A+V +A +A+E+W DLQQ Y + N  R+  LK  I++L Q+  SV+
Subjt:  TAMIIRLTVKNKMGFVDGTLLQPTGN---LRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVT

Query:  TYFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQR-ASVTPPPATLPAA
         YF  LK LW+E   YR  PSC+CG +C CG  K L+ Y   ++V +FLMGLNE+FA +R Q+LLMEP P I + FSL+    +Q+ A + P P      
Subjt:  TYFAKLKSLWNELSAYR--PSCSCG-QCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQR-ASVTPPPATLPAA

Query:  TALLVKTNSSSNTSNSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGY----RNQRGSSTKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQ
                       SS +S     +K +P C+HC  +GH  ++CYK+HGYPPG+    RN   ++  S   T A N   N       A QCQ  L +L 
Subjt:  TALLVKTNSSSNTSNSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIHGYPPGY----RNQRGSSTKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQ

Query:  SHLNKVKSGSDSVES-----------------------SSTTHVAGTHSDLSSVDLQNI-------------------WILDSGASAHICCSKELFVSLK
        +   K  S SDS  S                          +++AG    LS+    N+                   W++D+GA+ H+  +   F ++K
Subjt:  SHLNKVKSGSDSVES-----------------------SSTTHVAGTHSDLSSVDLQNI-------------------WILDSGASAHICCSKELFVSLK

Query:  KVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLC
         V  +TV+LPN   ++V H+G++ + + ++L +V+ +PSF FNLIS+S LT++L   I F+   C IQD    RMIG  +   GL++L   D+S    L 
Subjt:  KVSAMTVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLC

Query:  NSLSV------------NKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLL-SVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIP
         +++V              KH+ S +   WH RLGH S   ++ L  ++  +     +   C VCPLAKQ+RL F +NN+VS+  FD++H D W PYH+P
Subjt:  NSLSV------------NKKHTDSTNIAVWHDRLGHLSDKHLDVLKGLL-SVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIP

Query:  THSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELI---------------------------T
        T  GYKYFLT+V+D +R TWV+LM++KS+   ++  F   I+TQFG+ +K  RSDN  E    DF + + +I                           +
Subjt:  THSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELI---------------------------T

Query:  SFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIV
             N+P K W              PS IL+ ++P+ KL  K   Y  LR FGCL FASTL  HR+KF PRA P VF+GYP  +KGYKL D+ N  VI+
Subjt:  SFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIV

Query:  SRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQ
        SRDVIFHE VF F   T   D + PF + +    PNFS IP+         D+ I+ P +   + ++     ST I  S   ++  +P +       S  
Subjt:  SRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQ

Query:  STAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSK------FPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTEL
        S            RR +R  K P+YL+DYHC ++++     S S       +PL   LSYD LS   R F LSV+++ EP  + QA  H HW++AM  EL
Subjt:  STAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSK------FPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTEL

Query:  QAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA
        +A+EANNTWS+  LP G H +GC+W+YKVK K DG++ERYKARLVAKGYTQQEGLDY ETFSPVAK  TV+ LL +A
Subjt:  QAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.5e-3022.06Show/hide
Query:  SWIICNTVVTAWILNSLSNEVSASVNFAES---AREIWLDLQQWYKRKNRPRIFQLKHEISNL-VQDQQSVTTYFAKLKSLWNELSAYRPSCSCGQCTCG
        SW        + I+  LS+   + +NFA S   AR+I  +L   Y+RK+      L+  + +L +  + S+ ++F     L +EL A             
Subjt:  SWIICNTVVTAWILNSLSNEVSASVNFAES---AREIWLDLQQWYKRKNRPRIFQLKHEISNL-VQDQQSVTTYFAKLKSLWNELSAYRPSCSCGQCTCG

Query:  GVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAF---SLVAQEVEQRASVTPPPATLPAATALLVKTNSSSNTSNSSRNSANTTKK---
        G K      + + +   L+ L   +  I T +  +  E  +  AF    L+ QE++ +        T       +V  N+++  +N  +N     KK   
Subjt:  GVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAF---SLVAQEVEQRASVTPPPATLPAATALLVKTNSSSNTSNSSRNSANTTKK---

Query:  ---KVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQRGSSTKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSD
           K +  C HC  +GH    C+        Y+                   LN                      NK K     V+++++  +A    +
Subjt:  ---KVRPFCTHCNIQGHTVDRCYKIHGYPPGYRNQRGSSTKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSD

Query:  LSSVDLQNI--WILDSGASAHICCSKELFV-SLKKVSAMTVSLPNH-DRLSVNHVGNVHINSD--IILHNVMFIPSFRFNLISISALTANLPVMIKFIVD
        +++  + +   ++LDSGAS H+   + L+  S++ V  + +++    + +     G V + +D  I L +V+F      NL+S+  L     + I+F   
Subjt:  LSSVDLQNI--WILDSGASAHICCSKELFV-SLKKVSAMTVSLPNH-DRLSVNHVGNVHINSD--IILHNVMFIPSFRFNLISISALTANLPVMIKFIVD

Query:  SCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSD-KHLDV-LKGLLSVKQVKSNL----SPCLVCPLAKQ
           I  K  L ++  + +   + ++             + S+N KH    N  +WH+R GH+SD K L++  K + S + + +NL      C  C   KQ
Subjt:  SCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSD-KHLDV-LKGLLSVKQVKSNL----SPCLVCPLAKQ

Query:  RRLTF---QSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPE--------
         RL F   +   ++   +F ++H D   P    T     YF+  V+  + Y   +L++ KSD  ++   F    +  F   +     DN  E        
Subjt:  RRLTF---QSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPE--------

Query:  ------LWFH-----------------------------------DFSSPKELITSFLVWNVPSKIWTPSRILNWQTPFFKLYQKNADYHALRTFGCLAF
              + +H                                    F     L  ++L+  +PS+    S     +TP+   + K      LR FG   +
Subjt:  ------LWFH-----------------------------------DFSSPKELITSFLVWNVPSKIWTPSRILNWQTPFFKLYQKNADYHALRTFGCLAF

Query:  ASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEI------VFSFHTITLQGDVTDPFPDLVLPISPNFS-GIPVVESPDVACT
           +   + KF  ++  ++F+GY P+  G+KL+D  N+K IV+RDV+  E          F T+ L+        +      PN S  I   E P+ +  
Subjt:  ASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEI------VFSFHTITLQGDVTDPFPDLVLPISPNFS-GIPVVESPDVACT

Query:  DAQINEPTDVACTDADNLIHPSTD-IHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLA----DPRRFSRAVKQPSYLRDYHCAL--------SKTMS
           I    D   ++  N  + S   I    P ++    ++QF   + S +S  +    S      D    S+    P+  R+   A         + T +
Subjt:  DAQINEPTDVACTDADNLIHPSTD-IHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLA----DPRRFSRAVKQPSYLRDYHCAL--------SKTMS

Query:  -----LPESKSKFPLHKVLSYDALSKQFRNFVLSVSSVYE--PQFYHQAV---PHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVD
             +     +      +SY+         VL+  +++   P  + +         W+EA++TEL A + NNTW++   P   + V  RW++ VKY   
Subjt:  -----LPESKSKFPLHKVLSYDALSKQFRNFVLSVSSVYE--PQFYHQAV---PHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVD

Query:  GTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN
        G   RYKARLVA+G+TQ+  +DY ETF+PVA++ + + +L+L + +N
Subjt:  GTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-5224.31Show/hide
Query:  LSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYFAKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMG
        LS++V  ++   ++AR IW  L+  Y  K       LK ++  L     S  T F    +++N L          Q    GVK      + +  +  L  
Subjt:  LSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYFAKLKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMG

Query:  LNESFAQIRTQLLLMEPEPTIQRAFS-LVAQEVEQRASVTPPPATLPAATALLVKTNSSSNTSNSSR-NSANTTKKKVRPFCTHCNIQGHTVDRCYKIHG
        L  S+  + T +L  +    ++   S L+  E  ++       A +        + +S++   + +R  S N +K +VR  C +CN  GH    C     
Subjt:  LNESFAQIRTQLLLMEPEPTIQRAFS-LVAQEVEQRASVTPPPATLPAATALLVKTNSSSNTSNSSR-NSANTTKKKVRPFCTHCNIQGHTVDRCYKIHG

Query:  YPPGYRNQRGSSTKSKT-STTAVNVTLNDPLSGLNAEQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCSKEL
          P  R  +G ++  K    TA  V  ND           +++  +              E     H++G  S+         W++D+ AS H    ++L
Subjt:  YPPGYRNQRGSSTKSKT-STTAVNVTLNDPLSGLNAEQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCSKEL

Query:  FVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDI----ILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGL--HLLQ
        F         TV + N     +  +G++ I +++    +L +V  +P  R NLIS  AL  +         +S     K  L   G   I +G+    L 
Subjt:  FVSLKKVSAMTVSLPNHDRLSVNHVGNVHINSDI----ILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGL--HLLQ

Query:  TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVL--KGLLSVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIP
          +  + Q   N+        D  ++ +WH R+GH+S+K L +L  K L+S  +  + + PC  C   KQ R++FQ+++    ++ DL++ D   P  I 
Subjt:  TGDVSVEQNLCNSLSVNKKHTDSTNIAVWHDRLGHLSDKHLDVL--KGLLSVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIP

Query:  THSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDF----------------SSPK-------------EL
        +  G KYF+T ++D SR  WV++++TK     +   F   ++ + G  +K  RSDN  E    +F                 +P+             E 
Subjt:  THSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTSIKSFRSDNAPELWFHDF----------------SSPK-------------EL

Query:  ITSFL-VWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKK
        + S L +  +P   W             +PS  L ++ P      K   Y  L+ FGC AFA      R+K   ++IP +F+GY  +  GY+L+D   KK
Subjt:  ITSFL-VWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKK

Query:  VIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQP
        VI SRDV+F E       +    D+++   + ++   PNF  IP           +  N PT    T                 TD V         ++ 
Subjt:  VIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQP

Query:  STQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSKFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPH---LHWQEAMHTEL
          Q      Q    D       V+ P+   + H  L ++        ++P                +VL +S   EP+   + + H       +AM  E+
Subjt:  STQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSKFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPH---LHWQEAMHTEL

Query:  QAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVS
        ++++ N T+ +V LP G   + C+W++K+K   D  + RYKARLV KG+ Q++G+D+ E FSPV K+ +++ +L+LA S
Subjt:  QAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVS

P92520 Uncharacterized mitochondrial protein AtMg008202.2e-1843.43Show/hide
Query:  EPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA
        EP+    A+    W +AM  EL A+  N TW +V  PV  + +GC+W++K K   DGT++R KARLVAKG+ Q+EG+ ++ET+SPV +  T++ +L +A
Subjt:  EPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.3e-5422.91Show/hide
Query:  GFVDGTLLQPTGNLRRS-----------WIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYFAK
        GF+DG+   P   +              W   + ++ + +L ++S  V  +V+ A +A +IW  L++ Y   +   + QL+ ++    +  +++  Y   
Subjt:  GFVDGTLLQPTGNLRRS-----------WIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYFAK

Query:  LKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLV---AQEVEQRASVTPPPATLPAATALLVK
        L + +++L+                         E V   L  L E +  +  Q+   +  PT+      +     ++   +S T  P T  A +     
Subjt:  LKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLV---AQEVEQRASVTPPPATLPAATALLVK

Query:  TNSSSNTSN-----SSRNSANTTK-------------KKVRPF---CTHCNIQGHTVDRCYKIHGYPPGYRNQRGSSTKSKTSTTAVNVTLNDPLSGLNA
        T +++N  N      +RN+ N +K              + +P+   C  C +QGH+  RC ++  +                            LS +N+
Subjt:  TNSSSNTSN-----SSRNSANTTK-------------KKVRPF---CTHCNIQGHTVDRCYKIHGYPPGYRNQRGSSTKSKTSTTAVNVTLNDPLSGLNA

Query:  EQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCS-KELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINS---
        +Q     T  Q   N                       L S    N W+LDSGA+ HI      L +         V + +   + ++H G+  +++   
Subjt:  EQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCS-KELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHINS---

Query:  DIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ--TGDVSVEQNLCNSLSVNKKHTDSTNI--AVWHDRLG
         + LHN++++P+   NLIS+  L     V ++F   S  ++D           +  G+ LLQ  T D   E  + +S  V+   + S+    + WH RLG
Subjt:  DIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ--TGDVSVEQNLCNSLSVNKKHTDSTNI--AVWHDRLG

Query:  HLSDKHLD--VLKGLLSVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIV
        H +   L+  +    LSV         C  C + K  ++ F  +   S    + I+ D W    I +H  Y+Y++  V+  +RYTW++ ++ KS      
Subjt:  HLSDKHLD--VLKGLLSVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIV

Query:  PIFFQYIKTQFGTSIKSFRSDNAPE---LW-------FHDFSSPKEL------------------ITSFLVWNVPSKIW-------------TPSRILNW
          F   ++ +F T I +F SDN  E   LW           +SP                     +T     ++P   W              P+ +L  
Subjt:  PIFFQYIKTQFGTSIKSFRSDNAPE---LW-------FHDFSSPKEL------------------ITSFLVWNVPSKIW-------------TPSRILNW

Query:  QTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSF-HTITLQGDVTDP-------
        ++PF KL+  + +Y  LR FGC  +      ++ K   ++   VF+GY      Y    ++  ++ +SR V F E  F F + +     V +        
Subjt:  QTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSF-HTITLQGDVTDP-------

Query:  -FPDLVLPISPNFSGIPVVESPDVACT-DAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQ-----------PSLAD
          P   LP        P    P  A T  +  + P   +   + NL    +    S+P  T    +      QP+TQ T  QTQ           P+   
Subjt:  -FPDLVLPISPNFSGIPVVESPDVACT-DAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPSTQSTAFQTQ-----------PSLAD

Query:  PRRFSRAVKQPSYLRD------YHCALSKTMSLPES---KSKFPLHKVLSYD----------------ALSKQFRNFVLSVS--SVYEPQFYHQAVPHLH
        P + ++++  P+             + S T   P S       PL ++++ +                 + K    + L+VS  +  EP+   QA+    
Subjt:  PRRFSRAVKQPSYLRD------YHCALSKTMSLPES---KSKFPLHKVLSYD----------------ALSKQFRNFVLSVS--SVYEPQFYHQAVPHLH

Query:  WQEAMHTELQAMEANNTWSVVSLPVGHHS-VGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN
        W+ AM +E+ A   N+TW +V  P  H + VGCRWI+  KY  DG++ RYKARLVAKGY Q+ GLDY ETFSPV K  +++++L +AV  +
Subjt:  WQEAMHTELQAMEANNTWSVVSLPVGHHS-VGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.9e-5023.12Show/hide
Query:  GFVDGTLLQPTGNLRRS-----------WIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYFAK
        GF+DG+   P   +              W   + ++ + IL ++S  V  +V+ A +A +IW  L++ Y   +   + QL+              T F +
Subjt:  GFVDGTLLQPTGNLRRS-----------WIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYFAK

Query:  LKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATALLVKTNS
        L  L   +          +      K ++     +     L  ++E      ++LL +     +     + A  V  R + T                N+
Subjt:  LKSLWNELSAYRPSCSCGQCTCGGVKELVTYFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATALLVKTNS

Query:  SSNTSNSSRNSANTTKKKVRPF---CTHCNIQGHTVDRCYKIHGYPPGYRNQRGSSTKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQSHLNKVKSGS
         SN+   S + + +  ++ +P+   C  C++QGH+  RC ++H +     NQ+ S++         N+ +N P +                         
Subjt:  SSNTSNSSRNSANTTKKKVRPF---CTHCNIQGHTVDRCYKIHGYPPGYRNQRGSSTKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQSHLNKVKSGS

Query:  DSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCS-KELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHI---NSDIILHNVMFIPSFRFNLISIS
                                N W+LDSGA+ HI      L           V + +   + + H G+  +   +  + L+ V+++P+   NLIS+ 
Subjt:  DSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCS-KELFVSLKKVSAMTVSLPNHDRLSVNHVGNVHI---NSDIILHNVMFIPSFRFNLISIS

Query:  ALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ--TGDVSVEQNLCNSLSVN------KKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLS--
         L     V ++F   S  ++D           +  G+ LLQ  T D   E  + +S +V+       K T S+    WH RLGH S   L +L  ++S  
Subjt:  ALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQ--TGDVSVEQNLCNSLSVN------KKHTDSTNIAVWHDRLGHLSDKHLDVLKGLLS--

Query:  ---VKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTS
           V      L  C  C + K  ++ F ++   S+   + I+ D W    I +   Y+Y++  V+  +RYTW++ ++ KS       IF   ++ +F T 
Subjt:  ---VKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFFQYIKTQFGTS

Query:  IKSFRSDNAPEL----------WFHDFSSPKEL------------------ITSFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADY
        I +  SDN  E               F+SP                     +T     +VP   W              P+ +L  Q+PF KL+ +  +Y
Subjt:  IKSFRSDNAPEL----------WFHDFSSPKEL------------------ITSFLVWNVPSKIW-------------TPSRILNWQTPFFKLYQKNADY

Query:  HALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTI-----TLQGDVTDPFPD----LVLPISPNFS
          L+ FGC  +      +R K   ++    FMGY      Y    I   ++  SR V F E  F F T      T Q   +D  P+      LP +P   
Subjt:  HALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGYKLYDIENKKVIVSRDVIFHEIVFSFHTI-----TLQGDVTDPFPD----LVLPISPNFS

Query:  GIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDI------HISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQP---SYLRDY
          P    P +  +    + P+ +  T   +   PS+ I        + P+     P  Q +  Q S  ++     P+   P   S     P   S +   
Subjt:  GIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDI------HISTPTDTVVLPDVQFYAAQPSTQSTAFQTQPSLADPRRFSRAVKQP---SYLRDY

Query:  HCAL-SKTMSLPESKSKF-----PLHKVL----------------------SYDALSK--QFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEA
        H    S ++S P S S       PL  VL                      + D + K  Q  ++  S+++  EP+   QA+    W++AM +E+ A   
Subjt:  HCAL-SKTMSLPESKSKF-----PLHKVL----------------------SYDALSK--QFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEA

Query:  NNTWSVVSLPVGHHS-VGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN
        N+TW +V  P    + VGCRWI+  K+  DG++ RYKARLVAKGY Q+ GLDY ETFSPV K  +++++L +AV  +
Subjt:  NNTWSVVSLPVGHHS-VGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).8.7e-2636.42Show/hide
Query:  LTVKNKMGFVDGTLLQPT--GNLRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYFAKLK
        L V  K GF+DGTL +P     L + W  CN +V  W++NS+++++  SV +AE+A ++W DL++ +      +I+QL+  ++ L Q   SV  YF KL 
Subjt:  LTVKNKMGFVDGTLLQPT--GNLRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYFAKLK

Query:  SLWNELSAYR--PSCSCGQCTCGGVKELVTYFQTEHVMAFLMG--LNESFAQIRTQLLLMEPEPTIQRAFSLV
         +W ELS Y   P C CG C C   K      + E    FLMG  LN+ F  + T+++  +P P++  AF++V
Subjt:  SLWNELSAYR--PSCSCGQCTCGGVKELVTYFQTEHVMAFLMG--LNESFAQIRTQLLLMEPEPTIQRAFSLV

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-3539.68Show/hide
Query:  AQPSTQSTAFQTQPS------LADP--RRFSRAVKQPSYLRDYHCALSKTMSLPESKSKFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHW
        A  ST S++    PS      + +P      R  ++P+YL+DY+C    ++++ +      + + LSY+ +S  + +F++ ++   EP  Y++A   L W
Subjt:  AQPSTQSTAFQTQPS------LADP--RRFSRAVKQPSYLRDYHCALSKTMSLPESKSKFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHW

Query:  QEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN
          AM  E+ AME  +TW + +LP     +GC+W+YK+KY  DGT+ERYKARLVAKGYTQQEG+D+IETFSPV K+ +VK++L ++  +N
Subjt:  QEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.6e-1943.43Show/hide
Query:  EPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA
        EP+    A+    W +AM  EL A+  N TW +V  PV  + +GC+W++K K   DGT++R KARLVAKG+ Q+EG+ ++ET+SPV +  T++ +L +A
Subjt:  EPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSLPVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTCAGCCAAGCCTTCAATTCCTCAGTCATGCATGGGGTGCTCCACGCTAGCTACTGGTTTTGCTTCGGGCTAACTCATTATCAACATCAAAATCAATCCCGAAC
TCAGAACGTAGGCCTGCAAAGCCTTCTGAGAGTTGTTTTCCCCTTGCACCCTTGTACCGTGAGCAACAAACCTTTGCGCGCCATGGCTGATGATCCTGAAAATGGCACTG
AAACAAATCAAGTTTCACCTTCCGCCCCAACTTCCCCTTCTTCCACGGCGATGATCATCAGGCTTACTGTGAAGAATAAAATGGGCTTCGTTGACGGCACCTTGCTACAA
CCAACTGGAAATTTACGGCGGTCTTGGATAATTTGCAACACTGTGGTAACAGCATGGATCTTGAATTCCTTGTCGAATGAAGTCTCTGCCAGTGTTAACTTCGCTGAATC
CGCTCGAGAGATATGGCTTGATCTCCAACAGTGGTATAAACGAAAGAATCGTCCACGAATCTTTCAATTAAAACATGAAATTTCAAATCTTGTCCAAGATCAACAGTCCG
TCACTACATATTTCGCCAAATTGAAGTCTCTATGGAATGAACTATCTGCATATCGTCCTTCTTGTTCCTGCGGACAATGCACCTGTGGTGGAGTCAAGGAATTGGTAACC
TATTTTCAAACGGAACACGTTATGGCGTTCCTCATGGGTCTCAATGAGTCGTTTGCTCAAATTCGGACGCAATTGTTACTCATGGAGCCCGAACCTACCATTCAACGAGC
TTTCTCTTTAGTGGCACAAGAAGTTGAACAACGGGCCTCTGTAACTCCGCCTCCTGCTACACTTCCGGCTGCTACTGCCCTCCTTGTGAAGACCAATTCGAGCTCTAACA
CATCCAATTCCTCTCGGAATTCAGCGAATACCACAAAGAAGAAGGTGCGTCCCTTTTGCACTCACTGCAATATTCAGGGTCACACAGTTGACCGTTGCTACAAAATTCAC
GGATATCCCCCTGGATATCGTAATCAAAGAGGCAGTTCAACCAAATCAAAGACTTCGACAACTGCCGTTAATGTCACTCTTAATGATCCTCTCTCTGGCCTAAATGCAGA
GCAATGCCAAGATATATTAACTCTTTTACAATCGCACCTCAACAAAGTCAAGTCTGGGTCTGATTCTGTTGAATCCTCCAGTACTACTCATGTAGCAGGTACTCATTCTG
ACTTATCCTCTGTTGACTTGCAAAATATATGGATACTTGACTCCGGTGCTTCCGCTCACATTTGTTGTTCAAAAGAGTTGTTTGTTTCCCTCAAGAAAGTTTCTGCTATG
ACTGTCTCCTTGCCCAATCATGATCGATTATCTGTAAATCATGTTGGTAATGTTCACATTAACTCTGATATTATTCTTCATAATGTCATGTTCATCCCATCCTTCCGGTT
CAACCTGATTTCTATCAGTGCCTTAACTGCCAATTTGCCTGTTATGATCAAATTTATTGTTGATTCTTGTCTCATTCAAGACAAGTGCTCTTTGAGGATGATTGGCAAAG
CTAAAATCTGGCAAGGCTTGCATCTCCTCCAAACTGGTGATGTGTCTGTTGAACAAAATCTTTGTAATTCTCTATCTGTGAACAAAAAACATACTGATTCTACCAATATT
GCTGTTTGGCATGATAGATTAGGGCATCTCTCTGATAAGCATCTGGATGTTCTTAAAGGTCTCTTGTCTGTAAAACAAGTTAAGAGCAATCTCTCCCCTTGCTTAGTATG
TCCTTTGGCTAAACAACGTCGTCTTACTTTTCAATCCAATAACAATGTTTCTGCGCATATGTTTGATCTCATTCATTGTGATACCTGGGATCCTTATCACATACCTACTC
ACTCGGGTTACAAGTATTTTTTGACTATAGTGGAAGACCACTCTAGGTACACTTGGGTATTTCTCATGAGGACCAAATCAGATGCCTTAACCATTGTTCCAATTTTTTTT
CAGTACATCAAAACACAATTTGGAACTTCTATCAAAAGTTTTCGATCTGATAATGCTCCTGAGCTATGGTTCCATGATTTTTCCTCTCCCAAGGAGTTAATCACCAGTTT
TCTTGTGTGGAACGTCCCGAGCAAAATTTGGACTCCTTCCCGAATTCTCAATTGGCAAACTCCATTCTTCAAGCTATATCAGAAGAATGCTGATTATCATGCCTTAAGGA
CTTTTGGCTGCCTTGCTTTTGCCTCAACACTACATGCTCATCGTTCAAAGTTTCATCCAAGGGCCATCCCTACTGTCTTCATGGGATATCCACCTGACATGAAAGGGTAC
AAACTCTATGACATTGAGAATAAAAAGGTAATTGTCTCAAGAGATGTTATTTTCCATGAGATTGTCTTTTCTTTTCACACAATTACCTTGCAAGGAGATGTCACAGATCC
TTTTCCAGATCTAGTTTTACCCATTTCTCCAAACTTCTCTGGAATTCCTGTTGTTGAAAGCCCTGATGTTGCTTGCACTGATGCTCAAATAAATGAGCCTACTGATGTTG
CTTGCACTGATGCTGATAATCTGATTCATCCTAGTACTGATATTCATATCAGTACACCAACTGACACGGTTGTACTGCCTGATGTCCAATTTTATGCTGCTCAGCCTTCA
ACACAGTCAACTGCTTTTCAAACACAACCCTCACTAGCAGATCCTCGAAGGTTTTCCCGTGCTGTTAAACAACCCTCTTACCTTCGTGACTATCATTGTGCTTTGTCTAA
AACCATGTCTCTGCCTGAGAGCAAATCAAAGTTTCCCTTGCACAAAGTACTTTCTTATGATGCCTTGTCAAAACAATTTCGTAACTTTGTTTTGTCTGTGTCTTCTGTCT
ATGAGCCTCAATTCTACCATCAAGCAGTCCCACACTTACATTGGCAAGAAGCTATGCATACTGAGTTGCAAGCAATGGAAGCAAATAACACCTGGAGTGTTGTATCTCTT
CCTGTTGGTCATCACTCGGTTGGATGTCGATGGATCTACAAGGTGAAATACAAAGTCGATGGGACTATGGAGCGTTATAAAGCAAGGCTCGTTGCGAAAGGTTACACACA
ACAGGAAGGTCTCGATTATATAGAGACATTCTCGCCTGTTGCCAAAGTAGTAACTGTCAAAGTTTTGCTCACTCTTGCTGTGTCCCATAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTTCAGCCAAGCCTTCAATTCCTCAGTCATGCATGGGGTGCTCCACGCTAGCTACTGGTTTTGCTTCGGGCTAACTCATTATCAACATCAAAATCAATCCCGAAC
TCAGAACGTAGGCCTGCAAAGCCTTCTGAGAGTTGTTTTCCCCTTGCACCCTTGTACCGTGAGCAACAAACCTTTGCGCGCCATGGCTGATGATCCTGAAAATGGCACTG
AAACAAATCAAGTTTCACCTTCCGCCCCAACTTCCCCTTCTTCCACGGCGATGATCATCAGGCTTACTGTGAAGAATAAAATGGGCTTCGTTGACGGCACCTTGCTACAA
CCAACTGGAAATTTACGGCGGTCTTGGATAATTTGCAACACTGTGGTAACAGCATGGATCTTGAATTCCTTGTCGAATGAAGTCTCTGCCAGTGTTAACTTCGCTGAATC
CGCTCGAGAGATATGGCTTGATCTCCAACAGTGGTATAAACGAAAGAATCGTCCACGAATCTTTCAATTAAAACATGAAATTTCAAATCTTGTCCAAGATCAACAGTCCG
TCACTACATATTTCGCCAAATTGAAGTCTCTATGGAATGAACTATCTGCATATCGTCCTTCTTGTTCCTGCGGACAATGCACCTGTGGTGGAGTCAAGGAATTGGTAACC
TATTTTCAAACGGAACACGTTATGGCGTTCCTCATGGGTCTCAATGAGTCGTTTGCTCAAATTCGGACGCAATTGTTACTCATGGAGCCCGAACCTACCATTCAACGAGC
TTTCTCTTTAGTGGCACAAGAAGTTGAACAACGGGCCTCTGTAACTCCGCCTCCTGCTACACTTCCGGCTGCTACTGCCCTCCTTGTGAAGACCAATTCGAGCTCTAACA
CATCCAATTCCTCTCGGAATTCAGCGAATACCACAAAGAAGAAGGTGCGTCCCTTTTGCACTCACTGCAATATTCAGGGTCACACAGTTGACCGTTGCTACAAAATTCAC
GGATATCCCCCTGGATATCGTAATCAAAGAGGCAGTTCAACCAAATCAAAGACTTCGACAACTGCCGTTAATGTCACTCTTAATGATCCTCTCTCTGGCCTAAATGCAGA
GCAATGCCAAGATATATTAACTCTTTTACAATCGCACCTCAACAAAGTCAAGTCTGGGTCTGATTCTGTTGAATCCTCCAGTACTACTCATGTAGCAGGTACTCATTCTG
ACTTATCCTCTGTTGACTTGCAAAATATATGGATACTTGACTCCGGTGCTTCCGCTCACATTTGTTGTTCAAAAGAGTTGTTTGTTTCCCTCAAGAAAGTTTCTGCTATG
ACTGTCTCCTTGCCCAATCATGATCGATTATCTGTAAATCATGTTGGTAATGTTCACATTAACTCTGATATTATTCTTCATAATGTCATGTTCATCCCATCCTTCCGGTT
CAACCTGATTTCTATCAGTGCCTTAACTGCCAATTTGCCTGTTATGATCAAATTTATTGTTGATTCTTGTCTCATTCAAGACAAGTGCTCTTTGAGGATGATTGGCAAAG
CTAAAATCTGGCAAGGCTTGCATCTCCTCCAAACTGGTGATGTGTCTGTTGAACAAAATCTTTGTAATTCTCTATCTGTGAACAAAAAACATACTGATTCTACCAATATT
GCTGTTTGGCATGATAGATTAGGGCATCTCTCTGATAAGCATCTGGATGTTCTTAAAGGTCTCTTGTCTGTAAAACAAGTTAAGAGCAATCTCTCCCCTTGCTTAGTATG
TCCTTTGGCTAAACAACGTCGTCTTACTTTTCAATCCAATAACAATGTTTCTGCGCATATGTTTGATCTCATTCATTGTGATACCTGGGATCCTTATCACATACCTACTC
ACTCGGGTTACAAGTATTTTTTGACTATAGTGGAAGACCACTCTAGGTACACTTGGGTATTTCTCATGAGGACCAAATCAGATGCCTTAACCATTGTTCCAATTTTTTTT
CAGTACATCAAAACACAATTTGGAACTTCTATCAAAAGTTTTCGATCTGATAATGCTCCTGAGCTATGGTTCCATGATTTTTCCTCTCCCAAGGAGTTAATCACCAGTTT
TCTTGTGTGGAACGTCCCGAGCAAAATTTGGACTCCTTCCCGAATTCTCAATTGGCAAACTCCATTCTTCAAGCTATATCAGAAGAATGCTGATTATCATGCCTTAAGGA
CTTTTGGCTGCCTTGCTTTTGCCTCAACACTACATGCTCATCGTTCAAAGTTTCATCCAAGGGCCATCCCTACTGTCTTCATGGGATATCCACCTGACATGAAAGGGTAC
AAACTCTATGACATTGAGAATAAAAAGGTAATTGTCTCAAGAGATGTTATTTTCCATGAGATTGTCTTTTCTTTTCACACAATTACCTTGCAAGGAGATGTCACAGATCC
TTTTCCAGATCTAGTTTTACCCATTTCTCCAAACTTCTCTGGAATTCCTGTTGTTGAAAGCCCTGATGTTGCTTGCACTGATGCTCAAATAAATGAGCCTACTGATGTTG
CTTGCACTGATGCTGATAATCTGATTCATCCTAGTACTGATATTCATATCAGTACACCAACTGACACGGTTGTACTGCCTGATGTCCAATTTTATGCTGCTCAGCCTTCA
ACACAGTCAACTGCTTTTCAAACACAACCCTCACTAGCAGATCCTCGAAGGTTTTCCCGTGCTGTTAAACAACCCTCTTACCTTCGTGACTATCATTGTGCTTTGTCTAA
AACCATGTCTCTGCCTGAGAGCAAATCAAAGTTTCCCTTGCACAAAGTACTTTCTTATGATGCCTTGTCAAAACAATTTCGTAACTTTGTTTTGTCTGTGTCTTCTGTCT
ATGAGCCTCAATTCTACCATCAAGCAGTCCCACACTTACATTGGCAAGAAGCTATGCATACTGAGTTGCAAGCAATGGAAGCAAATAACACCTGGAGTGTTGTATCTCTT
CCTGTTGGTCATCACTCGGTTGGATGTCGATGGATCTACAAGGTGAAATACAAAGTCGATGGGACTATGGAGCGTTATAAAGCAAGGCTCGTTGCGAAAGGTTACACACA
ACAGGAAGGTCTCGATTATATAGAGACATTCTCGCCTGTTGCCAAAGTAGTAACTGTCAAAGTTTTGCTCACTCTTGCTGTGTCCCATAATTAG
Protein sequenceShow/hide protein sequence
MDFSQAFNSSVMHGVLHASYWFCFGLTHYQHQNQSRTQNVGLQSLLRVVFPLHPCTVSNKPLRAMADDPENGTETNQVSPSAPTSPSSTAMIIRLTVKNKMGFVDGTLLQ
PTGNLRRSWIICNTVVTAWILNSLSNEVSASVNFAESAREIWLDLQQWYKRKNRPRIFQLKHEISNLVQDQQSVTTYFAKLKSLWNELSAYRPSCSCGQCTCGGVKELVT
YFQTEHVMAFLMGLNESFAQIRTQLLLMEPEPTIQRAFSLVAQEVEQRASVTPPPATLPAATALLVKTNSSSNTSNSSRNSANTTKKKVRPFCTHCNIQGHTVDRCYKIH
GYPPGYRNQRGSSTKSKTSTTAVNVTLNDPLSGLNAEQCQDILTLLQSHLNKVKSGSDSVESSSTTHVAGTHSDLSSVDLQNIWILDSGASAHICCSKELFVSLKKVSAM
TVSLPNHDRLSVNHVGNVHINSDIILHNVMFIPSFRFNLISISALTANLPVMIKFIVDSCLIQDKCSLRMIGKAKIWQGLHLLQTGDVSVEQNLCNSLSVNKKHTDSTNI
AVWHDRLGHLSDKHLDVLKGLLSVKQVKSNLSPCLVCPLAKQRRLTFQSNNNVSAHMFDLIHCDTWDPYHIPTHSGYKYFLTIVEDHSRYTWVFLMRTKSDALTIVPIFF
QYIKTQFGTSIKSFRSDNAPELWFHDFSSPKELITSFLVWNVPSKIWTPSRILNWQTPFFKLYQKNADYHALRTFGCLAFASTLHAHRSKFHPRAIPTVFMGYPPDMKGY
KLYDIENKKVIVSRDVIFHEIVFSFHTITLQGDVTDPFPDLVLPISPNFSGIPVVESPDVACTDAQINEPTDVACTDADNLIHPSTDIHISTPTDTVVLPDVQFYAAQPS
TQSTAFQTQPSLADPRRFSRAVKQPSYLRDYHCALSKTMSLPESKSKFPLHKVLSYDALSKQFRNFVLSVSSVYEPQFYHQAVPHLHWQEAMHTELQAMEANNTWSVVSL
PVGHHSVGCRWIYKVKYKVDGTMERYKARLVAKGYTQQEGLDYIETFSPVAKVVTVKVLLTLAVSHN