; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0019368 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0019368
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr04:11574402..11578736
RNA-Seq ExpressionIVF0019368
SyntenyIVF0019368
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037867.1 putative polyprotein (retrotrasposon protein) [Cucumis melo var. makuwa]0.093.25Show/hide
Query:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
        MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
Subjt:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW

Query:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
        MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
Subjt:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE

Query:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR
        QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV    +      
Subjt:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR

Query:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSM---DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV
        + +E +     V +T +++ T+            K   K    K     DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV
Subjt:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSM---DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV

Query:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
        DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
Subjt:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE

Query:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
        RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
Subjt:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS

Query:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
        IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
Subjt:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS

Query:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
        RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
Subjt:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD

Query:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
        SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT     YGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
Subjt:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA

Query:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
        VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
Subjt:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS

Query:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------
        DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ                                   
Subjt:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------

Query:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA
            FLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA
Subjt:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA

Query:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
        TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
Subjt:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG

Query:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
        VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
Subjt:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI

Query:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
        CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
Subjt:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP

RVW36328.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.050.38Show/hide
Query:  MAEETNNRQ-LIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKR
        MAE  +  Q LI   +++ S     +TE LT+A   + QP L      +    P+ I   G NY +WSQ+VE+ ++ KDKLG+ING+  QP   D  ++R
Subjt:  MAEETNNRQ-LIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKR

Query:  WMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTG-TLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNR
        W ++N++VKGW+++S+DP+LI NFIRF TAK+VWDS  +T+FDG D +Q+YDL  +V  +KQ G ++E Y+  LQGLW+EI+FR+PNPM C IDI+KYN 
Subjt:  WMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTG-TLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNR

Query:  VEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAK----QVAGVPHISTNSTHGNSNNSVQEIHC
        + Q+ +VY FL GL+DR D  R+++L++ P PTVE+ Y  VRRE+VRQ+VM+  +  L  +V M  +G K    Q+   P   + S+ G SN+S      
Subjt:  VEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAK----QVAGVPHISTNSTHGNSNNSVQEIHC

Query:  LCHRRYRERNP---VLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELKKKQAEAKR------------------GKVSIAVSGKNGDD
             ++ + P   +        +H      +    P                DW+ +L   QA  KR                    +S+    ++  D
Subjt:  LCHRRYRERNP---VLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELKKKQAEAKR------------------GKVSIAVSGKNGDD

Query:  SPSEGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVK
        S +   +F  S    D+  WI+DSGA+DHMTF   D +     +++ + NANG++YPV  AG++ ++  L+LSN L+VPSL+ RLMSVS++T D NCVV 
Subjt:  SPSEGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVK

Query:  MYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQT-EVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPS
        MYS + ++QDILTKEIIGRG +R GLY+++   +GRAN +  +    + +IW WH RLGHPS GY+K LLP LF       F CDTCI AKSHR SY  S
Subjt:  MYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQT-EVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPS

Query:  SNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHET
         NKS  PFDLIHSDVWGP+ V++  G++WFV+F+DDCTRMTW+YLLK KDEV S+F+ F+ M++TQF   IK  RSDNGGE++ +  + +F   GI+HET
Subjt:  SNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHET

Query:  SCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSP
        SC  TPQQNG+AERKNRHILE +RALL   HVP ++W  ++  AVY+LNR+PTK+  FQTPLK L  H S+P+VL + P+IFGCVA+VH+ K QR+KL P
Subjt:  SCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSP

Query:  CAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHEAFFKPTIQSD-QGENSSESDVWKNKDLLSFQGWTLYGQDGVSGTDESDGNGNGTDESDGDG
        CAVRC+FLG+G  +KGY+C+D  +++ Y+TMDVTF+E E FF P   S  QGE   E   W + ++L        G +     D++D       E D   
Subjt:  CAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHEAFFKPTIQSD-QGENSSESDVWKNKDLLSFQGWTLYGQDGVSGTDESDGNGNGTDESDGDG

Query:  NGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPM----SPEDQLQVS-LPTSK-YTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKS
               +P+   S +    VF         + PN+ P   +    SP   LQ + + TS  Y LP R NRG PP RY PD +  ++SKYPIAN V T+ 
Subjt:  NGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPM----SPEDQLQVS-LPTSK-YTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKS

Query:  LSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQ-----------
        LS P++ F   L SC+IP  V+EA SDP W+QA++ E+ AL  N TW L  LP+G+  V C+W+FSIKY A+G IDRYKARLVAKGYTQ           
Subjt:  LSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQ-----------

Query:  --------------------------TQECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR
                                   +  FLHG+L+EE+YM+ PPGY +T+   + C+L +ALYGLKQSPRAWFGR   AM+ YGF+Q +SDHTLFLK 
Subjt:  --------------------------TQECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR

Query:  NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVP
           K+TALI+YVDDMI+TG+D +EI+ L+ +LS EFEMKNLGGLKYFL IEV RS+QGI LSQRKYILDLLAE+G+L+C+PAD P+VQ  KLGE+ DQVP
Subjt:  NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVP

Query:  ANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLV
        A+K+RYQRLVGKLIYL+HTRPDIAY VSVVSQFMH PSEDHMDAVMRI+RYLK  PGKG+ F KNGHL V+G+TDADWAG+++DR+STAGYFTFVGGNLV
Subjt:  ANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLV

Query:  TWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
        TWRSKKQ VVALSSAEAEFRG+ KGICEL+WL++LL E+G AP+  M+L+CD+  AI ISHNP
Subjt:  TWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP

RVX17869.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.047.13Show/hide
Query:  SSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRWMSDNSMVKGWILSSLDP
        S     ++T K+T+A+ +   P L   T P   L  + I  +G NY +WSQ+VE+ ++ KDKLG+ING+  QP S D  +++W +DN++VKGW+++S+DP
Subjt:  SSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRWMSDNSMVKGWILSSLDP

Query:  NLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTG-TLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRF
         LIGNFIRF TAK VWDSI  T+FDG+D +Q+YDL  +V  LKQ G +LE  +  LQGLW+EI+FR+PNPM C +DI  YN + Q+ +VY+FL GL+DR 
Subjt:  NLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTG-TLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRF

Query:  DGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSVQEIHCLCHRRYRERNPVLIKVL-YTLQ
        D IR ++L++ P PTVE+ Y HVRRE +RQSVM+  + D  +   +  +G K  + +   + ++    S  S + + C      +       K+  Y+  
Subjt:  DGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSVQEIHCLCHRRYRERNPVLIKVL-YTLQ

Query:  HDTTKLKQFSSAPTV---AAKAIVKTPVLKSMDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLTA
         +  + K+   A T    +A A+V T                AE +   +       +G+     G    +S        W++DSGA+DHMTFT  D T 
Subjt:  HDTTKLKQFSSAPTV---AAKAIVKTPVLKSMDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLTA

Query:  ECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANL
          + ++T   NANG+  PV  AG++ ++ +L L N L VPSL+ +L+SVS++T D NC+V +Y    ++QDILTKEIIGRG +R GLY++EDL  GRA+ 
Subjt:  ECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANL

Query:  VVDQTEV-QNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTR
             +V +N++W WHRRLGHPS  YMK L P LF       F C+TCI AKS+R S+    NK ++PF LIHSDVWG +P+++++G KWFVLF+DDCTR
Subjt:  VVDQTEV-QNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTR

Query:  MTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDK
        MTW+YLLK KDEV  VFK F+ M++TQF   ++  RSDNGGE++    +++F Q GI+HETSC  TPQQNG+ ERKNRH+LE +RALL   H PT+FW  
Subjt:  MTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDK

Query:  SILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHE
        ++  AV++LNR+ +K+ +FQTPL+ L  + ++P++L L P +FGCVAYVH+ K Q++KL PCA  C+FLG+  +QKGY+C+D+ S + Y+TMDVTF+E E
Subjt:  SILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHE

Query:  AFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTLYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEE----VIQPNE
         FF P     QGE   E            Q WT      VS                            + +  R         E   ++    V  P+ 
Subjt:  AFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTLYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEE----VIQPNE

Query:  EPQGPMSPEDQLQVS-------LPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAME
         P+ P +PE+  +VS        P   Y LP R NRG  P RY PD +  ++S+YPIAN+V TK L+ P+K F  N+  C +P  V+EA  DP W+QA++
Subjt:  EPQGPMSPEDQLQVS-------LPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAME

Query:  AEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQT-------------------------------------QECFLHGE
         EM  L  NKTW LV L +GK  VGC+WVFSIK+ A+G I+RYKARLVAKGYTQT                                     +  FLHG 
Subjt:  AEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQT-------------------------------------QECFLHGE

Query:  LKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGE
        L+EEVYM+ PPGY  T                                  G  + ++DHTLFLK+ Q K+TALI+YVDDM++TG+D +EI+ L+ +L+ E
Subjt:  LKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGE

Query:  FEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMH
        FEMKNLGGLKYFL I+V RS QGI LSQRKY+LDLL+E+G+L+C+P DTP+VQ  KLG +P+Q P +K RYQRLV KLIYL+HTRPDIAY VSVVSQFMH
Subjt:  FEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMH

Query:  NPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRL
         PSE+HM+AV+RI+RYLK  PGKG+ F KN H+ V G+TDADWAG++SDR+ST+GYFTFVGGNLVTWRSKKQ VVALSSAEAEFRG+AKG+CELLWL+RL
Subjt:  NPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRL

Query:  LGELGFAPTQAMDLYCDSRPAIDISHNP
        L E+GFAP   M+L+CD++  IDISHNP
Subjt:  LGELGFAPTQAMDLYCDSRPAIDISHNP

TYJ98005.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.077.31Show/hide
Query:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
        MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
Subjt:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW

Query:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
        MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
Subjt:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE

Query:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR
        QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV    +      
Subjt:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR

Query:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSM---DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV
        + +E +     V +T +++ T+            K   K    K     DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEG IFLSSNLSND STWIV
Subjt:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSM---DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV

Query:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
        DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
Subjt:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE

Query:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
        RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAP   
Subjt:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS

Query:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
                                                                                                            
Subjt:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS

Query:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
                                                                                                 NQKGYKCFDVD
Subjt:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD

Query:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
        SRKWYVTMDVTF+EHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT     YGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
Subjt:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA

Query:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
        VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
Subjt:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS

Query:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------
        DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ                                   
Subjt:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------

Query:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA
            FLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIV GNDTQEIA
Subjt:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA

Query:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
        TLEKKLSGEFEMKNLGGLKYFL IEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
Subjt:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG

Query:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
        VSVVSQFMHNPSEDHMDAVMRI+RYLKGCPGKG                               YFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
Subjt:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI

Query:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
        CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
Subjt:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP

TYK09814.1 putative polyprotein (retrotrasposon protein) [Cucumis melo var. makuwa]0.092.9Show/hide
Query:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
        MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
Subjt:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW

Query:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
        MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
Subjt:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE

Query:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR
        QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV    +      
Subjt:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR

Query:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSM---DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV
        + +E +     V +T +++ T+            K   K    K     DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEG IFLSSNLSND STWIV
Subjt:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSM---DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV

Query:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
        DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
Subjt:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE

Query:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
        RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
Subjt:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS

Query:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
        IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
Subjt:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS

Query:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
        RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
Subjt:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD

Query:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
        SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT     YGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
Subjt:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA

Query:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
        VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
Subjt:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS

Query:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------
        DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ                                   
Subjt:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------

Query:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA
            FLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIV GNDTQEIA
Subjt:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA

Query:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
        TLEKKLSGEFEMKNLGGLKYFL IEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
Subjt:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG

Query:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
        VSVVSQFMHNPSEDHMDAVMRI+RYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
Subjt:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI

Query:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
        CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
Subjt:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP

TrEMBL top hitse value%identityAlignment
A0A2N9G4T4 Integrase catalytic domain-containing protein0.0e+0051.62Show/hide
Query:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
        MAE T+  Q +   N  +   +  LT K+T+ + R+   P  P    +    P+ I  +G NY +WSQ+VE+ ++ KDKLG+ING+  QP   D  ++RW
Subjt:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW

Query:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQT-GTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRV
         ++N++VKGW+++S+D +LI NFIRF TAK+VWDS   T+FDG D +Q+YDL  +V   KQ  G++E Y+  LQGLW+EI+FR+PNPM C  DI+KYN +
Subjt:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQT-GTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRV

Query:  EQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGV-PHISTNSTHGNSNNSVQEIHCLCHR
         Q+ +VYIFL GL+DR D  R+++L++ P PTVE+ Y HVRREDVRQ VM                GA    GV P      TH  +    +E  C    
Subjt:  EQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGV-PHISTNSTHGNSNNSVQEIHCLCHR

Query:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELKKKQAEAKRGKVSIAVSGK-NGDDSPSEGIIFLSSNLSNDNSTWIVDS
         Y +             HD    K+             + PV+            +     G+ S++++ +     +P      L S+  ND+  WI+DS
Subjt:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELKKKQAEAKRGKVSIAVSGK-NGDDSPSEGIIFLSSNLSNDNSTWIVDS

Query:  GASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERD
        GA+DHMTF   D +     +++ + NANG++YPV  AG + ++  L+LS+ L+VPSL+ +LMSVS++T D NCVV MYS + ++QDILTKEIIGRG +R 
Subjt:  GASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERD

Query:  GLYHLEDLKAGRANLVVDQTEVQNK---IWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVS
        GLY+++D    RAN +     V NK   IW WH RLGHPS GY+K L P LF N   + F C+TCI AKSHRVSY  S NKS  PF LIHSDVWGP+P++
Subjt:  GLYHLEDLKAGRANLVVDQTEVQNK---IWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVS

Query:  SIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEM
        +  G++WFV+F+DDCTRMTW+YLLK KDEV  VFK F+ M++TQF   I+  RSDNGGE++ +  + +F   G+ HETSC  TPQQNG+AERKNRHILE 
Subjt:  SIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEM

Query:  SRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDV
        +RALL   HVP+++WD ++  AV++LNR+PTK+  FQTPLK L  H  +P+VL + P+IFGCVA+VH+ K QR+KL PCAVRC+FLG+G ++KGY+CFD 
Subjt:  SRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDV

Query:  DSRKWYVTMDVTFVEHEAFF-KPTIQSD-QGENSSESDVWKNKDLLSFQGWTLYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAV
         +++ Y+TMDVTF+E + FF  P   S  QGE   E   W             +G + +   D      +G D  + D          ++MY R +  ++
Subjt:  DSRKWYVTMDVTFVEHEAFF-KPTIQSD-QGENSSESDVWKNKDLLSFQGWTLYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAV

Query:  FGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSK-----------YTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCK
           ESE+E       +P  P S E+  +VS PT+            Y LP R NRG PP RY PD +  ++SKYPIAN+V T+ LS P+K F   L SC 
Subjt:  FGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSK-----------YTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCK

Query:  IPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKG--------------YTQTQECFLHGELKEE
        IP +V+EA SDP W+QA++ E+ AL  NKTW LV LP+GK  VGC+W+FSIKY A+G IDRYKARLVAKG                  +  FLHG+L+EE
Subjt:  IPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKG--------------YTQTQECFLHGELKEE

Query:  VYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMK
        VYM+ PPGY +++   + CKL +ALYGLKQSPRAWFGRF  AM+ YGF+Q +SDHTLFLK    K+TALI+YVDDMI+TG+D +EI+ L+++LS EFEMK
Subjt:  VYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMK

Query:  NLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSE
        NLGGLKYFL IEV RS+QGI LSQRKY+LDLL+E+G+L+C+PADTP+V   KLGE+ DQVPA+KERYQRLVGKLIYL+HTRPDIAY VSVVSQFMH PSE
Subjt:  NLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSE

Query:  DHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGEL
        DHMDAV+RI+RYLK  PGKG+ F KN HL+V G+TDADWAG++SDR+ST+GYFTFVGGNLVTWRSKKQ VVALSSAEAEFRG+AKG+CELLWLRRLL E+
Subjt:  DHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGEL

Query:  GFAPTQAMDLYCDSRPAIDISHNP
        GFAP+  M+L+CD++ AIDISHNP
Subjt:  GFAPTQAMDLYCDSRPAIDISHNP

A0A2N9IDK7 Integrase catalytic domain-containing protein0.0e+0050.61Show/hide
Query:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
        MAE T+  Q +   N  +   +  LT K+T+ + R+   P  P    +    P+ I  +G NY +WSQ+VE+ ++ KDKLG+ING+  QP   D  ++RW
Subjt:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW

Query:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQT-GTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRV
         ++N++VKGW+++S+D +LI NFIRF TAK+VWDS   T+FDG D +Q+YDL  +V   KQ  G++E Y+  LQGLW+EI+FR+PNPM C  DI+KYN +
Subjt:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQT-GTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRV

Query:  EQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAK-------QVAGVPHISTNSTHGNSNNSVQEI
         Q+ +VYIFL GL+DR D  R+++L++ P PTVE+ Y HVRREDVRQ VM     + +  V M  +G K          GV  +S+  ++  S +     
Subjt:  EQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAK-------QVAGVPHISTNSTHGNSNNSVQEI

Query:  HCLCHRRYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELK-KKQAEA-----KRGKVSIA--------VSGKNGDDSPSEG
           C                  +H      +    P                DW+ +L+ +K+ EA       G+V++          S      +P   
Subjt:  HCLCHRRYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELK-KKQAEA-----KRGKVSIA--------VSGKNGDDSPSEG

Query:  IIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDY
           L S+  ND+  WI+DSGA+DHMTF   D +     +++ + NANG++YPV  AG + ++  L+LS+ L+VPSL+ +LMSVS++T D NCVV MYS +
Subjt:  IIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDY

Query:  FIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQTEVQNK---IWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNK
         ++QDILTKEIIGRG +R GLY+++D    RAN +     V NK   IW WH RLGHPS GY+K L P LF N   + F C+TCI AKSHRVSY  S NK
Subjt:  FIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQTEVQNK---IWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNK

Query:  SNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCV
        S  PF LIHSDVWGP+PV++  G++WFV+F+DDCTRMTW+YLLK KDEV  VFK F+ M++TQF   I+  RSDNGGE++ +  + +F   G+ HETSC 
Subjt:  SNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCV

Query:  GTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAV
         TPQQNG+AERKNRHILE +RALL   HVP+++WD ++  AV++LNR+PTK+  FQTPLK L  H  +P+VL + P+IFGCVA+VH+ K QR+KL PCAV
Subjt:  GTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAV

Query:  RCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHEAFF-KPTIQSD-QGENSSESDVWKNKDLLSFQGWTLYGQDGVSGTDESDGNGNGTDESDGDGNG
        RC+FLG+G ++KGY+CFD  +++ Y+TMDVTF+E + FF  P   S  QGE   E   W             +G + +   D      +G D  + D   
Subjt:  RCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHEAFF-KPTIQSD-QGENSSESDVWKNKDLLSFQGWTLYGQDGVSGTDESDGNGNGTDESDGDGNG

Query:  ATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSK-----------YTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVD
               ++MY R +  ++   ESE+E       +P  P S E+  +VS PT+            Y LP R NRG PP RY PD +  ++SKYPIAN+V 
Subjt:  ATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSK-----------YTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVD

Query:  TKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQT-------
        T+ LS P+K F   L SC IP +V+EA SDP W+QA++ E+ AL  NKTW LV LP+GK  VGC+W+FSIKY A+G IDR KARLVAKGYTQT       
Subjt:  TKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQT-------

Query:  ------------------------------QECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLF
                                      +  FLHG+L+EEVYM+ PPGY +++   + CKL +ALYGLKQSPRAWFGRF  AM+ YGF+Q +SDHTLF
Subjt:  ------------------------------QECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLF

Query:  LKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPD
        LK    K+TALI+YVDDMI+TG+D +EI+ L+++LS EFEMKNLGGLKYFL IEV RS+QGI LSQRKY+LDLL+E+G+L+C+PADTP+V   KLGE+ D
Subjt:  LKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPD

Query:  QVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGG
        QVPA+KERYQRLVGKLIYL+HTRPDIAY VSVVSQFMH PSEDHMDAV+RI+RYLK  PGKG+ F KN HL+V G+TDADWAG++SDR+ST+GYFTFVGG
Subjt:  QVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGG

Query:  NLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
        NLVTWRSKKQ VVALSSAEAEFRG+AKG+CELLWLRRLL E+GFAP+  M+L+CD++ AIDISHNP
Subjt:  NLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP

A0A5A7T8G9 Putative polyprotein (Retrotrasposon protein)0.0e+0093.25Show/hide
Query:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
        MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
Subjt:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW

Query:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
        MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
Subjt:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE

Query:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR
        QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV    +      
Subjt:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR

Query:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLK---SMDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV
        + +E +     V +T +++  K +          K   K    K     DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV
Subjt:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLK---SMDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV

Query:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
        DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
Subjt:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE

Query:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
        RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
Subjt:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS

Query:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
        IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
Subjt:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS

Query:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
        RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
Subjt:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD

Query:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
        SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT     YGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
Subjt:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA

Query:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
        VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
Subjt:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS

Query:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------
        DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ                                   
Subjt:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------

Query:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA
            FLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA
Subjt:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA

Query:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
        TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
Subjt:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG

Query:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
        VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
Subjt:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI

Query:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
        CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
Subjt:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP

A0A5D3BHP1 Gag-pol polyprotein0.0e+0077.31Show/hide
Query:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
        MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
Subjt:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW

Query:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
        MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
Subjt:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE

Query:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR
        QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV    +      
Subjt:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR

Query:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLK---SMDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV
        + +E +     V +T +++  K +          K   K    K     DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEG IFLSSNLSND STWIV
Subjt:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLK---SMDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV

Query:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
        DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
Subjt:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE

Query:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
        RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAP   
Subjt:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS

Query:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
                                                                                                            
Subjt:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS

Query:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
                                                                                                 NQKGYKCFDVD
Subjt:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD

Query:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
        SRKWYVTMDVTF+EHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT     YGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
Subjt:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA

Query:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
        VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
Subjt:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS

Query:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------
        DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ                                   
Subjt:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------

Query:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA
            FLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIV GNDTQEIA
Subjt:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA

Query:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
        TLEKKLSGEFEMKNLGGLKYFL IEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
Subjt:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG

Query:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
        VSVVSQFMHNPSEDHMDAVMRI+RYLKGCPGK                               GYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
Subjt:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI

Query:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
        CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
Subjt:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP

A0A5D3CF38 Putative polyprotein (Retrotrasposon protein)0.0e+0092.9Show/hide
Query:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
        MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW
Subjt:  MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRW

Query:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
        MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE
Subjt:  MSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVE

Query:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR
        QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV    +      
Subjt:  QDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSV--QEIHCLCHR

Query:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLK---SMDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV
        + +E +     V +T +++  K +          K   K    K     DWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEG IFLSSNLSND STWIV
Subjt:  RYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLK---SMDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIV

Query:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
        DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE
Subjt:  DSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIE

Query:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
        RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS
Subjt:  RDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSS

Query:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
        IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS
Subjt:  IDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMS

Query:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
        RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD
Subjt:  RALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVD

Query:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
        SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT     YGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA
Subjt:  SRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWT----LYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRA

Query:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
        VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS
Subjt:  VFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKS

Query:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------
        DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ                                   
Subjt:  DPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ-----------------------------------

Query:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA
            FLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIV GNDTQEIA
Subjt:  --ECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIA

Query:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
        TLEKKLSGEFEMKNLGGLKYFL IEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG
Subjt:  TLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYG

Query:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
        VSVVSQFMHNPSEDHMDAVMRI+RYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI
Subjt:  VSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGI

Query:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
        CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
Subjt:  CELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-10424.42Show/hide
Query:  FNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVY
        F+G  Y +W   +  LLA +D L  ++G    P   D     W       K  I+  L  + +       TA+++ +++   +   +  +Q+  L  ++ 
Subjt:  FNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVY

Query:  GLKQTG--TLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEE---TYGHVRREDVRQSVMVGK
         LK +   +L  +F+    L  E+             +    ++E+  K+   L  L   +DGI      +T + T+ E   T   V+   + Q + +  
Subjt:  GLKQTG--TLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEE---TYGHVRREDVRQSVMVGK

Query:  STDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSVQEIHCLCHRRYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELKK
          + ++   M                N+   N+NN+ +      +  ++ R     K+           K  S           +  + K    YK +  
Subjt:  STDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSVQEIHCLCHRRYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELKK

Query:  KQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSN----DNSTWIVDSGASDHMTFTK---TDLTAECVTKKTEILNANGISYPVKCAGSIKVTS--QL
         + +    +V  A          S GI F+   ++N    DN  +++DSGASDH+   +   TD        K  +       Y  K  G +++ +  ++
Subjt:  KQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSN----DNSTWIVDSGASDHMTFTK---TDLTAECVTKKTEILNANGISYPVKCAGSIKVTS--QL

Query:  NLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLP
         L +VL     A  LMSV +L +    +    S   I ++ L        ++  G+  L ++            + +N    WH R GH S G + ++  
Subjt:  NLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLP

Query:  R-------LFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSN--SPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTM
        +       L  N  L+   C+ C+  K  R+ +    +K++   P  ++HSDV GP    ++D   +FV+F+D  T     YL+K K +V S+F+ F   
Subjt:  R-------LFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSN--SPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTM

Query:  IKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTK--INNFQT
         +  F   + +   DNG E++   ++ F V+ GI +  +   TPQ NGV+ER  R I E +R ++    +   FW +++L A Y++NR+P++  +++ +T
Subjt:  IKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTK--INNFQT

Query:  PLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVE------HEAFFKPTIQSDQGENS
        P +    H+  P +  L  ++FG   YVH+ K ++ K    + + +F+G+  N  G+K +D  + K+ V  DV   E          F+     D  E+ 
Subjt:  PLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVE------HEAFFKPTIQSDQGENS

Query:  SESDVWKNKDLLSFQGWTLYGQ-DGVSGTDESDGNGNGTDESDGD-------GNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQ
        +++    ++ ++  +      + D +    +S  + N    +D          N + + +    +    ++   F  ES+  +      E +G  +P + 
Subjt:  SESDVWKNKDLLSFQGWTLYGQ-DGVSGTDESDGNGNGTDESDGD-------GNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQ

Query:  LQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHK--KSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAK---SDPNWSQAMEAEMSALYNNKTWT
         +    T+++   +  +        E    R +  K+K  I+   +  SL+  V   N + +   +P + DE +      +W +A+  E++A   N TWT
Subjt:  LQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHK--KSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAK---SDPNWSQAMEAEMSALYNNKTWT

Query:  LVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQ-------------------------------------TQECFLHGELKEEVYMEQPPGY
        + + P+ K  V  RWVFS+KYN  G   RYKARLVA+G+TQ                                      +  FL+G LKEE+YM  P G 
Subjt:  LVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQ-------------------------------------TQECFLHGELKEEVYMEQPPGY

Query:  KSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFL--KRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKY
           +    VCKLNKA+YGLKQ+ R WF  F +A++   F     D  +++  K N  +   +++YVDD+++   D   +   ++ L  +F M +L  +K+
Subjt:  KSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFL--KRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKY

Query:  FLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIY-LAHTRPDIAYGVSVVSQFMHNPSEDHMDAV
        F+ I +   +  I LSQ  Y+  +L++  M +C    TP+   +           N    + L+G L+Y +  TRPD+   V+++S++    + +    +
Subjt:  FLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIY-LAHTRPDIAYGVSVVSQFMHNPSEDHMDAV

Query:  MRIIRYLKGCPGKGITFKKNGHLD--VSGFTDADWAGSVSDRRSTAGY-FTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFA
         R++RYLKG     + FKKN   +  + G+ D+DWAGS  DR+ST GY F     NL+ W +K+Q+ VA SS EAE+  + + + E LWL+ LL  +   
Subjt:  MRIIRYLKGCPGKGITFKKNGHLD--VSGFTDADWAGSVSDRRSTAGY-FTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFA

Query:  PTQAMDLYCDSRPAIDISHNP
            + +Y D++  I I++NP
Subjt:  PTQAMDLYCDSRPAIDISHNP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-14933.48Show/hide
Query:  SGKNGDDSPS-------EGIIFLSS-----NLSNDNSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLV----
        SG+  DD+ +         ++F++      +LS   S W+VD+ AS H T  + DL    V      +     SY  K AG   +  + N+   LV    
Subjt:  SGKNGDDSPS-------EGIIFLSS-----NLSNDNSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLV----

Query:  --VPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQD-ILTKE--IIGRGIERDGLYHLE-DLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPR
          VP L   L+S   L +D       Y  YF  Q   LTK   +I +G+ R  LY    ++  G  N   D+  V      WH+R+GH S   ++ L  +
Subjt:  --VPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQD-ILTKE--IIGRGIERDGLYHLE-DLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPR

Query:  --LFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKG
          +      TV  CD C+  K HRVS+  SS +  +  DL++SDV GP  + S+ GNK+FV FIDD +R  WVY+LK+KD+V  VF+KF+ +++ + G+ 
Subjt:  --LFKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKG

Query:  IKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHS
        +K  RSDNGGE+  +  +++    GI HE +  GTPQ NGVAER NR I+E  R++L    +P  FW +++  A Y++NR P+    F+ P +       
Subjt:  IKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHS

Query:  IPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQG
          S L    K+FGC A+ HVPK QR+KL   ++ C+F+G+G  + GY+ +D   +K   + DV F E E                               
Subjt:  IPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQG

Query:  WTLYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKR
                              D S+   NG       +   S N   A    ES  +EV +  E+P   +   +QL   +   ++  P +      P R
Subjt:  WTLYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKR

Query:  YEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQ---AMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGE
           +  R +  +YP   +V               +   + PE++ E  S P  +Q   AM+ EM +L  N T+ LVELP+GK P+ C+WVF +K + + +
Subjt:  YEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQ---AMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGE

Query:  IDRYKARLVAKGYTQ-------------------------------------TQECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAW
        + RYKARLV KG+ Q                                      +  FLHG+L+EE+YMEQP G++    K MVCKLNK+LYGLKQ+PR W
Subjt:  IDRYKARLVAKGYTQ-------------------------------------TQECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAW

Query:  FGRFCRAMQSYGFKQCDSDHTLFLKR-NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSK--QGIVLSQRKYILDLL
        + +F   M+S  + +  SD  ++ KR ++     L++YVDDM++ G D   IA L+  LS  F+MK+LG  +  L ++++R +  + + LSQ KYI  +L
Subjt:  FGRFCRAMQSYGFKQCDSDHTLFLKR-NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSK--QGIVLSQRKYILDLL

Query:  AEIGMLDCRPADTPVVQGVKLGE--FPDQVPAN----KERYQRLVGKLIY-LAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKK
            M + +P  TP+   +KL +   P  V       K  Y   VG L+Y +  TRPDIA+ V VVS+F+ NP ++H +AV  I+RYL+G  G  + F  
Subjt:  AEIGMLDCRPADTPVVQGVKLGE--FPDQVPAN----KERYQRLVGKLIY-LAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKK

Query:  NGHLDVSGFTDADWAGSVSDRRSTAGY-FTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHN
        +  + + G+TDAD AG + +R+S+ GY FTF GG  ++W+SK Q  VALS+ EAE+    +   E++WL+R L ELG    + + +YCDS+ AID+S N
Subjt:  NGHLDVSGFTDADWAGSVSDRRSTAGY-FTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHN

P92519 Uncharacterized mitochondrial protein AtMg008103.7e-4541.67Show/hide
Query:  LIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVV----QGVKLGEFPDQVPANK
        L++YVDD+++TG+    +  L  +LS  F MK+LG + YFL I++     G+ LSQ KY   +L   GMLDC+P  TP+       V   ++PD  P++ 
Subjt:  LIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVV----QGVKLGEFPDQVPANK

Query:  ERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWR
          ++ +VG L YL  TRPDI+Y V++V Q MH P+    D + R++RY+KG    G+   KN  L+V  F D+DWAG  S RRST G+ TF+G N+++W 
Subjt:  ERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWR

Query:  SKKQSVVALSSAEAEFRGVAKGICELLW
        +K+Q  V+ SS E E+R +A    EL W
Subjt:  SKKQSVVALSSAEAEFRGVAKGICELLW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.8e-15629.18Show/hide
Query:  NYGVWSQMVEVLLASKDKLGHINGERTQPASN---------DTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL
        NY +WS+ V  L    +  G ++G  T P +          +  Y RW   + ++   +L ++  ++     R +TA ++W++++  + + +    +  L
Subjt:  NYGVWSQMVEVLLASKDKLGHINGERTQPASN---------DTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL

Query:  EVKV-YGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVG
          ++    K T T++DY   +QGL     F +   +  P+D         D +V   L  L + +  +  +I      PT+ E         + + ++  
Subjt:  EVKV-YGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVG

Query:  KSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSVQEIHCLCHRRYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELK
        +S  L+ S A ++        V H +T +T+ N+N +        + RY  RN           ++ +K  Q SS          K  + K     +   
Subjt:  KSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSVQEIHCLCHRRYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELK

Query:  KKQAEAKR-GKVSIAVSGKNGDDSPS-------EGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLTA-ECVTKKTEILNANGISYPVKCAGSIKVTSQ
         +   AKR  ++   +S  N    PS          + L S  S++N  W++DSGA+ H+T    +L+  +  T   +++ A+G + P+   GS  ++++
Subjt:  KKQAEAKR-GKVSIAVSGKNGDDSPS-------EGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLTA-ECVTKKTEILNANGISYPVKCAGSIKVTSQ

Query:  ---LNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMK
           LNL N+L VP++   L+SV +L       V+ +   F ++D+ T   + +G  +D LY      +   +L    +       +WH RLGHP+     
Subjt:  ---LNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMK

Query:  KLLPRLFKNNNLTVFN-------CDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFY
         +L  +  N +L+V N       C  C+  KS++V ++ S+  S  P + I+SDVW  +P+ S D  +++V+F+D  TR TW+Y LK K +V   F  F 
Subjt:  KLLPRLFKNNNLTVFN-------CDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFY

Query:  TMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQT
         +++ +F   I  F SDNGGEF+   L ++F Q GI H TS   TP+ NG++ERK+RHI+E    LL    +P  +W  +  +AVY++NRLPT +   ++
Subjt:  TMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQT

Query:  PLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEH---EAFFKPTIQSDQGENSSES
        P + L    + P+   L  ++FGC  Y  +    + KL   + +CVFLG+   Q  Y C  + + + Y++  V F E+    + +  T+   Q +    S
Subjt:  PLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEH---EAFFKPTIQSDQGENSSES

Query:  DVWKNKDLLSFQGWTL---------------------YGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQP-NE
         VW     L  +   L                     +    VS ++      +    S        +G +P    ++ + +      S+N     P NE
Subjt:  DVWKNKDLLSFQGWTL---------------------YGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQP-NE

Query:  EPQ---GPMSPEDQLQVSLPTSKYTLPVRCNRGIPPK--RYEPD---EDRHKKSKYPI-ANFVDTKSLSGPVKRFNENLLSCKI-----PENVDEAKSDP
         P      +S   Q   S P+   +         PP    + P    +  +  ++ P+  + + T++ +G +K   +  L+  +     P    +A  D 
Subjt:  EPQ---GPMSPEDQLQVSLPTSKYTLPVRCNRGIPPK--RYEPD---EDRHKKSKYPI-ANFVDTKSLSGPVKRFNENLLSCKI-----PENVDEAKSDP

Query:  NWSQAMEAEMSALYNNKTWTLVELPQGKIP-VGCRWVFSIKYNANGEIDRYKARLVAKGYTQ-------------------------------------T
         W  AM +E++A   N TW LV  P   +  VGCRW+F+ KYN++G ++RYKARLVAKGY Q                                      
Subjt:  NWSQAMEAEMSALYNNKTWTLVELPQGKIP-VGCRWVFSIKYNANGEIDRYKARLVAKGYTQ-------------------------------------T

Query:  QECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIAT
           FL G L ++VYM QPPG+   +    VCKL KALYGLKQ+PRAW+      + + GF    SD +LF+ +  + +  +++YVDD+++TGND   +  
Subjt:  QECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIAT

Query:  LEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGV
            LS  F +K+   L YFL IE  R   G+ LSQR+YILDLLA   M+  +P  TP+    KL  +      +   Y+ +VG L YLA TRPDI+Y V
Subjt:  LEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGV

Query:  SVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGIC
        + +SQFMH P+E+H+ A+ RI+RYL G P  GI  KK   L +  ++DADWAG   D  ST GY  ++G + ++W SKKQ  V  SS EAE+R VA    
Subjt:  SVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGIC

Query:  ELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP
        E+ W+  LL ELG   T+   +YCD+  A  +  NP
Subjt:  ELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.7e-15428.64Show/hide
Query:  NYGVWSQMVEVLLASKDKLGHINGERTQPASN---------DTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL
        NY +WS+ V  L    +  G ++G    P +          +  Y RW   + ++   IL ++  ++     R +TA ++W++++               
Subjt:  NYGVWSQMVEVLLASKDKLGHINGERTQPASN---------DTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL

Query:  EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGK
          K+Y     G +    +  +       F +   +  P+D         D +V   L  L D +  +  +I      P++ E         + + ++  +
Subjt:  EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGK

Query:  STDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSVQEIHCLCHRRYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELKK
        S  L+ + A +V     V  V H +TN T+ N NN         +R Y   N           +        S +     K  +    + S+  +   + 
Subjt:  STDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSVQEIHCLCHRRYRERNPVLIKVLYTLQHDTTKLKQFSSAPTVAAKAIVKTPVLKSMDWYKELKK

Query:  KQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLT-AECVTKKTEILNANGISYPVKCAGSIKV---TSQLNLSNV
         Q    +   +   S      +P +    L+ N   + + W++DSGA+ H+T    +L+  +  T   +++ A+G + P+   GS  +   +  L+L+ V
Subjt:  KQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLT-AECVTKKTEILNANGISYPVKCAGSIKV---TSQLNLSNV

Query:  LVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKN
        L VP++   L+SV +L       V+ +   F ++D+ T   + +G  +D LY      +   ++            +WH RLGHPSL     +L  +  N
Subjt:  LVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKN

Query:  NNLTVFN-------CDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGK
        ++L V N       C  C   KSH+V ++ S+  S+ P + I+SDVW  +P+ SID  +++V+F+D  TR TW+Y LK K +V   F  F ++++ +F  
Subjt:  NNLTVFN-------CDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGK

Query:  GIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHH
         I    SDNGGEF+  VL+D+  Q GI H TS   TP+ NG++ERK+RHI+EM   LL    VP  +W  +  +AVY++NRLPT +   Q+P + L    
Subjt:  GIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHH

Query:  SIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHEAFFKPT---IQSDQGENSSESDVWKNKDLL
          P+   L  K+FGC  Y  +    R KL   + +C F+G+   Q  Y C  + + + Y +  V F E    F  T   + + Q + S  +  W +   L
Subjt:  SIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHEAFFKPT---IQSDQGENSSESDVWKNKDLL

Query:  SFQGWTL-----------------YGQDGVSGTDESDGN--------GNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENE-EVIQPNEEPQGP
              L                      +  T  S  N         + ++ +    NG     +P +  + N N  +    + N      PN+    P
Subjt:  SFQGWTL-----------------YGQDGVSGTDESDGN--------GNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENE-EVIQPNEEPQGP

Query:  MSPEDQLQVSLPTSKYTLPVRCNRG------IPPKRYEPDEDRHKKSKYPIANFVDTKSLSG---PVKRFN--ENLLSCKIPENVDEAKSDPNWSQAMEA
         SP     +  P++  + P   +        +PP    P   +         + + T++  G   P ++++   +L +   P    +A  D  W QAM +
Subjt:  MSPEDQLQVSLPTSKYTLPVRCNRG------IPPKRYEPDEDRHKKSKYPIANFVDTKSLSG---PVKRFN--ENLLSCKIPENVDEAKSDPNWSQAMEA

Query:  EMSALYNNKTWTLVELPQGKIP-VGCRWVFSIKYNANGEIDRYKARLVAKGYTQ-------------------------------------TQECFLHGE
        E++A   N TW LV  P   +  VGCRW+F+ K+N++G ++RYKARLVAKGY Q                                         FL G 
Subjt:  EMSALYNNKTWTLVELPQGKIP-VGCRWVFSIKYNANGEIDRYKARLVAKGYTQ-------------------------------------TQECFLHGE

Query:  LKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGE
        L +EVYM QPPG+   +    VC+L KA+YGLKQ+PRAW+      + + GF    SD +LF+ +    +  +++YVDD+++TGNDT  +      LS  
Subjt:  LKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGE

Query:  FEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMH
        F +K    L YFL IE  R  QG+ LSQR+Y LDLLA   ML  +P  TP+    KL         +   Y+ +VG L YLA TRPD++Y V+ +SQ+MH
Subjt:  FEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMH

Query:  NPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRL
         P++DH +A+ R++RYL G P  GI  KK   L +  ++DADWAG   D  ST GY  ++G + ++W SKKQ  V  SS EAE+R VA    EL W+  L
Subjt:  NPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRL

Query:  LGELGFAPTQAMDLYCDSRPAIDISHNP
        L ELG   +    +YCD+  A  +  NP
Subjt:  LGELGFAPTQAMDLYCDSRPAIDISHNP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.4e-9337.32Show/hide
Query:  YPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQ
        + I+ F+  + +S     F   +   K P   +EAK    W  AM+ E+ A+    TW +  LP  K P+GC+WV+ IKYN++G I+RYKARLVAKGYTQ
Subjt:  YPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQ

Query:  TQ-------------------------------------ECFLHGELKEEVYMEQPPGYKSTNAKPM----VCKLNKALYGLKQSPRAWFGRFCRAMQSY
         +                                       FL+G+L EE+YM+ PPGY +     +    VC L K++YGLKQ+ R WF +F   +  +
Subjt:  TQ-------------------------------------ECFLHGELKEEVYMEQPPGYKSTNAKPM----VCKLNKALYGLKQSPRAWFGRFCRAMQSY

Query:  GFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTP
        GF Q  SDHT FLK        +++YVDD+I+  N+   +  L+ +L   F++++LG LKYFL +E+ RS  GI + QRKY LDLL E G+L C+P+  P
Subjt:  GFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTP

Query:  VVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDR
        +   V           + + Y+RL+G+L+YL  TR DI++ V+ +SQF   P   H  AVM+I+ Y+KG  G+G+ +     + +  F+DA +      R
Subjt:  VVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDR

Query:  RSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHN
        RST GY  F+G +L++W+SKKQ VV+ SSAEAE+R ++    E++WL +   EL    ++   L+CD+  AI I+ N
Subjt:  RSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHN

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.8e-1035.37Show/hide
Query:  IYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFV
        +YL  TRPD+ + V+ +SQF        M AV +++ Y+KG  G+G+ +     L +  F D+DWA     RRS  G+ + V
Subjt:  IYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFV

ATMG00810.1 DNA/RNA polymerases superfamily protein2.6e-4641.67Show/hide
Query:  LIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVV----QGVKLGEFPDQVPANK
        L++YVDD+++TG+    +  L  +LS  F MK+LG + YFL I++     G+ LSQ KY   +L   GMLDC+P  TP+       V   ++PD  P++ 
Subjt:  LIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPADTPVV----QGVKLGEFPDQVPANK

Query:  ERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWR
          ++ +VG L YL  TRPDI+Y V++V Q MH P+    D + R++RY+KG    G+   KN  L+V  F D+DWAG  S RRST G+ TF+G N+++W 
Subjt:  ERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWR

Query:  SKKQSVVALSSAEAEFRGVAKGICELLW
        +K+Q  V+ SS E E+R +A    EL W
Subjt:  SKKQSVVALSSAEAEFRGVAKGICELLW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.1e-1550Show/hide
Query:  KIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ
        K P++V  A  DP W QAM+ E+ AL  NKTW LV  P  +  +GC+WVF  K +++G +DR KARLVAKG+ Q +
Subjt:  KIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAAGAAACGAATAACCGGCAACTCATTGTGGATGGCAATACTGTGTCGTCCAAGTTCATTGAAGATCTCACCGAGAAATTAACCAAGGCGATGTTTCGAAACTT
TCAACCGCCGCTTGGTCCAGCCACAGGACCTGAGGGAGGGCTACCACCAATGACGATAGTGTTCAACGGCCGTAACTATGGTGTATGGTCGCAAATGGTGGAGGTGCTCC
TTGCATCTAAGGACAAACTCGGCCATATCAATGGCGAACGAACTCAACCTGCGTCTAATGACACAGGCTACAAGCGATGGATGTCTGACAACTCCATGGTAAAAGGTTGG
ATTCTCAGCTCCCTTGACCCTAATTTGATTGGGAATTTCATACGGTTTTCTACTGCCAAGGAGGTATGGGATAGCATCAAATTAACCTTTTTTGATGGTAATGACAACAC
CCAGATCTATGATCTTGAAGTAAAGGTTTATGGGTTAAAACAAACCGGTACACTAGAAGATTATTTCTATACTCTGCAAGGACTTTGGAAGGAGATTGAGTTTAGAAAAC
CAAACCCTATGACCTGTCCCATAGATATTGAGAAATATAACAGAGTAGAGCAAGATAGAAAGGTGTATATTTTCCTTGGGGGTCTTGAGGACAGATTCGATGGCATTCGG
GCCGAGATCCTTAGGATGACACCATTACCAACTGTTGAAGAGACGTATGGACATGTGCGTCGTGAAGATGTGCGTCAATCTGTTATGGTTGGCAAGAGCACGGATTTGTC
TAACTCAGTGGCAATGATTGTGCAAGGTGCAAAACAGGTTGCTGGTGTTCCTCACATCAGTACTAACTCAACCCATGGGAACAGTAACAACTCGGTCCAGGAGATCCACT
GTCTCTGTCATCGCAGGTACAGGGAAAGGAATCCAGTTCTAATCAAGGTGCTGTACACTTTACAACACGATACAACAAAACTGAAGCAGTTCTCAAGTGCACCCACTGTG
GCGGCAAAGGCCATAGTAAAGACACCTGTTTTAAAATCCATGGATTGGTACAAGGAGTTGAAAAAGAAGCAAGCTGAGGCAAAAAGAGGCAAGGTCTCCATTGCAGTATC
GGGTAAGAATGGTGATGATTCTCCAAGTGAAGGTATAATCTTTCTCTCCTCCAACTTGTCTAATGATAACAGTACATGGATAGTTGATTCTGGTGCATCTGATCACATGA
CATTCACTAAAACTGATCTTACTGCTGAGTGTGTTACAAAGAAAACTGAGATATTAAATGCTAACGGAATCTCCTATCCAGTGAAATGTGCAGGAAGTATTAAAGTAACC
TCTCAGTTAAACTTATCAAATGTACTGGTAGTACCATCTCTGGCTACGAGGCTAATGTCAGTCAGTAAATTGACAAAGGACAGAAATTGTGTGGTAAAGATGTATTCTGA
CTATTTTATAATCCAGGACATCCTCACGAAGGAGATCATTGGGCGTGGTATTGAAAGAGATGGGTTGTATCATCTTGAAGATCTTAAAGCTGGAAGAGCAAATTTAGTGG
TTGATCAAACGGAAGTGCAGAATAAAATTTGGACATGGCACCGACGTTTGGGTCATCCTTCTCTTGGGTATATGAAGAAACTTCTGCCTCGTTTATTCAAGAATAACAAT
CTTACTGTGTTTAATTGTGATACTTGCATAAAAGCAAAGAGTCATCGTGTGTCGTATGCACCAAGTTCAAATAAAAGTAATAGTCCTTTTGATTTGATTCACTCAGACGT
GTGGGGACCCGCTCCTGTAAGCTCTATAGATGGAAATAAATGGTTTGTGTTATTTATCGATGACTGTACAAGAATGACATGGGTTTACCTGTTAAAGAGTAAAGATGAAG
TGCCAAGTGTTTTTAAAAAATTTTATACCATGATTAAAACACAGTTTGGAAAAGGGATTAAATTTTTTCGGAGTGATAACGGCGGGGAGTTCATTGGAAAAGTACTTAAA
GATTTCTTTGTTCAAATGGGAATTGTACATGAGACCTCGTGTGTAGGGACACCGCAACAAAATGGGGTCGCCGAACGGAAAAATAGACACATCTTGGAAATGTCACGAGC
ACTGTTGTTTGAATATCATGTTCCGACAAAATTCTGGGATAAATCAATCCTTATGGCAGTATATGTATTAAATCGATTACCCACAAAGATCAATAATTTCCAAACTCCTC
TAAAAACATTAGAAAAGCATCACTCCATTCCATCGGTTCTTAGTCTGACACCAAAAATATTCGGGTGTGTTGCATATGTCCACGTACCAAAAACACAACGCTCAAAACTT
TCACCTTGTGCTGTACGGTGTGTGTTTCTCGGTTTTGGGCAGAATCAAAAAGGGTATAAGTGCTTTGATGTAGACTCTCGAAAATGGTACGTTACGATGGATGTTACGTT
CGTGGAACATGAGGCATTCTTCAAACCCACCATACAATCGGATCAGGGGGAGAACAGTAGTGAATCCGATGTATGGAAGAACAAGGATTTATTAAGTTTTCAGGGATGGA
CCCTGTATGGCCAGGATGGCGTTTCTGGTACTGATGAATCAGATGGAAATGGTAATGGAACTGATGAATCGGATGGAGATGGTAATGGGGCTACGGATGGAAATAAACCA
CTAGAAATGTACTCTAGAAACAAAAACAGAGCAGTCTTTGGAATGGAAAGTGAAAATGAAGAAGTGATACAACCCAATGAAGAACCACAAGGCCCAATGTCTCCTGAGGA
TCAGCTTCAGGTAAGTCTTCCTACCTCTAAGTATACGTTACCAGTCAGATGCAATAGGGGTATACCACCTAAACGGTATGAACCAGATGAAGATAGGCATAAAAAGAGCA
AATACCCCATTGCAAATTTCGTAGATACCAAATCTTTATCTGGACCTGTAAAAAGGTTTAATGAGAACTTATTGTCCTGTAAGATACCAGAAAATGTAGATGAGGCAAAA
TCTGATCCGAATTGGTCACAAGCTATGGAAGCTGAAATGAGTGCATTATATAACAACAAAACTTGGACTTTGGTGGAACTACCACAAGGGAAAATACCTGTAGGGTGCAG
ATGGGTGTTTTCAATTAAGTACAATGCTAATGGGGAGATTGACAGATACAAAGCGAGACTGGTTGCGAAAGGGTATACACAAACACAGGAATGCTTTTTACATGGAGAGC
TGAAAGAAGAAGTGTACATGGAGCAACCCCCAGGATACAAAAGTACAAATGCCAAGCCGATGGTGTGTAAATTGAACAAAGCGCTGTACGGATTGAAGCAATCCCCACGT
GCCTGGTTCGGCAGATTTTGTAGAGCAATGCAAAGTTACGGATTCAAACAATGTGACTCTGACCACACATTGTTCCTGAAAAGAAACCAGGAGAAGTTGACAGCCTTAAT
CATTTATGTCGATGACATGATAGTGACGGGAAATGACACACAAGAAATTGCAACTCTGGAAAAGAAACTCTCAGGTGAATTCGAGATGAAGAATTTGGGAGGATTGAAGT
ATTTTCTTAGAATTGAGGTAATGCGATCAAAACAAGGTATTGTATTATCTCAAAGAAAATACATTTTGGATCTACTCGCAGAGATTGGAATGCTGGACTGTAGACCAGCT
GATACACCTGTAGTTCAAGGGGTTAAATTAGGAGAATTTCCAGACCAAGTACCAGCTAACAAAGAGAGGTACCAAAGGTTGGTAGGAAAGTTGATCTATCTAGCACATAC
AAGGCCGGATATAGCCTATGGGGTTAGTGTGGTAAGTCAGTTTATGCACAACCCAAGTGAAGACCACATGGACGCAGTCATGCGAATAATACGATATCTGAAGGGATGCC
CAGGCAAGGGTATAACTTTCAAGAAGAATGGACACCTTGATGTTTCAGGTTTCACTGATGCTGATTGGGCAGGGAGTGTAAGTGATAGAAGGTCCACAGCAGGTTACTTT
ACCTTTGTCGGAGGTAATCTAGTAACTTGGAGAAGTAAGAAACAAAGTGTTGTAGCACTCTCTAGTGCAGAAGCTGAATTCAGAGGAGTTGCGAAGGGAATATGTGAATT
ATTGTGGTTGAGGAGGTTACTCGGAGAATTGGGATTCGCACCTACTCAGGCAATGGACTTATATTGTGACAGTCGACCTGCGATTGATATCTCTCACAACCCGTACAGCA
TGACAGAACTAAACATGTGGAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAAGAAACGAATAACCGGCAACTCATTGTGGATGGCAATACTGTGTCGTCCAAGTTCATTGAAGATCTCACCGAGAAATTAACCAAGGCGATGTTTCGAAACTT
TCAACCGCCGCTTGGTCCAGCCACAGGACCTGAGGGAGGGCTACCACCAATGACGATAGTGTTCAACGGCCGTAACTATGGTGTATGGTCGCAAATGGTGGAGGTGCTCC
TTGCATCTAAGGACAAACTCGGCCATATCAATGGCGAACGAACTCAACCTGCGTCTAATGACACAGGCTACAAGCGATGGATGTCTGACAACTCCATGGTAAAAGGTTGG
ATTCTCAGCTCCCTTGACCCTAATTTGATTGGGAATTTCATACGGTTTTCTACTGCCAAGGAGGTATGGGATAGCATCAAATTAACCTTTTTTGATGGTAATGACAACAC
CCAGATCTATGATCTTGAAGTAAAGGTTTATGGGTTAAAACAAACCGGTACACTAGAAGATTATTTCTATACTCTGCAAGGACTTTGGAAGGAGATTGAGTTTAGAAAAC
CAAACCCTATGACCTGTCCCATAGATATTGAGAAATATAACAGAGTAGAGCAAGATAGAAAGGTGTATATTTTCCTTGGGGGTCTTGAGGACAGATTCGATGGCATTCGG
GCCGAGATCCTTAGGATGACACCATTACCAACTGTTGAAGAGACGTATGGACATGTGCGTCGTGAAGATGTGCGTCAATCTGTTATGGTTGGCAAGAGCACGGATTTGTC
TAACTCAGTGGCAATGATTGTGCAAGGTGCAAAACAGGTTGCTGGTGTTCCTCACATCAGTACTAACTCAACCCATGGGAACAGTAACAACTCGGTCCAGGAGATCCACT
GTCTCTGTCATCGCAGGTACAGGGAAAGGAATCCAGTTCTAATCAAGGTGCTGTACACTTTACAACACGATACAACAAAACTGAAGCAGTTCTCAAGTGCACCCACTGTG
GCGGCAAAGGCCATAGTAAAGACACCTGTTTTAAAATCCATGGATTGGTACAAGGAGTTGAAAAAGAAGCAAGCTGAGGCAAAAAGAGGCAAGGTCTCCATTGCAGTATC
GGGTAAGAATGGTGATGATTCTCCAAGTGAAGGTATAATCTTTCTCTCCTCCAACTTGTCTAATGATAACAGTACATGGATAGTTGATTCTGGTGCATCTGATCACATGA
CATTCACTAAAACTGATCTTACTGCTGAGTGTGTTACAAAGAAAACTGAGATATTAAATGCTAACGGAATCTCCTATCCAGTGAAATGTGCAGGAAGTATTAAAGTAACC
TCTCAGTTAAACTTATCAAATGTACTGGTAGTACCATCTCTGGCTACGAGGCTAATGTCAGTCAGTAAATTGACAAAGGACAGAAATTGTGTGGTAAAGATGTATTCTGA
CTATTTTATAATCCAGGACATCCTCACGAAGGAGATCATTGGGCGTGGTATTGAAAGAGATGGGTTGTATCATCTTGAAGATCTTAAAGCTGGAAGAGCAAATTTAGTGG
TTGATCAAACGGAAGTGCAGAATAAAATTTGGACATGGCACCGACGTTTGGGTCATCCTTCTCTTGGGTATATGAAGAAACTTCTGCCTCGTTTATTCAAGAATAACAAT
CTTACTGTGTTTAATTGTGATACTTGCATAAAAGCAAAGAGTCATCGTGTGTCGTATGCACCAAGTTCAAATAAAAGTAATAGTCCTTTTGATTTGATTCACTCAGACGT
GTGGGGACCCGCTCCTGTAAGCTCTATAGATGGAAATAAATGGTTTGTGTTATTTATCGATGACTGTACAAGAATGACATGGGTTTACCTGTTAAAGAGTAAAGATGAAG
TGCCAAGTGTTTTTAAAAAATTTTATACCATGATTAAAACACAGTTTGGAAAAGGGATTAAATTTTTTCGGAGTGATAACGGCGGGGAGTTCATTGGAAAAGTACTTAAA
GATTTCTTTGTTCAAATGGGAATTGTACATGAGACCTCGTGTGTAGGGACACCGCAACAAAATGGGGTCGCCGAACGGAAAAATAGACACATCTTGGAAATGTCACGAGC
ACTGTTGTTTGAATATCATGTTCCGACAAAATTCTGGGATAAATCAATCCTTATGGCAGTATATGTATTAAATCGATTACCCACAAAGATCAATAATTTCCAAACTCCTC
TAAAAACATTAGAAAAGCATCACTCCATTCCATCGGTTCTTAGTCTGACACCAAAAATATTCGGGTGTGTTGCATATGTCCACGTACCAAAAACACAACGCTCAAAACTT
TCACCTTGTGCTGTACGGTGTGTGTTTCTCGGTTTTGGGCAGAATCAAAAAGGGTATAAGTGCTTTGATGTAGACTCTCGAAAATGGTACGTTACGATGGATGTTACGTT
CGTGGAACATGAGGCATTCTTCAAACCCACCATACAATCGGATCAGGGGGAGAACAGTAGTGAATCCGATGTATGGAAGAACAAGGATTTATTAAGTTTTCAGGGATGGA
CCCTGTATGGCCAGGATGGCGTTTCTGGTACTGATGAATCAGATGGAAATGGTAATGGAACTGATGAATCGGATGGAGATGGTAATGGGGCTACGGATGGAAATAAACCA
CTAGAAATGTACTCTAGAAACAAAAACAGAGCAGTCTTTGGAATGGAAAGTGAAAATGAAGAAGTGATACAACCCAATGAAGAACCACAAGGCCCAATGTCTCCTGAGGA
TCAGCTTCAGGTAAGTCTTCCTACCTCTAAGTATACGTTACCAGTCAGATGCAATAGGGGTATACCACCTAAACGGTATGAACCAGATGAAGATAGGCATAAAAAGAGCA
AATACCCCATTGCAAATTTCGTAGATACCAAATCTTTATCTGGACCTGTAAAAAGGTTTAATGAGAACTTATTGTCCTGTAAGATACCAGAAAATGTAGATGAGGCAAAA
TCTGATCCGAATTGGTCACAAGCTATGGAAGCTGAAATGAGTGCATTATATAACAACAAAACTTGGACTTTGGTGGAACTACCACAAGGGAAAATACCTGTAGGGTGCAG
ATGGGTGTTTTCAATTAAGTACAATGCTAATGGGGAGATTGACAGATACAAAGCGAGACTGGTTGCGAAAGGGTATACACAAACACAGGAATGCTTTTTACATGGAGAGC
TGAAAGAAGAAGTGTACATGGAGCAACCCCCAGGATACAAAAGTACAAATGCCAAGCCGATGGTGTGTAAATTGAACAAAGCGCTGTACGGATTGAAGCAATCCCCACGT
GCCTGGTTCGGCAGATTTTGTAGAGCAATGCAAAGTTACGGATTCAAACAATGTGACTCTGACCACACATTGTTCCTGAAAAGAAACCAGGAGAAGTTGACAGCCTTAAT
CATTTATGTCGATGACATGATAGTGACGGGAAATGACACACAAGAAATTGCAACTCTGGAAAAGAAACTCTCAGGTGAATTCGAGATGAAGAATTTGGGAGGATTGAAGT
ATTTTCTTAGAATTGAGGTAATGCGATCAAAACAAGGTATTGTATTATCTCAAAGAAAATACATTTTGGATCTACTCGCAGAGATTGGAATGCTGGACTGTAGACCAGCT
GATACACCTGTAGTTCAAGGGGTTAAATTAGGAGAATTTCCAGACCAAGTACCAGCTAACAAAGAGAGGTACCAAAGGTTGGTAGGAAAGTTGATCTATCTAGCACATAC
AAGGCCGGATATAGCCTATGGGGTTAGTGTGGTAAGTCAGTTTATGCACAACCCAAGTGAAGACCACATGGACGCAGTCATGCGAATAATACGATATCTGAAGGGATGCC
CAGGCAAGGGTATAACTTTCAAGAAGAATGGACACCTTGATGTTTCAGGTTTCACTGATGCTGATTGGGCAGGGAGTGTAAGTGATAGAAGGTCCACAGCAGGTTACTTT
ACCTTTGTCGGAGGTAATCTAGTAACTTGGAGAAGTAAGAAACAAAGTGTTGTAGCACTCTCTAGTGCAGAAGCTGAATTCAGAGGAGTTGCGAAGGGAATATGTGAATT
ATTGTGGTTGAGGAGGTTACTCGGAGAATTGGGATTCGCACCTACTCAGGCAATGGACTTATATTGTGACAGTCGACCTGCGATTGATATCTCTCACAACCCGTACAGCA
TGACAGAACTAAACATGTGGAGATAG
Protein sequenceShow/hide protein sequence
MAEETNNRQLIVDGNTVSSKFIEDLTEKLTKAMFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRWMSDNSMVKGW
ILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIR
AEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAGVPHISTNSTHGNSNNSVQEIHCLCHRRYRERNPVLIKVLYTLQHDTTKLKQFSSAPTV
AAKAIVKTPVLKSMDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVT
SQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNN
LTVFNCDTCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKKFYTMIKTQFGKGIKFFRSDNGGEFIGKVLK
DFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKL
SPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTLYGQDGVSGTDESDGNGNGTDESDGDGNGATDGNKP
LEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAK
SDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQECFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPR
AWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIVLSQRKYILDLLAEIGMLDCRPA
DTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYF
TFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNPYSMTELNMWR