; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0007837 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0007837
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag-pol polyprotein
Genome locationchr10:15466483..15471846
RNA-Seq ExpressionIVF0007837
SyntenyIVF0007837
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035966.1 F5J5.1 [Cucumis melo var. makuwa]0.087.31Show/hide
Query:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR
        MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR
Subjt:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR

Query:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN
        SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN
Subjt:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN

Query:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK
        QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK
Subjt:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK

Query:  Q---------------------------------------------------------------------TRSDSVMETINVVINDLDSSIKQMNDEEDE
        +                                                                      RSDSVMETINVVINDLDSSIKQMNDEEDE
Subjt:  Q---------------------------------------------------------------------TRSDSVMETINVVINDLDSSIKQMNDEEDE

Query:  TPNMSEVRTMSIVEESKADNSSNGP-----GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT
        TPNMSEVRTMSIVEESKADNSSNGP     GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLV      
Subjt:  TPNMSEVRTMSIVEESKADNSSNGP-----GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT

Query:  DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ
                      AQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ
Subjt:  DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ

Query:  APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG
        APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL         SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG
Subjt:  APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG

Query:  SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE
        SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE
Subjt:  SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE

Query:  KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRR---GHSPSVH
        KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEAT S +            LL  P    + E  S++     +L     +     GHSPSVH
Subjt:  KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRR---GHSPSVH

Query:  PSLSKLPTLQPDAVPAHILEIATA------TEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHK
        PSLSKLPTLQPDAVPAHILEIATA      TEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHK
Subjt:  PSLSKLPTLQPDAVPAHILEIATA------TEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHK

Query:  ESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALS
        ESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALS
Subjt:  ESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALS

Query:  VKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRL
        VKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRL
Subjt:  VKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRL

Query:  FQGSHMPDIDHDVHPTQGPHI
        FQGSHMPDIDHDVHPTQGPHI
Subjt:  FQGSHMPDIDHDVHPTQGPHI

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.054.24Show/hide
Query:  KIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLCDQGYKVSFDDIGCV
        ++     +T DD WYFDSGCSRHMTGNRSYF NL DCV GHVTF +GAKGKIIAKGNIN ++LPRLND+R ++ L  +L+   Q+   G           
Subjt:  KIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLCDQGYKVSFDDIGCV

Query:  VMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK-----QTRSDSVMETIN
                   GKR+                         +H+K +  S +G+                     F G  Q  +       RS  VMETIN
Subjt:  VMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK-----QTRSDSVMETIN

Query:  VVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSS-----------------------------NGP-----GDPSVGMQTRRKDKIDYLKMVA
        VVINDL+ +IKQ+NDEEDET NMSE RT S VE  KA   S                             N P     GDPS GMQTRRK+KIDY+KMVA
Subjt:  VVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSS-----------------------------NGP-----GDPSVGMQTRRKDKIDYLKMVA

Query:  DLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEA-------------TDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAI
        DL YI T+EPSTVDS ++DEYWLNAMQEELLQFR+NN+WTLVSKPE              TDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAI
Subjt:  DLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEA-------------TDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAI

Query:  RLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
        RLLLGISCIQKFKLYQMDVKSAFL+GYLN EVYVAQPKGFVD EHPKH+YKLNKALYGLKQA RAWY++LTVYLRG+GYSRGEIDKTLFI RKSDQLLVA
Subjt:  RLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA

Query:  QIYVDDIIFGGFPQDLS----EFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG-----SLLYLTASRSDIAYAVGICA------------RYQTDPR
        QIYVDDIIF GFP DL     EFEMSMVGELSCFLGLQIKQKND IFISQEK          L      R+  A  V +              R   DPR
Subjt:  QIYVDDIIFGGFPQDLS----EFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG-----SLLYLTASRSDIAYAVGICA------------RYQTDPR

Query:  ITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRK----------------------------------------------------
        ITHLE VKRILKYVHG SDFGMMYSY+TTPTLVGY D +WAGSTDD K                                                    
Subjt:  ITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRK----------------------------------------------------

Query:  ----------------------------------------------------------------IHHVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQ
                                                                        I H RVR R+FKSTPPRR Y LPSEKVQGEA+SRLQ
Subjt:  ----------------------------------------------------------------IHHVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQ

Query:  ESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPG-FHSLSRELINR----RGHSPSVHPSLSKLPTL
        ESLRSEA+P+VGESAAPV+                               KPSEPV  ERL SDP G  HS     I          P   P++  +PT 
Subjt:  ESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPG-FHSLSRELINR----RGHSPSVHPSLSKLPTL

Query:  QPDAVPAHILEIATAT--------------------EIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGI
               HI  IATA                     EIP EDI PPTDDPIAPSSEGR +SPK                      +K        P+  I
Subjt:  QPDAVPAHILEIATAT--------------------EIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGI

Query:  SFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIP
        SFHH++SVQRWKFVM+RRI +ELI++FIVNLPD+FNDPSSADYQTVH RGFKF+IS AVINGFLGNTVDIDCSPSC T E+LATVLS  TLSTW VN IP
Subjt:  SFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIP

Query:  AAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIA
        AAALSVKYAILHKI I NWF S HA SI  ALGTFLYQ CNDDKVDTG FIYNQLLRHVGSFGVKVPIA P+LFSSLLLHLN VVLT +DA GPEPKTI 
Subjt:  AAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIA

Query:  LRYRLFQGSHMPDIDHDVHPTQGPHI
        L YRLFQGSH+PDIDHDVHPT+GP I
Subjt:  LRYRLFQGSHMPDIDHDVHPTQGPHI

TYK11265.1 uncharacterized protein E5676_scaffold227G001210 [Cucumis melo var. makuwa]0.052.02Show/hide
Query:  KNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNRSCAQPC
        +NLDSILK GHNGS RYGLGF  SASSSKATSEIKF+PAS+RVEYDTIH ETGIR  VKSLG T YYCG+KGHIR +  +L    L            P 
Subjt:  KNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNRSCAQPC

Query:  MVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLC---
         +                                + G R+  + +                          DD  R   V ++ G    +     LC   
Subjt:  MVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLC---

Query:  --DQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYH--WNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIG
          ++G K            +E+++ M     AD  Y   W+         +RS+Q       L ++      +V  N                       
Subjt:  --DQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYH--WNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIG

Query:  KQTRSDSVMETINVVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSSNGPG----------------------------------DPSVGMQT
           RS++VMETINVVINDLDS IKQMNDEEDE  NMSEVRTMS VEESKADNS +GPG                                  DPSVGMQT
Subjt:  KQTRSDSVMETINVVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSSNGPG----------------------------------DPSVGMQT

Query:  RRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAI
        RRKDKIDYLKMVAD  YI TI PSTVDS +KDEYWLNAMQEELLQF+RNN+WTLVSKP+  D                   ++  DF   +         
Subjt:  RRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAI

Query:  RLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
               CIQKFKLYQMDVKS FLNGYLN EVYVAQPK FVD EHPKHVYKLNKALYGLKQAPRAWY++LTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
Subjt:  RLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA

Query:  QIYVDDIIFGGFPQDLSEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVGSLLYLTASRSDIAYAVGICARYQTDPRITHLE----------------
        QIYVDDIIFGGFPQDL+  + +     + ++ L    +   +     +SIVGSLLYLTASR DIAY VGICARYQ DPRITHLE                
Subjt:  QIYVDDIIFGGFPQDLSEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVGSLLYLTASRSDIAYAVGICARYQTDPRITHLE----------------

Query:  ---------VVKRILKY-------------VHGISDFGMMYSYNTTPTL------------------------------------VGYFDVDWAGSTDDR
                 + K +L+Y             +  I     +  ++ T  +                                       F+   AG    +
Subjt:  ---------VVKRILKY-------------VHGISDFGMMYSYNTTPTL------------------------------------VGYFDVDWAGSTDDR

Query:  KIH------------------------------HVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVS
         +H                              H RVR RRFKSTPP RPY L  EKVQGEA+SRLQESLRSEA+P+VGES  PVSP VHAHRA EATVS
Subjt:  KIH------------------------------HVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVS

Query:  DMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRRGHSPSVHPSLSKLPTLQPDAVPAHILEIATA--------------------TEI
        D+DSD++DNV                              ELINRR                    VPAHI EIATA                    TEI
Subjt:  DMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRRGHSPSVHPSLSKLPTLQPDAVPAHILEIATA--------------------TEI

Query:  PPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDP
        PPEDI PPTDDPIAPSS+GR +S KGPKP K+KTQQ RRNVTTKI RKKIPANVPSVPIDGISFHH+ESVQRWKFVM+RRI+DELI++FIVNLPD+FNDP
Subjt:  PPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDP

Query:  SSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQ
        SSADYQTVH RGFKFVIS AVIN FL NTVDIDCS SC T E+LA VLSEGTLSTW VNGI A ALSVKY IL+KIGI NWFPS HASS SAALGTFLYQ
Subjt:  SSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQ

Query:  ICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHMPDIDHDVHPTQGPHI
        ICN DKVDTG FIYNQLLRHVGSFGVKVPIAFPR FSSLLLHLNG VLT SDAPGPEPKTIAL YRLFQGSH+PDIDHDVHPT+GPHI
Subjt:  ICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHMPDIDHDVHPTQGPHI

TYK16854.1 F5J5.1 [Cucumis melo var. makuwa]0.052.66Show/hide
Query:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR
        MLNSG KNLDSIL +GHNGS RYGLGFV+SA+S KATSEIKFVPASM VE++TIH ET IR +VKSLGRT YYCG+KGHIRSICYKLRQDQLRQQK+WNR
Subjt:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR

Query:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN
        S AQP MVW IK  ERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSY   L DCV G VTFGDGAKGKIIA GNI K++LPRLNDVRYVDGLKANLISI+
Subjt:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN

Query:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK
        QLCDQ YK+SFDDIGCVVMNKENQICMSGKRQ DNCY+WNSN SDTC+L RSDQTW+WH+KL H SMRGLEK+IKN+A++GIPDLDVNG FFC DCQIGK
Subjt:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK

Query:  QTRS------------------------------------------------------------------------------------------------
        QTRS                                                                                                
Subjt:  QTRS------------------------------------------------------------------------------------------------

Query:  -----------------------DSVMETINVVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSSNGPG--DPSVGMQTRRKDKIDYLKMVAD
                               +SVMETINVVINDL+S+IKQMND+EDETP MSE RT S VE SK DNSS+ PG  D S GMQTRRK+KIDY+KMV D
Subjt:  -----------------------DSVMETINVVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSSNGPG--DPSVGMQTRRKDKIDYLKMVAD

Query:  LYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKL
        L                 +YWLNAMQEELLQF+RNN+WTLVSK EA            RLVAQGYTQVEGVDFDETFA +ARLEAI+LLL          
Subjt:  LYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKL

Query:  YQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQ
                        EVYVAQPK FVD +HPKHVYKLNKALYGLKQAPRAWY RLTVY RG+GYSRGEIDKTLFIHRKSDQ+LVAQIYVDDIIFGGFPQ
Subjt:  YQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQ

Query:  DL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVGSLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSY
        DL         SEFEMSMVGELSCFLGLQ KQKND IF+SQEKSIVGSLLYLTASR DIAY VGICARYQ DPRI+HLE +KRILKYVHG +DFGM+YSY
Subjt:  DL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVGSLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSY

Query:  NTTPTLVGYFDVDWAGSTDDRK---------------------IH--------------------------------HVRVR---------DRRFKSTPP
        NTTPTLVGY D DWA S D+ K                     +H                                H+ +R         D+  +    
Subjt:  NTTPTLVGYFDVDWAGSTDDRK---------------------IH--------------------------------HVRVR---------DRRFKSTPP

Query:  RRPYWLP---------------------------SEKVQGEATS------RLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIH
        R    L                            SE  +    S        ++SLR E++P+VGES  PVS  VHA+RA EA V DMD D++D++ L  
Subjt:  RRPYWLP---------------------------SEKVQGEATS------RLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIH

Query:  LLKKPSEPVTAERLSSDPPGF----HSLSRELI-------NRR------GHSPSVHPSLSKLPTLQPDAVPAHILEI-ATATEIPPEDISPPTDDPIAPS
        LLKK S PV +E+L SDP        S S E +       +RR      G+SPSVHP  SK  +L  + + +   ++ A +TE  P       DDP APS
Subjt:  LLKKPSEPVTAERLSSDPPGF----HSLSRELI-------NRR------GHSPSVHPSLSKLPTLQPDAVPAHILEI-ATATEIPPEDISPPTDDPIAPS

Query:  SEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFV
        +EG  +SPK  +PPKRK+QQ+RRN+TTK G +K            +   H   +  + F M +                            + N G  + 
Subjt:  SEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFV

Query:  ISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQ
            ++     N VDIDCSPSC   E+LAT+L  GTLSTWPVNGIPAAALS+KYAILHKIGI  +                                   
Subjt:  ISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQ

Query:  LLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLF
                 + VPIA PR FSSLLLHLNG VLTT+DAPGPE KTIAL YRLF
Subjt:  LLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLF

TYK30437.1 F5J5.1 [Cucumis melo var. makuwa]0.087.74Show/hide
Query:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR
        MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR
Subjt:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR

Query:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN
        SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN
Subjt:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN

Query:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK
        QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK
Subjt:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK

Query:  Q---------------------------------------------------------------------TRSDSVMETINVVINDLDSSIKQMNDEEDE
        +                                                                      RSDSVMETINVVINDLDSSIKQMNDEEDE
Subjt:  Q---------------------------------------------------------------------TRSDSVMETINVVINDLDSSIKQMNDEEDE

Query:  TPNMSEVRTMSIVEESKADNSSNGP-----GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT
        TPNMSEVRTMSIVEESKADNSSNGP     GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLV      
Subjt:  TPNMSEVRTMSIVEESKADNSSNGP-----GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT

Query:  DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ
                      AQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ
Subjt:  DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ

Query:  APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG
        APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL         SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG
Subjt:  APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG

Query:  SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE
        SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE
Subjt:  SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE

Query:  KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRR---GHSPSVH
        KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEAT S +            LL  P    + E  S++     +L     +     GHSPSVH
Subjt:  KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRR---GHSPSVH

Query:  PSLSKLPTLQPDAVPAHILEIATATEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRW
        PSLSKLPTLQPDAVPAHILEIATATEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRW
Subjt:  PSLSKLPTLQPDAVPAHILEIATATEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRW

Query:  KFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAIL
        KFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAIL
Subjt:  KFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAIL

Query:  HKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHM
        HKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHM
Subjt:  HKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHM

Query:  PDIDHDVHPTQGPHI
        PDIDHDVHPTQGPHI
Subjt:  PDIDHDVHPTQGPHI

TrEMBL top hitse value%identityAlignment
A0A5A7T169 F5J5.10.0e+0087.31Show/hide
Query:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR
        MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR
Subjt:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR

Query:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN
        SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN
Subjt:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN

Query:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK
        QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK
Subjt:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK

Query:  Q---------------------------------------------------------------------TRSDSVMETINVVINDLDSSIKQMNDEEDE
        +                                                                      RSDSVMETINVVINDLDSSIKQMNDEEDE
Subjt:  Q---------------------------------------------------------------------TRSDSVMETINVVINDLDSSIKQMNDEEDE

Query:  TPNMSEVRTMSIVEESKADNSSNGP-----GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT
        TPNMSEVRTMSIVEESKADNSSNGP     GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWT        
Subjt:  TPNMSEVRTMSIVEESKADNSSNGP-----GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT

Query:  DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ
                    LVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ
Subjt:  DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ

Query:  APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG
        APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL         SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG
Subjt:  APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG

Query:  SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE
        SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE
Subjt:  SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE

Query:  KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELIN---RRGHSPSVH
        KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEAT S +            LL  P    + E  S++     +L     +     GHSPSVH
Subjt:  KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELIN---RRGHSPSVH

Query:  PSLSKLPTLQPDAVPAHILEIATA------TEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHK
        PSLSKLPTLQPDAVPAHILEIATA      TEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHK
Subjt:  PSLSKLPTLQPDAVPAHILEIATA------TEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHK

Query:  ESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALS
        ESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALS
Subjt:  ESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALS

Query:  VKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRL
        VKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRL
Subjt:  VKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRL

Query:  FQGSHMPDIDHDVHPTQGPHI
        FQGSHMPDIDHDVHPTQGPHI
Subjt:  FQGSHMPDIDHDVHPTQGPHI

A0A5A7U5N3 Reverse transcriptase Ty1/copia-type domain-containing protein3.9e-29852.83Show/hide
Query:  KNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNRSCAQPC
        +NLDSILK GHNGS RYGLGF+ SASSSKATSEIKF+PAS+RVEYDTIH ETGIR  VKSLG T YYCG+KGHIR                         
Subjt:  KNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNRSCAQPC

Query:  MVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLCDQG
         V  + +++                                                                   L     ++ L+A         D+ 
Subjt:  MVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLCDQG

Query:  YKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSDS
        Y+  +D                                     +RS+Q       L ++      +V  N                          RS++
Subjt:  YKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSDS

Query:  VMETINVVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSSNGP----------------------------------GDPSVGMQTRRKDKID
        VMETINVVINDLDS IKQMNDEEDE  NMSEVRTMS VEESKADNS +GP                                  GDPSVGMQTRRKDKID
Subjt:  VMETINVVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSSNGP----------------------------------GDPSVGMQTRRKDKID

Query:  YLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGIS
        YLKMVAD  YI TI PSTVDS +KDEYWLNAMQEELLQF+RNN+WTLVSKP+  D                                  L+         
Subjt:  YLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGIS

Query:  CIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI
        CIQKFKLYQMDVKS FLNGYLN EVYVAQPK FVD EHPKHVYKLNKALYGLKQAPRAWY++LTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI
Subjt:  CIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI

Query:  IFGGFPQDLSEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVGSLLYLTASRSDIAYAVGICARYQTDPRITHLEV-----VKRILKYVHGISDF---
        IFGGFPQDL+  + +     + ++ L    +   +     +SIVGSLLYLTASR DIAY VGICARYQ DPRITHLE       K  ++Y+   S     
Subjt:  IFGGFPQDLSEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVGSLLYLTASRSDIAYAVGICARYQTDPRITHLEV-----VKRILKYVHGISDF---

Query:  ----GMMYSYN-TTPTLVGYFD----VDWAG--------STDDRKIHHVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQESLRSEAMPKVGESAAPVS
             M+  Y     T+  Y D    +D +         ++    + H RVR RRFKSTPP RPY L  EKVQGEA+SRLQESLRSEA+P+VG+S  PVS
Subjt:  ----GMMYSYN-TTPTLVGYFD----VDWAG--------STDDRKIHHVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQESLRSEAMPKVGESAAPVS

Query:  PTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRRGHSPSVHPSLSKLPTLQPDAVPAHILEIATA---------
        P VHAHRA EATVSD+DSD++DNV                              ELINRR                    VPAHI EIATA         
Subjt:  PTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRRGHSPSVHPSLSKLPTLQPDAVPAHILEIATA---------

Query:  -----------TEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRWKFVMERRIIDELI
                   TEIPPEDI PPTDDPIAPSS+GR +                         KKIPANVPSVPIDGISFHH+ESVQRWKFVM+RRI+DELI
Subjt:  -----------TEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRWKFVMERRIIDELI

Query:  KEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAILHKIGIGNWFPSLH
        ++FIVNLPD+FNDPSSADYQTVH RGFKFVIS AVIN FL NTVDIDCS SC T E+LA VLSEGTLSTW VNGI A ALSVKY IL+KIGI NWFPS H
Subjt:  KEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAILHKIGIGNWFPSLH

Query:  ASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHMPDIDHDVHPTQGP
        ASS SAALGTFLYQICN DKVDTG FIYNQLLRHVGSFGVKVPIAFPR FSSLLLHLNG VLT SDAPGPEPKTIAL YRLFQGSH+PDIDHDVHPT+GP
Subjt:  ASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHMPDIDHDVHPTQGP

Query:  HI
        HI
Subjt:  HI

A0A5D3CI89 Reverse transcriptase Ty1/copia-type domain-containing protein2.1e-30951.86Show/hide
Query:  KNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNRSCAQPC
        +NLDSILK GHNGS RYGLGF  SASSSKATSEIKF+PAS+RVEYDTIH ETGIR  VKSLG T YYCG+KGHIR +          +  H +     P 
Subjt:  KNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNRSCAQPC

Query:  MVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLC---
         +                                + G R+  + +                          DD  R   V ++ G    +     LC   
Subjt:  MVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLC---

Query:  --DQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYH--WNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIG
          ++G K            +E+++ M     AD  Y   W+         +RS+Q       L ++      +V  N                       
Subjt:  --DQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYH--WNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIG

Query:  KQTRSDSVMETINVVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSSNGP----------------------------------GDPSVGMQT
           RS++VMETINVVINDLDS IKQMNDEEDE  NMSEVRTMS VEESKADNS +GP                                  GDPSVGMQT
Subjt:  KQTRSDSVMETINVVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSSNGP----------------------------------GDPSVGMQT

Query:  RRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAI
        RRKDKIDYLKMVAD  YI TI PSTVDS +KDEYWLNAMQEELLQF+RNN+WTLVSKP+  D                                  L+  
Subjt:  RRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAI

Query:  RLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
               CIQKFKLYQMDVKS FLNGYLN EVYVAQPK FVD EHPKHVYKLNKALYGLKQAPRAWY++LTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
Subjt:  RLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA

Query:  QIYVDDIIFGGFPQDLSEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVGSLLYLTASRSDIAYAVGICARYQTDPRITHLE----------------
        QIYVDDIIFGGFPQDL+  + +     + ++ L    +   +     +SIVGSLLYLTASR DIAY VGICARYQ DPRITHLE                
Subjt:  QIYVDDIIFGGFPQDLSEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVGSLLYLTASRSDIAYAVGICARYQTDPRITHLE----------------

Query:  ---------VVKRILKY-------------VHGISDFGMMYSYNTTPTL------------------------------------VGYFDVDWAGSTDDR
                 + K +L+Y             +  I     +  ++ T  +                                       F+   AG    +
Subjt:  ---------VVKRILKY-------------VHGISDFGMMYSYNTTPTL------------------------------------VGYFDVDWAGSTDDR

Query:  KIH------------------------------HVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVS
         +H                              H RVR RRFKSTPP RPY L  EKVQGEA+SRLQESLRSEA+P+VGES  PVSP VHAHRA EATVS
Subjt:  KIH------------------------------HVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVS

Query:  DMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRRGHSPSVHPSLSKLPTLQPDAVPAHILEIATA--------------------TEI
        D+DSD++DNV                              ELINRR                    VPAHI EIATA                    TEI
Subjt:  DMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRRGHSPSVHPSLSKLPTLQPDAVPAHILEIATA--------------------TEI

Query:  PPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDP
        PPEDI PPTDDPIAPSS+GR +S KGPK PK+KTQQ RRNVTTKI RKKIPANVPSVPIDGISFHH+ESVQRWKFVM+RRI+DELI++FIVNLPD+FNDP
Subjt:  PPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDP

Query:  SSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQ
        SSADYQTVH RGFKFVIS AVIN FL NTVDIDCS SC T E+LA VLSEGTLSTW VNGI A ALSVKY IL+KIGI NWFPS HASS SAALGTFLYQ
Subjt:  SSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQ

Query:  ICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHMPDIDHDVHPTQGPHI
        ICN DKVDTG FIYNQLLRHVGSFGVKVPIAFPR FSSLLLHLNG VLT SDAPGPEPKTIAL YRLFQGSH+PDIDHDVHPT+GPHI
Subjt:  ICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHMPDIDHDVHPTQGPHI

A0A5D3CXU0 Gag-pol polyprotein0.0e+0054.24Show/hide
Query:  KIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLCDQGYKVSFDDIGCV
        ++     +T DD WYFDSGCSRHMTGNRSYF NL DCV GHVTF +GAKGKIIAKGNIN ++LPRLND+R ++ L  +L+   Q+   G           
Subjt:  KIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLCDQGYKVSFDDIGCV

Query:  VMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK-----QTRSDSVMETIN
                   GKR+                         +H+K +  S +G+                     F G  Q  +       RS  VMETIN
Subjt:  VMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK-----QTRSDSVMETIN

Query:  VVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSS-----------------------------NGP-----GDPSVGMQTRRKDKIDYLKMVA
        VVINDL+ +IKQ+NDEEDET NMSE RT S VE  KA   S                             N P     GDPS GMQTRRK+KIDY+KMVA
Subjt:  VVINDLDSSIKQMNDEEDETPNMSEVRTMSIVEESKADNSS-----------------------------NGP-----GDPSVGMQTRRKDKIDYLKMVA

Query:  DLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEA-------------TDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAI
        DL YI T+EPSTVDS ++DEYWLNAMQEELLQFR+NN+WTLVSKPE              TDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAI
Subjt:  DLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEA-------------TDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAI

Query:  RLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
        RLLLGISCIQKFKLYQMDVKSAFL+GYLN EVYVAQPKGFVD EHPKH+YKLNKALYGLKQA RAWY++LTVYLRG+GYSRGEIDKTLFI RKSDQLLVA
Subjt:  RLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA

Query:  QIYVDDIIFGGFPQDL----SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG-----SLLYLTASRSDIAYAVGICA------------RYQTDPR
        QIYVDDIIF GFP DL     EFEMSMVGELSCFLGLQIKQKND IFISQEK          L      R+  A  V +              R   DPR
Subjt:  QIYVDDIIFGGFPQDL----SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG-----SLLYLTASRSDIAYAVGICA------------RYQTDPR

Query:  ITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRK----------------------------------------------------
        ITHLE VKRILKYVHG SDFGMMYSY+TTPTLVGY D +WAGSTDD K                                                    
Subjt:  ITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRK----------------------------------------------------

Query:  ----------------------------------------------------------------IHHVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQ
                                                                        I H RVR R+FKSTPPRR Y LPSEKVQGEA+SRLQ
Subjt:  ----------------------------------------------------------------IHHVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQ

Query:  ESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPG-FHSLSRELIN----RRGHSPSVHPSLSKLPTL
        ESLRSEA+P+VGESAAPV+                               KPSEPV  ERL SDP G  HS     I          P   P++  +PT 
Subjt:  ESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPG-FHSLSRELIN----RRGHSPSVHPSLSKLPTL

Query:  QPDAVPAHILEIATAT--------------------EIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGI
               HI  IATA                     EIP EDI PPTDDPIAPSSEGR +SPK                      +K        P+  I
Subjt:  QPDAVPAHILEIATAT--------------------EIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGI

Query:  SFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIP
        SFHH++SVQRWKFVM+RRI +ELI++FIVNLPD+FNDPSSADYQTVH RGFKF+IS AVINGFLGNTVDIDCSPSC T E+LATVLS  TLSTW VN IP
Subjt:  SFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIP

Query:  AAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIA
        AAALSVKYAILHKI I NWF S HA SI  ALGTFLYQ CNDDKVDTG FIYNQLLRHVGSFGVKVPIA P+LFSSLLLHLN VVLT +DA GPEPKTI 
Subjt:  AAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIA

Query:  LRYRLFQGSHMPDIDHDVHPTQGPHI
        L YRLFQGSH+PDIDHDVHPT+GP I
Subjt:  LRYRLFQGSHMPDIDHDVHPTQGPHI

A0A5D3E2Y4 F5J5.10.0e+0087.74Show/hide
Query:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR
        MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR
Subjt:  MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNR

Query:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN
        SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN
Subjt:  SCAQPCMVWRIKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISIN

Query:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK
        QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK
Subjt:  QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGK

Query:  Q---------------------------------------------------------------------TRSDSVMETINVVINDLDSSIKQMNDEEDE
        +                                                                      RSDSVMETINVVINDLDSSIKQMNDEEDE
Subjt:  Q---------------------------------------------------------------------TRSDSVMETINVVINDLDSSIKQMNDEEDE

Query:  TPNMSEVRTMSIVEESKADNSSNGP-----GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT
        TPNMSEVRTMSIVEESKADNSSNGP     GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWT        
Subjt:  TPNMSEVRTMSIVEESKADNSSNGP-----GDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT

Query:  DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ
                    LVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ
Subjt:  DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQ

Query:  APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG
        APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL         SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG
Subjt:  APRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVG

Query:  SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE
        SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE
Subjt:  SLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSE

Query:  KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELIN---RRGHSPSVH
        KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEAT S +            LL  P    + E  S++     +L     +     GHSPSVH
Subjt:  KVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDNVLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELIN---RRGHSPSVH

Query:  PSLSKLPTLQPDAVPAHILEIATATEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRW
        PSLSKLPTLQPDAVPAHILEIATATEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRW
Subjt:  PSLSKLPTLQPDAVPAHILEIATATEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRRNVTTKIGRKKIPANVPSVPIDGISFHHKESVQRW

Query:  KFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAIL
        KFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAIL
Subjt:  KFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLSEGTLSTWPVNGIPAAALSVKYAIL

Query:  HKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHM
        HKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHM
Subjt:  HKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPKTIALRYRLFQGSHM

Query:  PDIDHDVHPTQGPHI
        PDIDHDVHPTQGPHI
Subjt:  PDIDHDVHPTQGPHI

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.3e-4329.64Show/hide
Query:  PNMS-EVRTMSIVEESKADNSSNGPGDPSVGMQTRR---KDKIDY-------LKMVADLYYIFTIEPSTVDSV-IKDE--YWLNAMQEELLQFRRNNIWT
        PN S E  T   ++E   DN +   G   +  ++ R   K +I Y        K+V + + IF   P++ D +  +D+   W  A+  EL   + NN WT
Subjt:  PNMS-EVRTMSIVEESKADNSSNGPGDPSVGMQTRR---KDKIDY-------LKMVADLYYIFTIEPSTVDSV-IKDE--YWLNAMQEELLQFRRNNIWT

Query:  LVSKPE-------------ATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGF
        +  +PE               +E G   + KARLVA+G+TQ   +D++ETFA VAR+ + R +L +      K++QMDVK+AFLNG L  E+Y+  P+G 
Subjt:  LVSKPE-------------ATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGF

Query:  VDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKS--DQLLVAQIYVDDII--------FGGFPQDLSE-FEMSMVGELSC
            +  +V KLNKA+YGLKQA R W+      L+   +    +D+ ++I  K   ++ +   +YVDD++           F + L E F M+ + E+  
Subjt:  VDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKS--DQLLVAQIYVDDII--------FGGFPQDLSE-FEMSMVGELSC

Query:  FLGLQIKQKNDSIFISQE-----------------------------------------KSIVGSLLY-LTASRSDIAYAVGICARYQTDPRITHLEVVK
        F+G++I+ + D I++SQ                                          +S++G L+Y +  +R D+  AV I +RY +       + +K
Subjt:  FLGLQIKQKNDSIFISQE-----------------------------------------KSIVGSLLY-LTASRSDIAYAVGICARYQTDPRITHLEVVK

Query:  RILKYVHGISDFGMMYSYNTT--PTLVGYFDVDWAGSTDDRK
        R+L+Y+ G  D  +++  N      ++GY D DWAGS  DRK
Subjt:  RILKYVHGISDFGMMYSYNTT--PTLVGYFDVDWAGSTDDRK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-4129.98Show/hide
Query:  MQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVI---KDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGC-------------VTKNKARLVAQGYTQ
        +++RR    +Y+ +  D       EP ++  V+   +    + AMQEE+   ++N  + LV  P+      C             + + KARLV +G+ Q
Subjt:  MQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVI---KDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGC-------------VTKNKARLVAQGYTQ

Query:  VEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSR
         +G+DFDE F+ V ++ +IR +L ++     ++ Q+DVK+AFL+G L  E+Y+ QP+GF        V KLNK+LYGLKQAPR WY +   +++ + Y +
Subjt:  VEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSR

Query:  GEIDKTLFIHRKSD-QLLVAQIYVDDIIFGG--------FPQDLSE-FEMSMVGELSCFLGLQIKQKNDS--IFISQEK---------------------
           D  ++  R S+   ++  +YVDD++  G           DLS+ F+M  +G     LG++I ++  S  +++SQEK                     
Subjt:  GEIDKTLFIHRKSD-QLLVAQIYVDDIIFGG--------FPQDLSE-FEMSMVGELSCFLGLQIKQKNDS--IFISQEK---------------------

Query:  ---------------------------SIVGSLLY-LTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWA
                                   S VGSL+Y +  +R DIA+AVG+ +R+  +P   H E VK IL+Y+ G +   + +   + P L GY D D A
Subjt:  ---------------------------SIVGSLLY-LTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWA

Query:  GSTDDRK
        G  D+RK
Subjt:  GSTDDRK

P25600 Putative transposon Ty5-1 protein YCL074W1.4e-2129.25Show/hide
Query:  MDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI--------I
        MDV +AFLN  ++  +YV QP GFV+  +P +V++L   +YGLKQAP  W   +   L+  G+ R E +  L+    SD  +   +YVDD+        I
Subjt:  MDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI--------I

Query:  FGGFPQDLSE-FEMSMVGELSCFLGLQIKQ-KNDSIFISQE------------------------------------------KSIVGSLLY-LTASRSD
        +    Q+L++ + M  +G++  FLGL I Q  N  I +S +                                          +SIVG LL+     R D
Subjt:  FGGFPQDLSE-FEMSMVGELSCFLGLQIKQ-KNDSIFISQE------------------------------------------KSIVGSLLY-LTASRSD

Query:  IAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFD
        I+Y V + +R+  +PR  HLE  +R+L+Y++      + Y   +   L  Y D
Subjt:  IAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-5133.88Show/hide
Query:  EPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKP--------------EATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGIS
        EP T    +KDE W NAM  E+     N+ W LV  P              +  +  G + + KARLVA+GY Q  G+D+ ETF+ V +  +IR++LG++
Subjt:  EPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKP--------------EATDETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGIS

Query:  CIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI
          + + + Q+DV +AFL G L  +VY++QP GF+D + P +V KL KALYGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI
Subjt:  CIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI

Query:  IFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQE------------------------------------------KSIVGSLLY
        +  G    L           F +    EL  FLG++ K+    + +SQ                                           + IVGSL Y
Subjt:  IFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQE------------------------------------------KSIVGSLLY

Query:  LTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDD
        L  +R DI+YAV   +++   P   HL+ +KRIL+Y+ G  + G+      T +L  Y D DWAG  DD
Subjt:  LTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.4e-5033.06Show/hide
Query:  EPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT--------------DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGIS
        EP T    +KD+ W  AM  E+     N+ W LV  P  +              +  G + + KARLVA+GY Q  G+D+ ETF+ V +  +IR++LG++
Subjt:  EPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEAT--------------DETGCVTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGIS

Query:  CIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI
          + + + Q+DV +AFL G L  EVY++QP GFVD + P +V +L KA+YGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI
Subjt:  CIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI

Query:  IFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQE------------------------------------------KSIVGSLLY
        +  G    L           F +    +L  FLG++ K+    + +SQ                                           + IVGSL Y
Subjt:  IFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQE------------------------------------------KSIVGSLLY

Query:  LTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDD
        L  +R D++YAV   ++Y   P   H   +KR+L+Y+ G  D G+      T +L  Y D DWAG TDD
Subjt:  LTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDD

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.7e-4329.08Show/hide
Query:  SIVEESKADNSSNGPGDPSVGMQTRRKDKIDYL----------------------KMVADLYYIFTI------EPSTVDSVIKDEYWLNAMQEELLQFRR
        S ++   + N  N   +PSV    RR  K  YL                      + V+ LY+ F +      EPST +   +   W  AM +E+     
Subjt:  SIVEESKADNSSNGPGDPSVGMQTRRKDKIDYL----------------------KMVADLYYIFTI------EPSTVDSVIKDEYWLNAMQEELLQFRR

Query:  NNIWTLVSKPEATDETGC-------------VTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVA
         + W + + P      GC             + + KARLVA+GYTQ EG+DF ETF+ V +L +++L+L IS I  F L+Q+D+ +AFLNG L+ E+Y+ 
Subjt:  NNIWTLVSKPEATDETGC-------------VTKNKARLVAQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVA

Query:  QPKGFV----DFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGG---------FPQDLSEFEMS
         P G+     D   P  V  L K++YGLKQA R W+ + +V L G G+ +   D T F+   +   L   +YVDDII              Q  S F++ 
Subjt:  QPKGFV----DFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGG---------FPQDLSEFEMS

Query:  MVGELSCFLGLQIKQKNDSIFISQEK------------------------------------------SIVGSLLYLTASRSDIAYAVGICARYQTDPRI
         +G L  FLGL+I +    I I Q K                                           ++G L+YL  +R DI++AV   +++   PR+
Subjt:  MVGELSCFLGLQIKQKNDSIFISQEK------------------------------------------SIVGSLLYLTASRSDIAYAVGICARYQTDPRI

Query:  THLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRK
         H + V +IL Y+ G    G+ YS      L  + D  +    D R+
Subjt:  THLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRK

ATMG00810.1 DNA/RNA polymerases superfamily protein1.8e-1329.55Show/hide
Query:  IYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEK-----------------------------------------SIV
        +YVDDI+  G    L         S F M  +G +  FLG+QIK     +F+SQ K                                         SIV
Subjt:  IYVDDIIFGGFPQDL---------SEFEMSMVGELSCFLGLQIKQKNDSIFISQEK-----------------------------------------SIV

Query:  GSLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRK
        G+L YLT +R DI+YAV I  +   +P +   +++KR+L+YV G    G+    N+   +  + D DWAG T  R+
Subjt:  GSLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHGISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.5e-1137.6Show/hide
Query:  MQTRRKDKIDYLKMVADLYYIFTI--EPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGC-------------VTKNKARLVAQGYTQV
        M TR K  I+ L     L    TI  EP +V   +KD  W  AMQEEL    RN  W LV  P   +  GC             + + KARLVA+G+ Q 
Subjt:  MQTRRKDKIDYLKMVADLYYIFTI--EPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGC-------------VTKNKARLVAQGYTQV

Query:  EGVDFDETFALVARLEAIRLLLGIS
        EG+ F ET++ V R   IR +L ++
Subjt:  EGVDFDETFALVARLEAIRLLLGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAATTCAGGAGCAAAGAATCTAGATTCAATACTAAAAACTGGACATAATGGCTCTCAAAGATATGGGTTGGGATTTGTGTCCTCTGCAAGTAGTTCTAAAGCTAC
ATCAGAAATCAAATTTGTCCCTGCCTCAATGAGAGTTGAATATGACACGATTCATTTAGAGACTGGCATTAGGGCTTCAGTTAAATCTCTTGGGAGAACTTATTACTATT
GTGGTCAAAAAGGTCATATTAGGTCAATTTGTTATAAATTAAGGCAAGACCAGTTACGTCAACAGAAACACTGGAATAGAAGCTGTGCTCAACCTTGCATGGTTTGGAGA
ATTAAATATATTGAAAGATGTAAGATTGCCTTTACATCCGTTCAGACCGCAGATGATGTGTGGTATTTTGATAGTGGGTGCTCCAGACATATGACTGGAAACAGATCCTA
CTTTATGAATTTAAACGACTGTGTCATCGGACATGTTACCTTTGGTGATGGTGCAAAAGGAAAAATTATAGCTAAAGGTAACATAAACAAAGATGATCTACCACGACTGA
ACGATGTTAGGTATGTGGATGGACTAAAAGCAAACTTGATCAGTATAAATCAACTGTGTGATCAAGGTTACAAAGTTAGTTTTGATGATATTGGTTGTGTTGTCATGAAT
AAAGAAAATCAAATTTGTATGAGTGGTAAACGACAAGCTGATAATTGCTACCATTGGAATTCAAATATGTCTGACACCTGTGAGTTGATAAGATCCGATCAAACATGGCT
ATGGCATAGAAAGCTAGAGCATGCCAGCATGAGAGGATTGGAAAAAGTTATTAAAAATAAAGCAGTTGTGGGAATTCCTGATTTAGACGTAAATGGAAACTTCTTCTGTG
GAGACTGTCAAATTGGTAAGCAGACAAGATCCGACAGTGTTATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCTTCTATCAAACAGATGAATGATGAGGAAGAT
GAGACTCCAAACATGTCTGAAGTCAGAACTATGAGTATTGTGGAAGAGTCTAAAGCTGATAATTCATCTAACGGTCCAGGTGATCCATCAGTTGGGATGCAGACCAGAAG
GAAAGATAAGATTGACTATTTGAAGATGGTTGCTGATTTGTATTATATTTTCACCATTGAACCTTCAACAGTTGACTCTGTTATCAAGGATGAGTATTGGTTAAATGCTA
TGCAAGAGGAGCTACTGCAATTTAGACGAAACAACATATGGACGTTAGTCTCAAAGCCAGAAGCTACTGATGAAACTGGATGTGTGACGAAAAATAAAGCCAGATTAGTA
GCTCAAGGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCTTGTTGCTCGACTTGAAGCCATTCGACTTTTACTGGGTATATCATGCATACAGAAATT
TAAATTGTATCAGATGGATGTAAAGAGTGCCTTCTTAAATGGGTACTTGAATGGGGAGGTTTATGTGGCTCAACCAAAAGGTTTTGTAGATTTCGAGCACCCGAAGCATG
TGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATAACCGGCTAACTGTATACTTGAGAGGTAGAGGATATTCCAGAGGAGAAATTGAC
AAGACCTTGTTTATACACAGGAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCTCAGGATCTATCAGAATTCGAAATGAG
CATGGTTGGGGAGCTTTCATGCTTTCTAGGACTTCAAATTAAGCAAAAGAATGATAGCATTTTCATATCTCAAGAAAAGAGTATAGTAGGTAGCCTATTATACTTAACAG
CAAGTCGATCAGACATAGCTTATGCTGTGGGAATATGTGCTCGTTATCAGACAGATCCTCGCATCACTCACTTAGAAGTTGTTAAACGAATTCTCAAATACGTTCATGGG
ATCAGTGACTTTGGAATGATGTATTCCTATAATACAACTCCTACTCTAGTTGGATATTTTGATGTTGACTGGGCAGGTTCAACTGATGATCGTAAAATACATCATGTAAG
GGTCAGGGATCGCCGTTTCAAAAGCACTCCTCCCCGGAGACCCTATTGGCTTCCATCTGAGAAAGTGCAGGGAGAGGCCACCAGTAGGTTGCAAGAGTCTTTACGTTCTG
AGGCGATGCCTAAAGTCGGGGAGTCTGCTGCTCCTGTTTCTCCTACTGTGCATGCTCATCGAGCTTTTGAGGCCACTGTATCAGATATGGATTCAGATAATAAGGATAAT
GTTCTGTTAATTCATTTGTTAAAGAAACCTTCAGAGCCAGTTACAGCCGAGAGGCTCTCTTCTGATCCCCCCGGGTTCCATTCACTCTCAAGAGAGCTCATCAACCGAAG
GGGTCATTCTCCTTCTGTTCACCCTTCTCTGTCGAAACTACCTACTTTGCAACCGGATGCAGTGCCTGCACACATTCTTGAAATCGCTACTGCTACTGAGATACCTCCTG
AAGACATTTCTCCTCCTACTGATGATCCCATTGCACCATCTTCTGAAGGAAGGACAAAGTCTCCTAAAGGTCCTAAACCCCCAAAAAGAAAAACTCAACAGGTTAGGAGA
AATGTTACCACAAAAATTGGCAGAAAGAAAATCCCTGCAAATGTTCCATCTGTTCCTATTGATGGAATTTCTTTTCATCACAAGGAAAGCGTTCAACGCTGGAAGTTTGT
GATGGAGAGGAGAATTATCGATGAGTTAATTAAAGAATTTATTGTCAATCTGCCTGATGAGTTTAATGATCCGAGTAGTGCTGACTATCAAACGGTGCACAATAGAGGGT
TCAAATTTGTGATTTCACTTGCTGTGATAAATGGTTTTCTTGGAAATACTGTTGATATTGACTGCTCTCCATCATGTCTTACTAATGAGCTTCTAGCTACTGTCTTATCT
GAAGGGACTTTATCCACATGGCCTGTGAATGGAATCCCTGCAGCTGCTCTCAGCGTCAAGTATGCTATTCTGCACAAGATTGGCATTGGCAATTGGTTCCCTTCCTTACA
TGCCTCAAGCATATCTGCTGCCTTAGGTACATTCTTGTATCAAATTTGCAATGATGATAAAGTAGATACGGGTGTCTTCATTTACAATCAACTATTGAGGCATGTTGGGT
CGTTTGGGGTCAAGGTTCCCATTGCTTTTCCAAGGTTGTTCTCCAGTCTGCTACTTCACCTAAATGGAGTCGTGCTTACTACATCTGATGCTCCTGGACCTGAACCTAAG
ACAATTGCACTTAGGTACAGACTCTTTCAAGGCAGTCATATGCCTGATATTGACCATGATGTGCATCCAACTCAGGGCCCACATATTTTGACACTACTGACTGGGATGAC
TCTCCTGAAGGCTTCTATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAATTCAGGAGCAAAGAATCTAGATTCAATACTAAAAACTGGACATAATGGCTCTCAAAGATATGGGTTGGGATTTGTGTCCTCTGCAAGTAGTTCTAAAGCTAC
ATCAGAAATCAAATTTGTCCCTGCCTCAATGAGAGTTGAATATGACACGATTCATTTAGAGACTGGCATTAGGGCTTCAGTTAAATCTCTTGGGAGAACTTATTACTATT
GTGGTCAAAAAGGTCATATTAGGTCAATTTGTTATAAATTAAGGCAAGACCAGTTACGTCAACAGAAACACTGGAATAGAAGCTGTGCTCAACCTTGCATGGTTTGGAGA
ATTAAATATATTGAAAGATGTAAGATTGCCTTTACATCCGTTCAGACCGCAGATGATGTGTGGTATTTTGATAGTGGGTGCTCCAGACATATGACTGGAAACAGATCCTA
CTTTATGAATTTAAACGACTGTGTCATCGGACATGTTACCTTTGGTGATGGTGCAAAAGGAAAAATTATAGCTAAAGGTAACATAAACAAAGATGATCTACCACGACTGA
ACGATGTTAGGTATGTGGATGGACTAAAAGCAAACTTGATCAGTATAAATCAACTGTGTGATCAAGGTTACAAAGTTAGTTTTGATGATATTGGTTGTGTTGTCATGAAT
AAAGAAAATCAAATTTGTATGAGTGGTAAACGACAAGCTGATAATTGCTACCATTGGAATTCAAATATGTCTGACACCTGTGAGTTGATAAGATCCGATCAAACATGGCT
ATGGCATAGAAAGCTAGAGCATGCCAGCATGAGAGGATTGGAAAAAGTTATTAAAAATAAAGCAGTTGTGGGAATTCCTGATTTAGACGTAAATGGAAACTTCTTCTGTG
GAGACTGTCAAATTGGTAAGCAGACAAGATCCGACAGTGTTATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCTTCTATCAAACAGATGAATGATGAGGAAGAT
GAGACTCCAAACATGTCTGAAGTCAGAACTATGAGTATTGTGGAAGAGTCTAAAGCTGATAATTCATCTAACGGTCCAGGTGATCCATCAGTTGGGATGCAGACCAGAAG
GAAAGATAAGATTGACTATTTGAAGATGGTTGCTGATTTGTATTATATTTTCACCATTGAACCTTCAACAGTTGACTCTGTTATCAAGGATGAGTATTGGTTAAATGCTA
TGCAAGAGGAGCTACTGCAATTTAGACGAAACAACATATGGACGTTAGTCTCAAAGCCAGAAGCTACTGATGAAACTGGATGTGTGACGAAAAATAAAGCCAGATTAGTA
GCTCAAGGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCTTGTTGCTCGACTTGAAGCCATTCGACTTTTACTGGGTATATCATGCATACAGAAATT
TAAATTGTATCAGATGGATGTAAAGAGTGCCTTCTTAAATGGGTACTTGAATGGGGAGGTTTATGTGGCTCAACCAAAAGGTTTTGTAGATTTCGAGCACCCGAAGCATG
TGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATAACCGGCTAACTGTATACTTGAGAGGTAGAGGATATTCCAGAGGAGAAATTGAC
AAGACCTTGTTTATACACAGGAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCTCAGGATCTATCAGAATTCGAAATGAG
CATGGTTGGGGAGCTTTCATGCTTTCTAGGACTTCAAATTAAGCAAAAGAATGATAGCATTTTCATATCTCAAGAAAAGAGTATAGTAGGTAGCCTATTATACTTAACAG
CAAGTCGATCAGACATAGCTTATGCTGTGGGAATATGTGCTCGTTATCAGACAGATCCTCGCATCACTCACTTAGAAGTTGTTAAACGAATTCTCAAATACGTTCATGGG
ATCAGTGACTTTGGAATGATGTATTCCTATAATACAACTCCTACTCTAGTTGGATATTTTGATGTTGACTGGGCAGGTTCAACTGATGATCGTAAAATACATCATGTAAG
GGTCAGGGATCGCCGTTTCAAAAGCACTCCTCCCCGGAGACCCTATTGGCTTCCATCTGAGAAAGTGCAGGGAGAGGCCACCAGTAGGTTGCAAGAGTCTTTACGTTCTG
AGGCGATGCCTAAAGTCGGGGAGTCTGCTGCTCCTGTTTCTCCTACTGTGCATGCTCATCGAGCTTTTGAGGCCACTGTATCAGATATGGATTCAGATAATAAGGATAAT
GTTCTGTTAATTCATTTGTTAAAGAAACCTTCAGAGCCAGTTACAGCCGAGAGGCTCTCTTCTGATCCCCCCGGGTTCCATTCACTCTCAAGAGAGCTCATCAACCGAAG
GGGTCATTCTCCTTCTGTTCACCCTTCTCTGTCGAAACTACCTACTTTGCAACCGGATGCAGTGCCTGCACACATTCTTGAAATCGCTACTGCTACTGAGATACCTCCTG
AAGACATTTCTCCTCCTACTGATGATCCCATTGCACCATCTTCTGAAGGAAGGACAAAGTCTCCTAAAGGTCCTAAACCCCCAAAAAGAAAAACTCAACAGGTTAGGAGA
AATGTTACCACAAAAATTGGCAGAAAGAAAATCCCTGCAAATGTTCCATCTGTTCCTATTGATGGAATTTCTTTTCATCACAAGGAAAGCGTTCAACGCTGGAAGTTTGT
GATGGAGAGGAGAATTATCGATGAGTTAATTAAAGAATTTATTGTCAATCTGCCTGATGAGTTTAATGATCCGAGTAGTGCTGACTATCAAACGGTGCACAATAGAGGGT
TCAAATTTGTGATTTCACTTGCTGTGATAAATGGTTTTCTTGGAAATACTGTTGATATTGACTGCTCTCCATCATGTCTTACTAATGAGCTTCTAGCTACTGTCTTATCT
GAAGGGACTTTATCCACATGGCCTGTGAATGGAATCCCTGCAGCTGCTCTCAGCGTCAAGTATGCTATTCTGCACAAGATTGGCATTGGCAATTGGTTCCCTTCCTTACA
TGCCTCAAGCATATCTGCTGCCTTAGGTACATTCTTGTATCAAATTTGCAATGATGATAAAGTAGATACGGGTGTCTTCATTTACAATCAACTATTGAGGCATGTTGGGT
CGTTTGGGGTCAAGGTTCCCATTGCTTTTCCAAGGTTGTTCTCCAGTCTGCTACTTCACCTAAATGGAGTCGTGCTTACTACATCTGATGCTCCTGGACCTGAACCTAAG
ACAATTGCACTTAGGTACAGACTCTTTCAAGGCAGTCATATGCCTGATATTGACCATGATGTGCATCCAACTCAGGGCCCACATATTTTGACACTACTGACTGGGATGAC
TCTCCTGAAGGCTTCTATGTGA
Protein sequenceShow/hide protein sequence
MLNSGAKNLDSILKTGHNGSQRYGLGFVSSASSSKATSEIKFVPASMRVEYDTIHLETGIRASVKSLGRTYYYCGQKGHIRSICYKLRQDQLRQQKHWNRSCAQPCMVWR
IKYIERCKIAFTSVQTADDVWYFDSGCSRHMTGNRSYFMNLNDCVIGHVTFGDGAKGKIIAKGNINKDDLPRLNDVRYVDGLKANLISINQLCDQGYKVSFDDIGCVVMN
KENQICMSGKRQADNCYHWNSNMSDTCELIRSDQTWLWHRKLEHASMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSDSVMETINVVINDLDSSIKQMNDEED
ETPNMSEVRTMSIVEESKADNSSNGPGDPSVGMQTRRKDKIDYLKMVADLYYIFTIEPSTVDSVIKDEYWLNAMQEELLQFRRNNIWTLVSKPEATDETGCVTKNKARLV
AQGYTQVEGVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNGEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYNRLTVYLRGRGYSRGEID
KTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLSEFEMSMVGELSCFLGLQIKQKNDSIFISQEKSIVGSLLYLTASRSDIAYAVGICARYQTDPRITHLEVVKRILKYVHG
ISDFGMMYSYNTTPTLVGYFDVDWAGSTDDRKIHHVRVRDRRFKSTPPRRPYWLPSEKVQGEATSRLQESLRSEAMPKVGESAAPVSPTVHAHRAFEATVSDMDSDNKDN
VLLIHLLKKPSEPVTAERLSSDPPGFHSLSRELINRRGHSPSVHPSLSKLPTLQPDAVPAHILEIATATEIPPEDISPPTDDPIAPSSEGRTKSPKGPKPPKRKTQQVRR
NVTTKIGRKKIPANVPSVPIDGISFHHKESVQRWKFVMERRIIDELIKEFIVNLPDEFNDPSSADYQTVHNRGFKFVISLAVINGFLGNTVDIDCSPSCLTNELLATVLS
EGTLSTWPVNGIPAAALSVKYAILHKIGIGNWFPSLHASSISAALGTFLYQICNDDKVDTGVFIYNQLLRHVGSFGVKVPIAFPRLFSSLLLHLNGVVLTTSDAPGPEPK
TIALRYRLFQGSHMPDIDHDVHPTQGPHILTLLTGMTLLKASM