; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0003322 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0003322
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr04:22436819..22440674
RNA-Seq ExpressionIVF0003322
SyntenyIVF0003322
Gene Ontology termsGO:0007127 - meiosis I (biological process)
GO:0015074 - DNA integration (biological process)
GO:0050896 - response to stimulus (biological process)
GO:0000166 - nucleotide binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7051484.1 unnamed protein product [Microthlaspi erraticum]0.050.41Show/hide
Query:  LDLALLSE-KPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEH
        LD A+L++ +P AIT  +S+ ++S ++ WERSNRLSL  +RMT+A +IK ++  TE A+EF++ +K+C QS+ ADKS+ G+LM+ LT  KFD S+ IH+H
Subjt:  LDLALLSE-KPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEH

Query:  ILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVK
        +  M+NLAA+L T+GMEVNE+FLV FI+NSLP E+  F +NYNT+KDKWN  EL++ML+QEE RLKK   H  N +GH  A    GK +     G+ K K
Subjt:  ILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVK

Query:  QSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK-------------------------------------------------------------
                + Q + KC FC + GH++KDC KRKAWF+ K                                                             
Subjt:  QSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK-------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------DLDFTDLGICVDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEK
                                  LDF+DL +C+DCIKGKQTKH + K ATRS+QLLE+IHTDI GPFD PS+ GEKYFITFIDD+SRYGY YLLHEK
Subjt:  -------------------------DLDFTDLGICVDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEK

Query:  SQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE----------------------
        SQ+++ LKVFI+EVERQL+R +K++RSDRGGE+YGK++E+GQCPGPFAK LES GICAQYTMPG     TPQQN VAE                      
Subjt:  SQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE----------------------

Query:  ------------------------------------------------------------------------------------STRIVETGNVRFIEND
                                                                                            STRIVETGN RFIEN 
Subjt:  ------------------------------------------------------------------------------------STRIVETGNVRFIEND

Query:  IINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVT-----EGPQEIELRRSVRSRRSAISDDYLVYLHESEFD
          +GS E RKV+IQE++ E+ S + S  IVVP+V    N+  EQ  +   P ++ + NEPVT       PQE  LRRS R RRSAIS+DY+VY  ESE D
Subjt:  IINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVT-----EGPQEIELRRSVRSRRSAISDDYLVYLHESEFD

Query:  LSIDNDPVSFSQAIKGIILPNELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKT
        LSID  P SF +A++      +LP   K VG KW++  KRDS G I+R KA LVAKG+TQKDGIDYKETFS VSKKDSLRI++AL+A YDLELHQMD+KT
Subjt:  LSIDNDPVSFSQAIKGIILPNELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKT

Query:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK
        AFLNG+L+EEV+MDQPEGF+V GKEHMVCKL++SIYGLKQASRQWYLKFNDTITS+GF E IVDRCIY+KISGSKFI+LVLYVDDILLA ND G+L  TK
Subjt:  AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTK

Query:  EFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTR
        ++LSKNFEMKDM EASYVIGIEIFRDR+ GLL LSQ  YINK+LE++KM KCS+ + PIQKGDKFS MQCPKNELER +ME I YAS+VGSL Y QTCTR
Subjt:  EFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTR

Query:  LDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFV
         DI+FAVGMLGRYQSNPGMDHWKAAKKVLRYLQG K++MLT++RS+HLE+I YSDSD+AGCVD+RKSTFGYLF LA GA+SWKS KQS+IA STMEAEFV
Subjt:  LDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFV

Query:  TCFEAT
         CFEAT
Subjt:  TCFEAT

KAA0052755.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.084.03Show/hide
Query:  MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK
        MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK
Subjt:  MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK

Query:  NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE
        NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE
Subjt:  NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE

Query:  LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKDLDFTDLGICVDCI
        LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK              
Subjt:  LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKDLDFTDLGICVDCI

Query:  KGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDE
                                                                                  VERQLDRNVKILRSDRGGEYYGKYDE
Subjt:  KGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDE

Query:  NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE-----------------------------------------------------STRIVETG
        NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE                                                     STRIVETG
Subjt:  NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE-----------------------------------------------------STRIVETG

Query:  NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES
        NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES
Subjt:  NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES

Query:  EFDLSIDNDPVSFSQAIKGIILPN---------------------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK
        EFDLSIDNDPVSFSQAIKG                          ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK
Subjt:  EFDLSIDNDPVSFSQAIKGIILPN---------------------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK

Query:  DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF
        DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF
Subjt:  DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF

Query:  IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE
        IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE
Subjt:  IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE

Query:  RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA
        RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA
Subjt:  RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA

Query:  EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC
        EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC
Subjt:  EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC

KAG7564986.1 Integrase catalytic core [Arabidopsis suecica]0.050.28Show/hide
Query:  SAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLS-EKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNTEDAKEFMKS
        +A  +L++ A S++KFNGLN+ +W EQIRF LGV+ LD A+L+ E+P AIT  +S+ ++S Y++WERSNRLSL  +RMT+A ++K ++  TE A+EF+K 
Subjt:  SAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLS-EKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNTEDAKEFMKS

Query:  VKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSMLIQEEAR
        +K+C QS+ ADKS+ G LMS LT  KFD S+ IH+H+  M+NLA++L T+GMEV+E FLV FI+NSLP E+  F +NYNT+KDKWN  EL++ML+QEE R
Subjt:  VKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSMLIQEEAR

Query:  LKKPIIHSVNLMGHKGAGKKPGKKNGKGNH-GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK------------------------
        LKK      +L+G   A  + GK + K     +  VK   + IHK+     KC FC K GH++KDC KRKAWF+ K                        
Subjt:  LKKPIIHSVNLMGHKGAGKKPGKKNGKGNH-GQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------DLDFTDLGICVDCIKGKQTKHTISKEATRSSQLLEIIH
                                                                       LDF+DL +C+DCIKGKQTKH + K ATRS+QLLE+IH
Subjt:  --------------------------------------------------------------DLDFTDLGICVDCIKGKQTKHTISKEATRSSQLLEIIH

Query:  TDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMP
        TDI GPFD PS+ GEKYFITFIDD+SRYG+ YLLHEKS++++ L+VFI+EVERQLDR VK++RSDRGGE+YGK+ E+GQCPGPFAK LES GICAQYTMP
Subjt:  TDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMP

Query:  GYTMPGTPQQNDVAE-------------------------------------------------------------------------------------
        G     TPQQN VAE                                                                                     
Subjt:  GYTMPGTPQQNDVAE-------------------------------------------------------------------------------------

Query:  ---------------------STRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVT-
                             STRIVETGN RFIEN   +GS E RKV+IQE++VE+ S    S++VVP+V    N+  EQ  +   P ++   NEPVT 
Subjt:  ---------------------STRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVT-

Query:  -----EGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQAI------KGIILPNE---------------LPKKSKRVGCKWVFKTKRD
               PQE  LRRS R RRSAIS+DY+VY  ESE D+S+D DP++F +A+      K  I   E               LP   K VG KWVFKTKRD
Subjt:  -----EGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQAI------KGIILPNE---------------LPKKSKRVGCKWVFKTKRD

Query:  SNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQA
        S GNIER KARLVAKG+TQKDGIDYKETFSPVSKKDSLRI++ LVAHYDLELHQMDVKTAFLNG L+EEV+MDQPEGF+  G EH+VCKLK+SIYGLKQA
Subjt:  SNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQA

Query:  SRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYIN
        SRQWYLKFNDTITS+GF E IVDRCIY+K+SGSKF+ILVLYVDDILLA ND G+L   K++LSKNFEMKDM EASYVIGIEI RDR+ GLL LSQ  YIN
Subjt:  SRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYIN

Query:  KVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLT
        K+LE+++M++CS+   PIQKGDKFS MQCP+NELER +ME I YAS+VGSL Y QTCTR DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQG K+ MLT
Subjt:  KVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLT

Query:  YKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEAT
        Y+RSD+LEVI YSDSD+AGCVD+RKSTFGYLFLLA GAISWKS KQS+IA STMEAEFV CFEAT
Subjt:  YKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEAT

RZC25410.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]0.054.82Show/hide
Query:  VIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNTEDAKEF
        V+    P SL SH +S+  FNGLNFSDW EQ++FHLGVLDLDLA+L EKP  IT A+S+E ++ YKAWERSNRLSLMF+RMTVA++IK  +  T+ AKEF
Subjt:  VIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNTEDAKEF

Query:  MKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSMLIQE
        M  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+HEH++EMTN+AARLKT+GM VNENFLV FILNSLPSEYGPF M+YNT+KDKWNVHEL SML+QE
Subjt:  MKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSMLIQE

Query:  EARLKKPIIHSVNLMGHKG---AGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK-------------------
        E RLK    HS++ + H+G   AGKK  KK+ KG  G LK+K     I KK    + C FC K GH+QKDC KRK+WFE K                   
Subjt:  EARLKKPIIHSVNLMGHKG---AGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------DLDFTDLGICVDCIKGKQTKHTISKEATRSSQ
                                                                            DLDFTDL ICVDCIKGKQTKHT  K ATRS+Q
Subjt:  --------------------------------------------------------------------DLDFTDLGICVDCIKGKQTKHTISKEATRSSQ

Query:  LLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGIC
        LLEI+HTDI GPFDV SFG E+YFITFIDD+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYY +YDE GQ P PFAK L+  GIC
Subjt:  LLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGIC

Query:  AQYTMPGYTMPGTPQQNDVAE-------------------------------------------------------------------------------
        AQYTMPG     TPQQN V+E                                                                               
Subjt:  AQYTMPGYTMPGTPQQNDVAE-------------------------------------------------------------------------------

Query:  ---------------------------STRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVT
                                   STRIVETGN RFIEN  I+GS  PR+VEI+EVRV++P +  SS  V+   V + N+ +E Q N +     ++ 
Subjt:  ---------------------------STRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVT

Query:  NEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQAIK---------------------GIILPNELPKKSKRVGCKWVFKTKR
        NEP+ E PQE+ LR+S R RR AIS+DY+VYLHE+E +LSI DNDPVSFSQAI                       +    ELPK  KRVG KWVFKTKR
Subjt:  NEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQAIK---------------------GIILPNELPKKSKRVGCKWVFKTKR

Query:  DSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQ
        DS+GN+ER KARLVAKG+TQKDGIDYKETFSPVS+KDS RIIMALVAHYDLELHQMDVKTAFLNG+L+E+V+MDQP GF VEGKEHMVCKLK+SIYGLKQ
Subjt:  DSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQ

Query:  ASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYI
        ASRQWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVLYVDDILLATND GL  +TK+FLS NFEMKDM EASYVIGIEIFR+R+ GLLGLSQ AYI
Subjt:  ASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYI

Query:  NKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYML
        NKVLE+F+M KCS+S VPIQKGDKFSL QCPKN+LER QME I YAS+VGS++YAQTCTR DISFA GMLGRYQSNPGM+HWKAAKKVLRYLQG KD++L
Subjt:  NKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYML

Query:  TYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEAT
        TYKRSDHLEVI YSDSDFAGCVDTRKST G++FLLA GAISWKSAKQS++AASTMEA FV CFEAT
Subjt:  TYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEAT

TYK04201.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.083.84Show/hide
Query:  MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK
        MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK
Subjt:  MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK

Query:  NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE
        NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLV FILNSLPSEYGPFHMNYNTLKDKWNVHE
Subjt:  NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE

Query:  LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKDLDFTDLGICVDCI
        LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKG+IKDKCRFCNKPGHYQKDCLKRKAWFENK              
Subjt:  LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKDLDFTDLGICVDCI

Query:  KGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDE
                                                                                  VERQLDRNVKILRSDRGGEYYGKYDE
Subjt:  KGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDE

Query:  NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE-----------------------------------------------------STRIVETG
        NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE                                                     STRIVETG
Subjt:  NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE-----------------------------------------------------STRIVETG

Query:  NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES
        NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES
Subjt:  NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES

Query:  EFDLSIDNDPVSFSQAIKGIILPN---------------------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK
        EFDLSIDNDPVSFSQAIKG                          ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK
Subjt:  EFDLSIDNDPVSFSQAIKGIILPN---------------------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK

Query:  DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF
        DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF
Subjt:  DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF

Query:  IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE
        IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE
Subjt:  IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE

Query:  RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA
        RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA
Subjt:  RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA

Query:  EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC
        EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC
Subjt:  EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC

TrEMBL top hitse value%identityAlignment
A0A445LQ30 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0054.76Show/hide
Query:  IYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNTEDA
        +  V+    P SL SH +S+  FNGLNFSDW EQ++FHLGVLDLDLA+L EKP  IT A+S+E ++ YKAWERSNRLSLMF+RMTVA++IK  +  T+ A
Subjt:  IYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNTEDA

Query:  KEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSML
        KEFM  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+HEH++EMTN+AARLKT+GM VNENFLV FILNSLPSEYGPF M+YNT+KDKWNVHEL SML
Subjt:  KEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSML

Query:  IQEEARLKKPIIHSVNLMGHK---GAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK----------------
        +QEE RLK    HS++ + H+   GAGKK  KK+ KG  G LK+K     I KK    + C FC K GH+QKDC KRK+WFE K                
Subjt:  IQEEARLKKPIIHSVNLMGHK---GAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------DLDFTDLGICVDCIKGKQTKHTISKEATR
                                                                               DLDFTDL ICVDCIKGKQTKHT  K ATR
Subjt:  -----------------------------------------------------------------------DLDFTDLGICVDCIKGKQTKHTISKEATR

Query:  SSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESH
        S+QLLEI+HTDI GPFDV SFG E+YFITFIDD+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYY +YDE GQ P PFAK L+  
Subjt:  SSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESH

Query:  GICAQYTMPGYTMPGTPQQNDVAE----------------------------------------------------------------------------
        GICAQ     YTMPGTPQQN V+E                                                                            
Subjt:  GICAQYTMPGYTMPGTPQQNDVAE----------------------------------------------------------------------------

Query:  ------------------------------STRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHND
                                      STRIVETGN RFIEN  I+GS  PR+VEI+EVRV++P +  SS  V+   V + N+ +E Q      HND
Subjt:  ------------------------------STRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHND

Query:  --IVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQAI---------------------KGIILPNELPKKSKRVGCKWV
          ++ NEP+ E PQE+ LR+S R RR AIS+DY+VYLHE+E +LSI DNDPVSFSQAI                       +    ELPK  KRVG KWV
Subjt:  --IVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQAI---------------------KGIILPNELPKKSKRVGCKWV

Query:  FKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSI
        FKTKRDS+GN+ER KARLVAKG+TQKDGIDYKETFSPVS+KDS RIIMALVAHYDLELHQMDVKTAFLNG+L+E+V+MDQP GF VEGKEHMVCKLK+SI
Subjt:  FKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSI

Query:  YGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLS
        YGLKQASRQWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVLYVDDILLATND GL  +TK+FLS NFEMKDM EASYVIGIEIFR+R+ GLLGLS
Subjt:  YGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLS

Query:  QNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGA
        Q AYINKVLE+F+M KCS+S VPIQKGDKFSL QCPKN+LER QME I YAS+VGS++YAQTCTR DISFA GMLGRYQSNPGM+HWKAAKKVLRYLQG 
Subjt:  QNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGA

Query:  KDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEAT
        KD++LTYKRSDHLEVI YSDSDFAGCVDTRKST G++FLLA GAISWKSAKQS++AASTMEA FV CFEAT
Subjt:  KDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEAT

A0A5A7UG95 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0084.03Show/hide
Query:  MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK
        MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK
Subjt:  MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK

Query:  NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE
        NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE
Subjt:  NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE

Query:  LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKDLDFTDLGICVDCI
        LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENK              
Subjt:  LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKDLDFTDLGICVDCI

Query:  KGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDE
                                                                                  VERQLDRNVKILRSDRGGEYYGKYDE
Subjt:  KGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDE

Query:  NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE-----------------------------------------------------STRIVETG
        NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE                                                     STRIVETG
Subjt:  NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE-----------------------------------------------------STRIVETG

Query:  NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES
        NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES
Subjt:  NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES

Query:  EFDLSIDNDPVSFSQAIKGIILPN---------------------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK
        EFDLSIDNDPVSFSQAIKG                          ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK
Subjt:  EFDLSIDNDPVSFSQAIKGIILPN---------------------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK

Query:  DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF
        DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF
Subjt:  DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF

Query:  IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE
        IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE
Subjt:  IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE

Query:  RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA
        RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA
Subjt:  RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA

Query:  EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC
        EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC
Subjt:  EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC

A0A5D3BWW5 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0083.84Show/hide
Query:  MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK
        MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK
Subjt:  MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIK

Query:  NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE
        NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLV FILNSLPSEYGPFHMNYNTLKDKWNVHE
Subjt:  NTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHE

Query:  LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKDLDFTDLGICVDCI
        LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKG+IKDKCRFCNKPGHYQKDCLKRKAWFENK              
Subjt:  LQSMLIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKDLDFTDLGICVDCI

Query:  KGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDE
                                                                                  VERQLDRNVKILRSDRGGEYYGKYDE
Subjt:  KGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDE

Query:  NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE-----------------------------------------------------STRIVETG
        NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE                                                     STRIVETG
Subjt:  NGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE-----------------------------------------------------STRIVETG

Query:  NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES
        NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES
Subjt:  NVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHES

Query:  EFDLSIDNDPVSFSQAIKGIILPN---------------------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK
        EFDLSIDNDPVSFSQAIKG                          ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK
Subjt:  EFDLSIDNDPVSFSQAIKGIILPN---------------------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKK

Query:  DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF
        DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF
Subjt:  DSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKF

Query:  IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE
        IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE
Subjt:  IILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELE

Query:  RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA
        RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA
Subjt:  RNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLA

Query:  EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC
        EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC
Subjt:  EGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC

A0A6N2LRF5 Uncharacterized protein0.0e+0051.91Show/hide
Query:  KHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLS-EKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNT
        KH+    A+S   SL++ A S++KFNGLN+ +W EQIRF LGV+ LD A+L+ E+P AIT  +S+ ++S Y+ WERSNRL L  +RM++A ++K ++   
Subjt:  KHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLS-EKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNT

Query:  EDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQ
        E A+EF+  +K   QS+ ADKS+ G+LMS LT  +FD S+TIHEH+  M+NLA+RL +MGMEV+E+FLV FI+NSLP EYG F +NYNT+KDKWN  EL+
Subjt:  EDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQ

Query:  SMLIQEEARLKKPIIHSVNLMGHKGAG---KKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKD--KCRFCNKPGHYQKDCLKRKAWFENKDLD--------
        +MLIQEEARLKK       ++G   AG   KKP  K+ + + G  K  +S        QI+   KC FC K GH +KDC +RKAWF+ K           
Subjt:  SMLIQEEARLKKPIIHSVNLMGHKGAG---KKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKD--KCRFCNKPGHYQKDCLKRKAWFENKDLD--------

Query:  -----------------------------------------------------FTDLGIC----------------------------------------
                                                             F DL  C                                        
Subjt:  -----------------------------------------------------FTDLGIC----------------------------------------

Query:  ---------------------------------------------VDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSR
                                                     ++CIKGKQTKH   K ATRSSQLLE+IHTDI GPFD PS+ GEKYFITFIDD+SR
Subjt:  ---------------------------------------------VDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSR

Query:  YGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE------------
        YGY YLLH+K+++++ LK+FI+EVERQL++ VK++RSDRGGE+YG++ E+GQCPGPFAK LES GICAQ     YTMPGTPQQN VAE            
Subjt:  YGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE------------

Query:  --------------------------------------STRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQI
                                              STRIVETGN RFIEN   +GS E R V I+E+RVE  S +  +Q+V+PVV    N   EQQ 
Subjt:  --------------------------------------STRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQI

Query:  NGQT-PHNDIVTNEPVTEGPQ----EIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQAIKG-------------IILPN--------ELPK
           T P N  + NEP+    Q    ++ LRRS+R RRSAI+DDY+VY  ESE DLS+D +P +F +A++              I   N        ELP+
Subjt:  NGQT-PHNDIVTNEPVTEGPQ----EIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQAIKG-------------IILPN--------ELPK

Query:  KSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKE
          K VGCKW+F TKRDS GN+ER KARLVAKG+TQKDGIDY ET SPVSKKDSLRI++ALVAHYDLELHQMDVKTAFLNG L+EE++MDQPEGF+  G E
Subjt:  KSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKE

Query:  HMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFR
         +VC+L++SIYGLKQASRQWY+KFNDTI S+GF E IVDRCIY+K+SGSKF ILVLYVDDIL+A ND G+L   K++LS NFEMKDM EASYVIGIEI R
Subjt:  HMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFR

Query:  DRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAA
        DR+ GLL LSQ  YINKVL+++ M+KCS    PIQKGD+FS MQC +NELER +ME I YAS+VGSL Y QTC+R DISFAVGMLGRYQSNPGMDHWKAA
Subjt:  DRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAA

Query:  KKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEAT
        KKVLRYLQG K+Y LTY+RSD++E++ YSDSD+AGC+D+RK TFGYLFLLA GA+SWK  KQS+IA STMEAEFV CFEAT
Subjt:  KKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEAT

A0A6N2MJZ2 Uncharacterized protein0.0e+0050.7Show/hide
Query:  KHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLS-EKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNT
        KH+    A+S   SL++ A S++KFNGLN+ +W EQIRF LGV+ LD A+L+ E+P AIT  +S+ ++S Y+ WERSNRL L  +RM++A +IK ++  T
Subjt:  KHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLS-EKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNT

Query:  EDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQ
        + A+EF+  +K+  QS+ ADKS+ G+LMS LT  +FD S+TIHEH+  M+NLA+RL +MGMEV+E+FLV FI+NSLP EYG F +NYNT+KDKWN+ EL+
Subjt:  EDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQ

Query:  SMLIQEEARLKKPIIHSVNLMGHKGAG---KKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKD--KCRFCNKPGHYQKDCLKRKAWFENK-----------
        +MLIQEEARLKK       ++G   AG   KKP  K+ + + G  K  +S        QI+   KC FC K GH +KDC +RKAWF+ K           
Subjt:  SMLIQEEARLKKPIIHSVNLMGHKGAG---KKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKD--KCRFCNKPGHYQKDCLKRKAWFENK-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------DLDFTDLGICVDCIKGKQTKHTISKE
                                                                                   LDF+DL +C++CIKGKQTKH   K 
Subjt:  --------------------------------------------------------------------------DLDFTDLGICVDCIKGKQTKHTISKE

Query:  ATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFL
        ATRSSQLLE+IHTDI GPFD PS+ GEKYFITFIDD+SRYGY YLLH+K+++++ LK+FI+EVERQL++ VK++RSDRGGE+YG++ E+GQCPGPFAK L
Subjt:  ATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFL

Query:  ESHGICAQYTMPGYTMPGTPQQNDVAE--------------------------------------------------STRIVETGNVRFIENDIINGSLE
        ES GICAQ     YTMPGTPQQN VAE                                                  STRIVETGN RFIEN   +GS E
Subjt:  ESHGICAQYTMPGYTMPGTPQQNDVAE--------------------------------------------------STRIVETGNVRFIENDIINGSLE

Query:  PRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ
         R V I+E+RVE  S +  +Q+VVPVV +  N P             I   E +     ++ LRRS+R RRSAI+DDY+VY  E E DLS+D +P +F +
Subjt:  PRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ

Query:  AIKG-------------IILPN--------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDL
        A++              I   N        ELP   K VGCKW+F TKRDS GN+ER KARLVAKG+TQKDGIDY ETFSPVSKKDSLRI++ALVAHYDL
Subjt:  AIKG-------------IILPN--------ELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDL

Query:  ELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN
        ELHQMDVKTAFLNG L+EE++MDQPEGF+  G E++VC+L++SIYGLKQASRQWY+KFNDTI S+GF E IVDRCIY+K+SGSKF ILVLYVDDIL+A N
Subjt:  ELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN

Query:  DFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGS
        D G+L   K++LS NFEMKDM EASYVIGIEI RDR+ GLL LSQ  YI+KVL+++ M+KCS    PIQKGD+FS MQCPKNELER +ME I YAS+VGS
Subjt:  DFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGS

Query:  LLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIA
        L Y QTCTR DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQG K+YMLTY+RSD++E++ YSDSD+AGCVDTRKSTFGYLFLLA GA+SWK  KQS+IA
Subjt:  LLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIA

Query:  ASTMEAEFVTCFEAT
         STMEAEFV CFEAT
Subjt:  ASTMEAEFVTCFEAT

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.8e-7541.34Show/hide
Query:  PKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEG
        P+    V  +WVF  K +  GN  R KARLVA+G+TQK  IDY+ETF+PV++  S R I++LV  Y+L++HQMDVKTAFLNG L EE++M  P+G  +  
Subjt:  PKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEG

Query:  KEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISG--SKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGI
            VCKL ++IYGLKQA+R W+  F   +    F  + VDRCIY+   G  ++ I ++LYVDD+++AT D   +   K +L + F M D++E  + IGI
Subjt:  KEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISG--SKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGI

Query:  EIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDH
         I  +     + LSQ+AY+ K+L KF M  C++   P+     + L       L  ++       S++G L+Y   CTR D++ AV +L RY S    + 
Subjt:  EIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDH

Query:  WKAAKKVLRYLQGAKDYMLTYKRSDHLE--VIEYSDSDFAGCVDTRKSTFGYLFLLAE-GAISWKSAKQSIIAASTMEAEFVTCFEA
        W+  K+VLRYL+G  D  L +K++   E  +I Y DSD+AG    RKST GYLF + +   I W + +Q+ +AAS+ EAE++  FEA
Subjt:  WKAAKKVLRYLQGAKDYMLTYKRSDHLE--VIEYSDSDFAGCVDTRKSTFGYLFLLAE-GAISWKSAKQSIIAASTMEAEFVTCFEA

P04146 Copia protein1.6e-1028.57Show/hide
Query:  LKRKAWFENKD-LDFTDLG--ICVDCIKGKQTKHTIS--KEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVF
        +KRK  F ++  L+  +L   IC  C+ GKQ +      K+ T   + L ++H+D+ GP    +   + YF+ F+D F+ Y   YL+  KS      + F
Subjt:  LKRKAWFENKD-LDFTDLG--ICVDCIKGKQTKHTIS--KEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVF

Query:  INEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAESTRIVET
        + + E   +  V  L  D G EY               +F    GI         T+P TPQ N V+E  R++ T
Subjt:  INEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAESTRIVET

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-12035.8Show/hide
Query:  CVDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYY
        C  C+ GKQ + +    + R   +L+++++D+ GP ++ S GG KYF+TFIDD SR  ++Y+L  K Q     + F   VER+  R +K LRSD GGEY 
Subjt:  CVDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYY

Query:  GKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE--STRIVET---------------GNVRFIENDIINGS------------------
         +          F ++  SHGI  +      T+PGTPQ N VAE  +  IVE                G        +IN S                  
Subjt:  GKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE--STRIVET---------------GNVRFIENDIINGS------------------

Query:  ----------------------------------------------LEPRKVEI----------QEVRVEIPSSITSSQIVVP---VVVDSVNNP-----
                                                       +P K ++           EVR     S      ++P    +  + NNP     
Subjt:  ----------------------------------------------LEPRKVEI----------QEVRVEIPSSITSSQIVVP---VVVDSVNNP-----

Query:  --QEQQINGQTPHNDIVTNEPVTEGPQEIE-----------LRRSVRSR---RSAISDDYLVYLHESEFDLSIDNDPVSFSQAI----------------
           E    G+ P   I   E + EG +E+E           LRRS R R   R   S +Y++        +S D +P S  + +                
Subjt:  --QEQQINGQTPHNDIVTNEPVTEGPQEIE-----------LRRSVRSR---RSAISDDYLVYLHESEFDLSIDNDPVSFSQAI----------------

Query:  -----KGIILPNELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDE
              G     ELPK  + + CKWVFK K+D +  + R KARLV KG+ QK GID+ E FSPV K  S+R I++L A  DLE+ Q+DVKTAFL+G+L+E
Subjt:  -----KGIILPNELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDE

Query:  EVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLK-ISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFE
        E++M+QPEGF V GK+HMVCKL +S+YGLKQA RQWY+KF+  + S  + +   D C+Y K  S + FIIL+LYVDD+L+   D GL+ + K  LSK+F+
Subjt:  EVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLK-ISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFE

Query:  MKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVG
        MKD+  A  ++G++I R+RT   L LSQ  YI +VLE+F M        P+    K S   CP    E+  M  + Y+S VGSL+YA  CTR DI+ AVG
Subjt:  MKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVG

Query:  MLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFE
        ++ R+  NPG +HW+A K +LRYL+G     L +  SD + +  Y+D+D AG +D RKS+ GYLF  + GAISW+S  Q  +A ST EAE++   E
Subjt:  MLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFE

P25600 Putative transposon Ty5-1 protein YCL074W7.5e-3731.83Show/hide
Query:  MDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGL
        MDV TAFLN  +DE +++ QP GF+ E     V +L   +YGLKQA   W    N+T+   GF  +  +  +Y + +    I + +YVDD+L+A     +
Subjt:  MDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGL

Query:  LCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIS-YASIVGSLLY
          + K+ L+K + MKD+ +    +G+ I +  ++G + LS   YI K   + ++N    +  P+           P  E     ++ I+ Y SIVG LL+
Subjt:  LCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIS-YASIVGSLLY

Query:  AQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAK-QSIIAAS
             R DIS+ V +L R+   P   H ++A++VLRYL   +   L Y+    L +  Y D+      D   ST GY+ LLA   ++W S K + +I   
Subjt:  AQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAK-QSIIAAS

Query:  TMEAEFVTCFE
        + EAE++T  E
Subjt:  TMEAEFVTCFE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.1e-7039.73Show/hide
Query:  VGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVC
        VGC+W+F  K +S+G++ R KARLVAKGY Q+ G+DY ETFSPV K  S+RI++ +       + Q+DV  AFL G L ++V+M QP GF+ + + + VC
Subjt:  VGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVC

Query:  KLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTH
        KL++++YGLKQA R WY++  + + + GF  ++ D  +++   G   + +++YVDDIL+  ND  LL  T + LS+ F +KD  E  Y +GIE  R  T 
Subjt:  KLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTH

Query:  GLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVL
          L LSQ  YI  +L +  M        P+    K SL    K        +   Y  IVGSL Y    TR DIS+AV  L ++   P  +H +A K++L
Subjt:  GLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVL

Query:  RYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEF
        RYL G  ++ +  K+ + L +  YSD+D+AG  D   ST GY+  L    ISW S KQ  +  S+ EAE+
Subjt:  RYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.4e-1229.87Show/hide
Query:  CVDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYY
        C DC+  K  K   S+    S++ LE I++D++    + S    +Y++ F+D F+RY ++Y L +KSQ  +    F N +E +    +    SD GGE+ 
Subjt:  CVDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYY

Query:  GKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE--STRIVETG
          ++           +   HGI +  T P    P TP+ N ++E     IVETG
Subjt:  GKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE--STRIVETG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-6935.54Show/hide
Query:  PSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSV----------RSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQAIK
        P+S +SS    P +   +  P   Q+N Q P N         +G ++   + S           R+   A+ DD       SE +  I N          
Subjt:  PSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSV----------RSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQAIK

Query:  GIILPNELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQ
           L    P     VGC+W+F  K +S+G++ R KARLVAKGY Q+ G+DY ETFSPV K  S+RI++ +       + Q+DV  AFL G L +EV+M Q
Subjt:  GIILPNELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQ

Query:  PEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEA
        P GF+ + +   VC+L+++IYGLKQA R WY++    + + GF  +I D  +++   G   I +++YVDDIL+  ND  LL  T + LS+ F +K+  + 
Subjt:  PEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEA

Query:  SYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQS
         Y +GIE    R    L LSQ  Y   +L +  M        P+    K +L    K        +   Y  IVGSL Y    TR D+S+AV  L +Y  
Subjt:  SYVIGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQS

Query:  NPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEF
         P  DHW A K+VLRYL G  D+ +  K+ + L +  YSD+D+AG  D   ST GY+  L    ISW S KQ  +  S+ EAE+
Subjt:  NPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-1232.47Show/hide
Query:  CVDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYY
        C DC   K  K   S     SS+ LE I++D++    + S    +Y++ F+D F+RY ++Y L +KSQ  D   +F + VE +    +  L SD GGE+ 
Subjt:  CVDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGPFDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYY

Query:  GKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE--STRIVETG
           D           +L  HGI + +T P    P TP+ N ++E     IVE G
Subjt:  GKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAE--STRIVETG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.5e-8039.7Show/hide
Query:  LPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVE
        LP   K +GCKWV+K K +S+G IER KARLVAKGYTQ++GID+ ETFSPV K  S+++I+A+ A Y+  LHQ+D+  AFLNG+LDEE++M  P G+   
Subjt:  LPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVE

Query:  GKEHM----VCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYV
          + +    VC LK+SIYGLKQASRQW+LKF+ T+  FGF ++  D   +LKI+ + F+ +++YVDDI++ +N+   + + K  L   F+++D+    Y 
Subjt:  GKEHM----VCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYV

Query:  IGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPG
        +G+EI R      + + Q  Y   +L++  +  C  S VP+     FS           + ++  +Y  ++G L+Y Q  TRLDISFAV  L ++   P 
Subjt:  IGIEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPG

Query:  MDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELY
        + H +A  K+L Y++G     L Y     +++  +SD+ F  C DTR+ST GY   L    ISWKS KQ +++ S+ EAE+     AT       + +
Subjt:  MDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELY

AT5G53670.1 unknown protein2.7e-1327.06Show/hide
Query:  SLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTI-KNTEDAKEFMKSVKKC
        S  S+  SI   +G NFS+W E +   L ++DLDL+L++E+P +             K W+RSNR+S+M +++ +    +  +  +   AK+F+ S++  
Subjt:  SLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTI-KNTEDAKEFMKSVKKC

Query:  F-QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGME---VNENFLVMFILNSLPSEYGPFHMNYNTLKDK-------------WNV
        F ++E A++S      S+++ I+   +  + E I+ M  L A+ K +G+     N+  L    +  LP +Y      Y+ L+ K             W+ 
Subjt:  F-QSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGME---VNENFLVMFILNSLPSEYGPFHMNYNTLKDK-------------WNV

Query:  HELQSMLIQEEARLKKPI
         EL S    EE  L+  I
Subjt:  HELQSMLIQEEARLKKPI

ATMG00240.1 Gag-Pol-related retrotransposon family protein7.5e-0836.71Show/hide
Query:  TCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLL
        T TR D++FAV  L ++ S       +A  KVL Y++G     L Y  +  L++  ++DSD+A C DTR+S  G+  L+
Subjt:  TCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLL

ATMG00810.1 DNA/RNA polymerases superfamily protein1.2e-2434.82Show/hide
Query:  LVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTH-GLLGLSQNAYINKVLEKFKMNKCS--SSVVPIQKGDKFSLMQCPKNEL
        L+LYVDDILL  +   LL      LS  F MKD+    Y +GI+I   +TH   L LSQ  Y  ++L    M  C   S+ +P++     S  + P    
Subjt:  LVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTH-GLLGLSQNAYINKVLEKFKMNKCS--SSVVPIQKGDKFSLMQCPKNEL

Query:  ERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLL
             +   + SIVG+L Y  T TR DIS+AV ++ +    P +  +   K+VLRY++G   + L   ++  L V  + DSD+AGC  TR+ST G+   L
Subjt:  ERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLL

Query:  AEGAISWKSAKQSIIAASTMEAEF
            ISW + +Q  ++ S+ E E+
Subjt:  AEGAISWKSAKQSIIAASTMEAEF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.5e-1151.67Show/hide
Query:  PKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIM
        P     +GCKWVFKTK  S+G ++R KARLVAKG+ Q++GI + ET+SPV +  ++R I+
Subjt:  PKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTATAAGCATATTTATTTTGTAATTGCAGCATCTGCTCCTGTTTCTCTTTATTCGCATGCTACATCTATAATAAAGTTTAATGGACTCAATTTCTCTGATTGGTG
CGAACAAATCCGATTCCATCTTGGAGTTTTGGATCTTGATTTAGCACTTTTAAGTGAGAAACCTGGTGCAATTACTTCTGCTAACAGTGATGAGGATAGATCTTTCTATA
AAGCTTGGGAAAGATCAAATAGATTGAGCTTAATGTTTATACGAATGACTGTAGCAAACAATATTAAGTTCACAATTAAGAACACTGAAGATGCTAAGGAATTTATGAAA
TCTGTGAAAAAATGTTTTCAGTCAGAGTCGGCTGACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCA
TATCCTTGAAATGACGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATGAGAATTTTTTGGTAATGTTTATCCTTAATTCCTTACCTTCAGAGTATGGTC
CATTTCACATGAACTATAACACTCTGAAAGATAAATGGAATGTGCATGAATTACAAAGTATGCTCATTCAAGAGGAAGCGAGGCTTAAGAAACCAATAATTCACTCTGTC
AATCTCATGGGTCACAAAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGGCAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGG
ACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCAGGACACTATCAAAAAGATTGTTTAAAACGTAAGGCATGGTTCGAGAATAAAGATTTGGATTTTACTGACCTTG
GAATTTGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAATTAGTAAAGAAGCCACAAGAAGCTCACAGCTTCTTGAAATTATACACACTGATATTTATGGGCCT
TTTGATGTTCCATCTTTTGGTGGAGAAAAGTATTTTATCACATTTATTGATGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGC
CTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAAT
GCCCTGGTCCATTCGCTAAATTCCTAGAAAGCCATGGCATATGTGCTCAATACACAATGCCAGGATACACAATGCCAGGAACACCACAACAAAATGATGTTGCAGAAAGT
ACGAGAATAGTTGAAACTGGAAATGTAAGGTTCATTGAGAATGACATAATTAATGGGAGTTTGGAACCGCGAAAAGTAGAAATTCAAGAAGTTAGGGTGGAAATTCCTTC
CTCTATAACTTCTTCTCAAATTGTTGTTCCTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGGTCAAACACCACATAATGATATTGTAACAAATG
AACCTGTAACTGAGGGACCACAAGAAATAGAATTAAGAAGATCTGTAAGATCAAGAAGATCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGAC
TTAAGCATTGATAATGATCCAGTTTCGTTTTCACAAGCCATTAAAGGAATAATTCTACCAAATGAATTGCCTAAAAAAAGTAAAAGAGTTGGGTGTAAATGGGTCTTTAA
GACCAAACGTGACTCAAATGGCAATATTGAACGATGCAAGGCTAGACTTGTTGCCAAAGGTTATACTCAGAAAGATGGCATTGACTACAAAGAGACTTTTTCTCCTGTCT
CGAAAAAGGACTCATTAAGAATTATTATGGCTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAA
GTGTTCATGGATCAACCAGAAGGTTTTATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAGACAGTGGTATCT
TAAGTTTAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTG
ATGACATCTTGCTTGCTACAAATGACTTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGAGTGAGGCATCCTATGTGATTGGA
ATTGAAATATTCCGTGATCGAACACATGGATTGTTAGGATTGTCTCAAAATGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGAACAAATGCTCTTCAAGTGTAGT
TCCAATTCAGAAGGGAGATAAATTTAGTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTTCTTATGCATCTATTGTTGGAAGCTTATTGT
ATGCACAAACTTGCACTAGACTAGACATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGG
TATCTGCAAGGAGCAAAAGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGAATATTCAGATTCAGATTTTGCCGGATGTGTGGATACAAGAAAATC
CACTTTTGGCTATTTGTTTCTTTTAGCTGAAGGAGCAATTTCATGGAAAAGTGCAAAGCAGTCTATTATCGCTGCATCCACTATGGAAGCTGAATTTGTAACATGCTTTG
AGGCTACCAGTTCATGGTTTATGGCTGTGGAACTTTATCTCAGGACTTGGAATTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGTATAAGCATATTTATTTTGTAATTGCAGCATCTGCTCCTGTTTCTCTTTATTCGCATGCTACATCTATAATAAAGTTTAATGGACTCAATTTCTCTGATTGGTG
CGAACAAATCCGATTCCATCTTGGAGTTTTGGATCTTGATTTAGCACTTTTAAGTGAGAAACCTGGTGCAATTACTTCTGCTAACAGTGATGAGGATAGATCTTTCTATA
AAGCTTGGGAAAGATCAAATAGATTGAGCTTAATGTTTATACGAATGACTGTAGCAAACAATATTAAGTTCACAATTAAGAACACTGAAGATGCTAAGGAATTTATGAAA
TCTGTGAAAAAATGTTTTCAGTCAGAGTCGGCTGACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCA
TATCCTTGAAATGACGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATGAGAATTTTTTGGTAATGTTTATCCTTAATTCCTTACCTTCAGAGTATGGTC
CATTTCACATGAACTATAACACTCTGAAAGATAAATGGAATGTGCATGAATTACAAAGTATGCTCATTCAAGAGGAAGCGAGGCTTAAGAAACCAATAATTCACTCTGTC
AATCTCATGGGTCACAAAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGGCAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGG
ACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCAGGACACTATCAAAAAGATTGTTTAAAACGTAAGGCATGGTTCGAGAATAAAGATTTGGATTTTACTGACCTTG
GAATTTGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAATTAGTAAAGAAGCCACAAGAAGCTCACAGCTTCTTGAAATTATACACACTGATATTTATGGGCCT
TTTGATGTTCCATCTTTTGGTGGAGAAAAGTATTTTATCACATTTATTGATGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGC
CTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAAT
GCCCTGGTCCATTCGCTAAATTCCTAGAAAGCCATGGCATATGTGCTCAATACACAATGCCAGGATACACAATGCCAGGAACACCACAACAAAATGATGTTGCAGAAAGT
ACGAGAATAGTTGAAACTGGAAATGTAAGGTTCATTGAGAATGACATAATTAATGGGAGTTTGGAACCGCGAAAAGTAGAAATTCAAGAAGTTAGGGTGGAAATTCCTTC
CTCTATAACTTCTTCTCAAATTGTTGTTCCTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGGTCAAACACCACATAATGATATTGTAACAAATG
AACCTGTAACTGAGGGACCACAAGAAATAGAATTAAGAAGATCTGTAAGATCAAGAAGATCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGAC
TTAAGCATTGATAATGATCCAGTTTCGTTTTCACAAGCCATTAAAGGAATAATTCTACCAAATGAATTGCCTAAAAAAAGTAAAAGAGTTGGGTGTAAATGGGTCTTTAA
GACCAAACGTGACTCAAATGGCAATATTGAACGATGCAAGGCTAGACTTGTTGCCAAAGGTTATACTCAGAAAGATGGCATTGACTACAAAGAGACTTTTTCTCCTGTCT
CGAAAAAGGACTCATTAAGAATTATTATGGCTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAA
GTGTTCATGGATCAACCAGAAGGTTTTATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAGACAGTGGTATCT
TAAGTTTAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTG
ATGACATCTTGCTTGCTACAAATGACTTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGAGTGAGGCATCCTATGTGATTGGA
ATTGAAATATTCCGTGATCGAACACATGGATTGTTAGGATTGTCTCAAAATGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGAACAAATGCTCTTCAAGTGTAGT
TCCAATTCAGAAGGGAGATAAATTTAGTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTTCTTATGCATCTATTGTTGGAAGCTTATTGT
ATGCACAAACTTGCACTAGACTAGACATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGG
TATCTGCAAGGAGCAAAAGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGAATATTCAGATTCAGATTTTGCCGGATGTGTGGATACAAGAAAATC
CACTTTTGGCTATTTGTTTCTTTTAGCTGAAGGAGCAATTTCATGGAAAAGTGCAAAGCAGTCTATTATCGCTGCATCCACTATGGAAGCTGAATTTGTAACATGCTTTG
AGGCTACCAGTTCATGGTTTATGGCTGTGGAACTTTATCTCAGGACTTGGAATTGTTGA
Protein sequenceShow/hide protein sequence
MMYKHIYFVIAASAPVSLYSHATSIIKFNGLNFSDWCEQIRFHLGVLDLDLALLSEKPGAITSANSDEDRSFYKAWERSNRLSLMFIRMTVANNIKFTIKNTEDAKEFMK
SVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSMLIQEEARLKKPIIHSV
NLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKDLDFTDLGICVDCIKGKQTKHTISKEATRSSQLLEIIHTDIYGP
FDVPSFGGEKYFITFIDDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGYTMPGTPQQNDVAES
TRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFD
LSIDNDPVSFSQAIKGIILPNELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEE
VFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIG
IEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLR
YLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNC