CuGenDBv2

Lag0005727 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005727
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr6:27234325..27260339
RNA-Seq Expression: Lag0005727
Synteny: Lag0005727
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029058 - Alpha/Beta hydrolase fold
IPR043502 - DNA/RNA polymerase superfamily


Homology

GenBank top hits (e value, % identity, alignment)

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus] (e value: 1.6e-277; % identity: 69.24)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE
        MPSSVL G+IPY VLFPTK LFPI PKIFGC CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+  RYLVS DV FFEDTPF SS  S  Q E+
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE

Query:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS
        D+LFIY +  PTPS                S+ + P+ P I+QVYSRR   QP   C      SS DP PSD+LPIALRKGKR C +P+SSF+SY  LS 
Subjt:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS

Query:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA
        ST +F+ SL+S SIP +VHEALSHPGW+NAMIEEMTALDDNGTWDLVSRP GKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVA
Subjt:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA

Query:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------
        KLTS+RLF+SMAA+  W LHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGE+DKVCRLRKSLYGLKQSPRAWFG+FSQAL  FGMKKS SDHSV       
Subjt:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------

Query:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------
                              + L+      F+TKDLG LKYFLGIEVMRSKK                                              
Subjt:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------

Query:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
                     VTRPDIAYSVS+VSQFMSSPTV+HWAAV+QILCYLK APGRGILYKDHGH ++ECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
Subjt:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS

Query:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT
        KKQNVVSRSSAESEYRAMAQSVCEI+W+HQLL E+GF++TVP KLWCDNQAALHIASNPVFHERTKHIE+DCHF+REKIQ GLV+TGYVKTGEQLGDI T
Subjt:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT

Query:  KALNGVRIDYLC
        KALNG RI YLC
Subjt:  KALNGVRIDYLC

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus] (e value: 1.6e-277; % identity: 69.24)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE
        MPSSVL G+IPY VLFPTK LFPI PKIFGC CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+  RYLVS DV FFEDTPF SS  S  Q E+
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE

Query:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS
        D+LFIY +  PTPS                S+ + P+ P I+QVYSRR   QP   C      SS DP PSD+LPIALRKGKR C +P+SSF+SY  LS 
Subjt:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS

Query:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA
        ST +F+ SL+S SIP +VHEALSHPGW+NAMIEEMTALDDNGTWDLVSRP GKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVA
Subjt:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA

Query:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------
        KLTS+RLF+SMAA+  W LHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGE+DKVCRLRKSLYGLKQSPRAWFG+FSQAL  FGMKKS SDHSV       
Subjt:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------

Query:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------
                              + L+      F+TKDLG LKYFLGIEVMRSKK                                              
Subjt:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------

Query:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
                     VTRPDIAYSVS+VSQFMSSPTV+HWAAV+QILCYLK APGRGILYKDHGH ++ECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
Subjt:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS

Query:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT
        KKQNVVSRSSAESEYRAMAQSVCEI+W+HQLL E+GF++TVP KLWCDNQAALHIASNPVFHERTKHIE+DCHF+REKIQ GLV+TGYVKTGEQLGDI T
Subjt:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT

Query:  KALNGVRIDYLC
        KALNG RI YLC
Subjt:  KALNGVRIDYLC

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus] (e value: 1.6e-277; % identity: 69.24)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE
        MPSSVL G+IPY VLFPTK LFPI PKIFGC CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+  RYLVS DV FFEDTPF SS  S  Q E+
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE

Query:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS
        D+LFIY +  PTPS                S+ + P+ P I+QVYSRR   QP   C      SS DP PSD+LPIALRKGKR C +P+SSF+SY  LS 
Subjt:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS

Query:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA
        ST +F+ SL+S SIP +VHEALSHPGW+NAMIEEMTALDDNGTWDLVSRP GKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVA
Subjt:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA

Query:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------
        KLTS+RLF+SMAA+  W LHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGE+DKVCRLRKSLYGLKQSPRAWFG+FSQAL  FGMKKS SDHSV       
Subjt:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------

Query:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------
                              + L+      F+TKDLG LKYFLGIEVMRSKK                                              
Subjt:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------

Query:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
                     VTRPDIAYSVS+VSQFMSSPTV+HWAAV+QILCYLK APGRGILYKDHGH ++ECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
Subjt:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS

Query:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT
        KKQNVVSRSSAESEYRAMAQSVCEI+W+HQLL E+GF++TVP KLWCDNQAALHIASNPVFHERTKHIE+DCHF+REKIQ GLV+TGYVKTGEQLGDI T
Subjt:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT

Query:  KALNGVRIDYLC
        KALNG RI YLC
Subjt:  KALNGVRIDYLC

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus] (e value: 1.6e-277; % identity: 69.24)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE
        MPSSVL G+IPY VLFPTK LFPI PKIFGC CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+  RYLVS DV FFEDTPF SS  S  Q E+
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE

Query:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS
        D+LFIY +  PTPS                S+ + P+ P I+QVYSRR   QP   C      SS DP PSD+LPIALRKGKR C +P+SSF+SY  LS 
Subjt:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS

Query:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA
        ST +F+ SL+S SIP +VHEALSHPGW+NAMIEEMTALDDNGTWDLVSRP GKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVA
Subjt:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA

Query:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------
        KLTS+RLF+SMAA+  W LHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGE+DKVCRLRKSLYGLKQSPRAWFG+FSQAL  FGMKKS SDHSV       
Subjt:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------

Query:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------
                              + L+      F+TKDLG LKYFLGIEVMRSKK                                              
Subjt:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------

Query:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
                     VTRPDIAYSVS+VSQFMSSPTV+HWAAV+QILCYLK APGRGILYKDHGH ++ECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
Subjt:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS

Query:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT
        KKQNVVSRSSAESEYRAMAQSVCEI+W+HQLL E+GF++TVP KLWCDNQAALHIASNPVFHERTKHIE+DCHF+REKIQ GLV+TGYVKTGEQLGDI T
Subjt:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT

Query:  KALNGVRIDYLC
        KALNG RI YLC
Subjt:  KALNGVRIDYLC

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus] (e value: 1.6e-277; % identity: 69.24)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE
        MPSSVL G+IPY VLFPTK LFPI PKIFGC CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+  RYLVS DV FFEDTPF SS  S  Q E+
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE

Query:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS
        D+LFIY +  PTPS                S+ + P+ P I+QVYSRR   QP   C      SS DP PSD+LPIALRKGKR C +P+SSF+SY  LS 
Subjt:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRR--QQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSS

Query:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA
        ST +F+ SL+S SIP +VHEALSHPGW+NAMIEEMTALDDNGTWDLVSRP GKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVA
Subjt:  STCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVA

Query:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------
        KLTS+RLF+SMAA+  W LHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGE+DKVCRLRKSLYGLKQSPRAWFG+FSQAL  FGMKKS SDHSV       
Subjt:  KLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV-------

Query:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------
                              + L+      F+TKDLG LKYFLGIEVMRSKK                                              
Subjt:  --------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK----------------------------------------------

Query:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
                     VTRPDIAYSVS+VSQFMSSPTV+HWAAV+QILCYLK APGRGILYKDHGH ++ECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
Subjt:  -------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS

Query:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT
        KKQNVVSRSSAESEYRAMAQSVCEI+W+HQLL E+GF++TVP KLWCDNQAALHIASNPVFHERTKHIE+DCHF+REKIQ GLV+TGYVKTGEQLGDI T
Subjt:  KKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFT

Query:  KALNGVRIDYLC
        KALNG RI YLC
Subjt:  KALNGVRIDYLC

TrEMBL top hits (e value, % identity, alignment)

A0A438G5Y3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.3e-240; % identity: 61.06)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLP-STSQEE
        MPS+VL  QIPY +LFP K LFP+EP+IFG TC+VRDVRP++TKLDPK+LKC+FLGYSR+QKGYRC+ P  N+Y+VS DV F EDTPF SS P S S+ E
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLP-STSQEE

Query:  EDDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSC--AFPISSFVSYDHLS
         ++  IY    P+ STD S    P   +D  S+   P  PPI Q YSR Q+    C  P  SS SDP    +LPI LRKGKR C   + I++FVSYD LS
Subjt:  EDDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSC--AFPISSFVSYDHLS

Query:  SSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPV
         S+ +FVASLDS+SIPKT+ EAL+HPGW NAM+EE+ AL+ N TW+LV  P GK  +GCKWVFAIKVNP+GSVARLK RLVAKGYAQTYG+DY DTFSPV
Subjt:  SSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPV

Query:  AKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV----ST
        A+L SVRL IS+AASQ WPLHQ+DIKNAFLHGDLQEEVYMEQPPGFVAQGE  KVC LRKSLYGLKQSPRAWFG+FS+ +++FGM KSK DHSV    S 
Subjt:  AKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV----ST

Query:  NELRV----------------------------FHTKDLGTLKYFLGIEVMRSKK---------------------------------------------
        N + +                            FHTKDLG LKYFLG+EV RSK+                                             
Subjt:  NELRV----------------------------FHTKDLGTLKYFLGIEVMRSKK---------------------------------------------

Query:  ---------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSW
                       VTRPDIAY+VSIVSQFM +PTV+HWAA++QILCYLK APG GILY +HGH +IECF+DADWAGS+ DRRST+G+CVFVG NLVSW
Subjt:  ---------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSW

Query:  KSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDI
        KSKKQNVVSRSSAES+YRAMAQ+ CEIMW++ LLIE+G    +  KL CDNQAA+HIASNPV+HERTKHIE+DCHF+REKIQ+ L++T YVKTGEQLGDI
Subjt:  KSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDI

Query:  FTKALNGVRIDYLC
        FTKALNG +++Y C
Subjt:  FTKALNGVRIDYLC

A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.2e-245; % identity: 60.39)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE
        MP+ VLK  IPY V+ P K LFP+ P+IFGCTC+VRD RP + KLDPK+L+C+FLGYSR+QKGYRC+ P  N+YLVS DV F EDT F SS  S++ EE+
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE

Query:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTL----VPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSC--AFPISSFVSYD
        ++  +Y +V   P+   S +V    SL PS   +     P  PPI QVYSRR      C  P   SSSDP    +LPI+LRKGKR C   + I++FVSYD
Subjt:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTL----VPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSC--AFPISSFVSYD

Query:  HLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTF
        HLSSS+   VAS+DSIS+PKTV EAL+HPGW+NAM+EE+ AL+DN TW LV  P+GKK +GCKWVFA+KVNPDGSVARLKARLVA+GYAQTYG+DY DTF
Subjt:  HLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTF

Query:  SPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV---
        SPVAKL SVRLFIS+AASQ W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFG+FS+ ++ FGM KS+ DHSV   
Subjt:  SPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV---

Query:  ------------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK------------------------------------------
                                  ++L+      FHTKDLG LKYFLGIEV RSKK                                          
Subjt:  ------------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK------------------------------------------

Query:  ------------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNL
                          VTRPDIAY+VS+VSQF S+PT++HWAA++QILCYLK APG GILY   GH +IECFSDADWAGS+ DRRST+GYCVF GGNL
Subjt:  ------------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNL

Query:  VSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQL
        V+WKSKKQ+VVSRSSAESEYRAM+Q+ CEI+W+HQLL E+G   T+P KLWCDNQAALHIA+NPV+HERTKHIE+DCHF+REKI++ LV+TGYVKTGEQL
Subjt:  VSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQL

Query:  GDIFTKALNGVRIDYLC
        GDIFTKALNG R++Y C
Subjt:  GDIFTKALNGVRIDYLC

A0A438HEX0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.2e-244; % identity: 60.64)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE
        MP+ VLKG IPY V+ P K LF + P+IFGCTC+VRD RP +TKLDPK+L+C+FLGYSR+QKGYRC+ P  N+YLVS DV F EDT F SS  S++ EE+
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE

Query:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTL----VPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSC--AFPISSFVSYD
        ++  +Y +V   P+   S +V    SL PS   +     P  PPI QVYSRR      C  P   SSSDP    +LPI+LRKGKR C   + I++FVSYD
Subjt:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTL----VPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSC--AFPISSFVSYD

Query:  HLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTF
        HLSSS+   VAS+DSIS+PKTV EAL+HPGW+NAM+EE+ AL DN TW LV  P+GKK +GCKWVFA+KVNPDGSVARLKARLVA+GYAQTYG+DY DTF
Subjt:  HLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTF

Query:  SPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV---
        SPVAKL SVRLFIS+AASQ W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFG+FS+ ++ FGM KS+ DHSV   
Subjt:  SPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV---

Query:  ------------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK------------------------------------------
                                  ++L+      FHTKDLG LKYFLGIEV RSKK                                          
Subjt:  ------------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK------------------------------------------

Query:  ------------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNL
                          VTRPDIAY+VS+VSQF S+PT++HWAA++QILCYLK APG GILY   GH +IECFSDADWAGS+ DRRST+GYCVF GGNL
Subjt:  ------------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNL

Query:  VSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQL
        V+WKSKKQ+VVSRSSAESEYRAM+Q+ CEI+W+HQLL E+G   T+P KLWCDNQAALHIA+NPV+HERTKHIE+DCHF+REKI++ LV+TGYVKTGEQL
Subjt:  VSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQL

Query:  GDIFTKALNGVRID
        GDIFTKALNG R+D
Subjt:  GDIFTKALNGVRID

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.4e-245; % identity: 60.39)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE
        MP+ VLKG IPY V+ P K LFP+ P+IFGCTC+VRD RP +TKLDPK+L+C+FLGYSR+QKGYRC+ P  N+YLVS DV F EDT F SS  S++ EE+
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE

Query:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTL----VPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKR--SCAFPISSFVSYD
        ++  +Y +V   P+   S +V    SL PS   +     P  PPI QVYSRR      C  P   SSSDP    +LPI+LRKGKR     + I++FVSYD
Subjt:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTL----VPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKR--SCAFPISSFVSYD

Query:  HLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTF
        HLSSS+   VAS+DSIS+PKTV EAL+HPGW+NAM+EE+ AL+DN TW LV  P+GKK +GCKWVFA+KVN DGSVARLKARLVA+GYAQTYG+DY DTF
Subjt:  HLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTF

Query:  SPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV---
        SPVAKL SVRLFIS+AASQ W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFG+FS+ ++ FGM KS+ DHSV   
Subjt:  SPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV---

Query:  ------------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK------------------------------------------
                                  ++L+      FHTKDLG LKYFLGIEV RSKK                                          
Subjt:  ------------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK------------------------------------------

Query:  ------------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNL
                          VTRPDIAY+VS+VSQF S+PT++HWAA++QILCYLK APG GILY   GH +IECFSDADWAGS+ DRRST+GYCVF GGNL
Subjt:  ------------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNL

Query:  VSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQL
        V+WKSKKQ+VVSRSSAESEYRAMAQ+ CEI+W+HQLL E+G   T+P KLWCDNQAALHIA+NP++HERTKHIE+DCHF+REKI++ LV+TGYVKTGEQL
Subjt:  VSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQL

Query:  GDIFTKALNGVRIDYLC
        GDIFTKALNG R++Y C
Subjt:  GDIFTKALNGVRIDYLC

B0FBS2 Uncharacterized protein (e value: 6.9e-247; % identity: 60.67)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE
        MP+ VLKG IPY V+ P K LFP+ P+IFGCTC+VRD RP +TKLDPK+L+C+FLGYSR+QKGYRC+ P  N+YLVS DV F EDT F SS  S++ EE+
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEE

Query:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTL----VPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSC--AFPISSFVSYD
        ++  +Y +V   P+   S +V    SL PS   +     P  PPI QVYSRR      C  P   SSSDP    +LPI+LRKGKR C   + I++FVSYD
Subjt:  DDLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTL----VPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSC--AFPISSFVSYD

Query:  HLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTF
        HLSSS+   VAS+DSIS+PKTV EAL+HPGW+NAM+EE+ AL+DN TW LV  P+GKK +GCKWVFA+KVNPDGSVARLKARLVA+GYAQTYG+DY DTF
Subjt:  HLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTF

Query:  SPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV---
        SPVAKL SVRLFIS+AASQ W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFG+FS+ ++ FGM KS+ DHSV   
Subjt:  SPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV---

Query:  ------------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK------------------------------------------
                                  ++L+      FHTKDLG LKYFLGIEV RSKK                                          
Subjt:  ------------------------STNELRV-----FHTKDLGTLKYFLGIEVMRSKK------------------------------------------

Query:  ------------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNL
                          VTRPDIAY+VS+VSQF S+PT++HWAA++QILCYLK APG GILY   GH +IECFSDADWAGS+ DRRST+GYCVF GGNL
Subjt:  ------------------VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNL

Query:  VSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQL
        V+WKSKKQ+VVSRSSAESEYRAM+Q+ CEI+W+HQLL E+G   T+P KLWCDNQAALHIA+NPV+HERTKHIE+DCHF+REKI++ LV+TGYVKTGEQL
Subjt:  VSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQL

Query:  GDIFTKALNGVRIDYLC
        GDIFTKALNG R++Y C
Subjt:  GDIFTKALNGVRIDYLC

SwissProt top hits (e value, % identity, alignment)

P04146 Copia protein (e value: 1.7e-72; % identity: 27.71)
Query:  MPSSVL--KGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFED-------TPFKSS
        +PS  L    + PY +    KP      ++FG T +V  ++    K D KS K IF+GY     G++ +   + +++V+RDV   E          F++ 
Subjt:  MPSSVL--KGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFED-------TPFKSS

Query:  LPSTSQEEEDDLF-------IYTLVP-----------------------PTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVY---------------S
            S+E E+  F       I T  P                       P  S        P  S +  +   +       + +               S
Subjt:  LPSTSQEEEDDLF-------IYTLVP-----------------------PTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVY---------------S

Query:  RRQQPPGECLVPQDSSS------SDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSSSTCSFVASLDSI--SIPKTVHEAL---SHPGWRNAMIEEMTA
        +    P E    + +         +P  +D + I  R+ +R    P    +SY+   +S    V +  +I   +P +  E         W  A+  E+ A
Subjt:  RRQQPPGECLVPQDSSS------SDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSSSTCSFVASLDSI--SIPKTVHEAL---SHPGWRNAMIEEMTA

Query:  LDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEV
           N TW +  RPE K  +  +WVF++K N  G+  R KARLVA+G+ Q Y IDY +TF+PVA+++S R  +S+       +HQ+D+K AFL+G L+EE+
Subjt:  LDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEV

Query:  YMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV------STNE----------------------------LRVFHT
        YM  P G     +N  VC+L K++YGLKQ+ R WF  F QAL++     S  D  +      + NE                            +  F  
Subjt:  YMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSV------STNE----------------------------LRVFHT

Query:  KDLGTLKYFLGIEV-MRSKKV-----------------------------------------------------------TRPDIAYSVSIVSQFMSSPT
         DL  +K+F+GI + M+  K+                                                           TRPD+  +V+I+S++ S   
Subjt:  KDLGTLKYFLGIEV-MRSKKV-----------------------------------------------------------TRPDIAYSVSIVSQFMSSPT

Query:  VEHWAAVQQILCYLKVAPGRGILYKDH--GHMKIECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQL
         E W  ++++L YLK      +++K +     KI  + D+DWAGS  DR+ST+GY       NL+ W +K+QN V+ SS E+EY A+ ++V E +W+  L
Subjt:  VEHWAAVQQILCYLKVAPGRGILYKDH--GHMKIECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQL

Query:  LIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFTKALNGVR
        L  +   +  P K++ DNQ  + IA+NP  H+R KHI+I  HF RE++Q  ++   Y+ T  QL DIFTK L   R
Subjt:  LIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFTKALNGVR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.2e-86; % identity: 30.98)
Query:  PSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEED
        PS  L  +IP  V +  K +     K+FGC  F    +   TKLD KS+ CIF+GY   + GYR + P   + + SRDV F E      +    S++ ++
Subjt:  PSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEED

Query:  DLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRRQQP-----PGECLVPQDSSSSDPGPSDELPIALRKGKR----SCAFPISSFVS
         +    +  P+ S +P+   S T                  +V  + +QP      GE L         P   +E    LR+ +R    S  +P + +V 
Subjt:  DLFIYTLVPPTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRRQQP-----PGECLVPQDSSSSDPGPSDELPIALRKGKR----SCAFPISSFVS

Query:  YDHLSSSTCSFVASLDSISIPKTVHEALSHP---GWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGID
                      +     P+++ E LSHP       AM EEM +L  NGT+ LV  P+GK+ + CKWVF +K + D  + R KARLV KG+ Q  GID
Subjt:  YDHLSSSTCSFVASLDSISIPKTVHEALSHP---GWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGID

Query:  YFDTFSPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDH
        + + FSPV K+TS+R  +S+AAS    + QLD+K AFLHGDL+EE+YMEQP GF   G+   VC+L KSLYGLKQ+PR W+ +F   ++     K+ SD 
Subjt:  YFDTFSPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDH

Query:  SV-----STNEL----------------------------RVFHTKDLGTLKYFLGIEVMR---------------------------------------
         V     S N                              + F  KDLG  +  LG++++R                                       
Subjt:  SV-----STNEL----------------------------RVFHTKDLGTLKYFLGIEVMR---------------------------------------

Query:  -SKKV-----------------------------TRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSRED
         SKK+                             TRPDIA++V +VS+F+ +P  EHW AV+ IL YL+   G  + +     + ++ ++DAD AG  ++
Subjt:  -SKKV-----------------------------TRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSRED

Query:  RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQ
        R+S++GY     G  +SW+SK Q  V+ S+ E+EY A  ++  E++W+ + L ELG +      ++CD+Q+A+ ++ N ++H RTKHI++  H++RE + 
Subjt:  RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQ

Query:  QGLVATGYVKTGEQLGDIFTKAL
           +    + T E   D+ TK +
Subjt:  QGLVATGYVKTGEQLGDIFTKAL

Q93V61 Phospholipase A(1) LCAT3 (e value: 1.3e-88; % identity: 65.25)
Query:  APGCINDCLLTGLQFVEGFESQFFVSRWTFHQLLVECPSIYEMLANLGFKWHAQPQIQVWKKSSVNG-KTSVDLKSYDPIDSIALFEEALRNNEVKFHGK
        APGCIND +LTG+QFVEG ES FFVSRWT HQLLVECPSIYEM+AN  FKW  QP+I+VW+K S N   TSV+L+S+  I+SI LF +AL+NNE+ + G 
Subjt:  APGCINDCLLTGLQFVEGFESQFFVSRWTFHQLLVECPSIYEMLANLGFKWHAQPQIQVWKKSSVNG-KTSVDLKSYDPIDSIALFEEALRNNEVKFHGK

Query:  TIPLPFNFDILKWAAGTRQVIDSAKLPDGISFYNIYGTSFDTPFDVCYGSESLPIEDLSEICKTLPQYSYVDGDGTVPSESAKADIFEATERVGVAASHR
         I LPFNF IL WAA TR++++ A+LPDG+SFYNIYG S +TPFDVCYG+E+ PI+DLSEIC+T+P+Y+YVDGDGTVP+ESA A  F+A   VGV+ SHR
Subjt:  TIPLPFNFDILKWAAGTRQVIDSAKLPDGISFYNIYGTSFDTPFDVCYGSESLPIEDLSEICKTLPQYSYVDGDGTVPSESAKADIFEATERVGVAASHR

Query:  GLLKDEKVFQHIQKWLGVDQELS--KHLTTSKVVDA
        GLL+DE+VF+ IQ+WLGV+ + +  KHL T KVVD+
Subjt:  GLLKDEKVFQHIQKWLGVDQELS--KHLTTSKVVDA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.1e-107; % identity: 34.35)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRP-NLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDT-PFKSSLPSTSQE
        +P+ +L+ + P+  LF T P +  + ++FGC C+   +RP N  KLD KS +C+FLGYS  Q  Y C    ++R  +SR V F E+  PF + L + S  
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRP-NLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDT-PFKSSLPSTSQE

Query:  EED------------DLFIYTLVPPTPS-TDPSPLVSPTPS--------------LDPSSSTLVPTCPPITQVYSRRQQPPGECLVPQ-----DSSSSDP
        +E              L   T V P PS +DP    +P  S              LD S S+  P+ P  T       QP  +    Q       ++S  
Subjt:  EED------------DLFIYTLVPPTPS-TDPSPLVSPTPS--------------LDPSSSTLVPTCPPITQVYSRRQQPPGECLVPQ-----DSSSSDP

Query:  GPSDELPIALRKGKRSCAFPISSFVSYDHLSSSTC------------------------------------------------SFVASLDSISIPKTVHE
         P++E P  L +   + A   SS  S    +SS+                                                 S   SL + S P+T  +
Subjt:  GPSDELPIALRKGKRSCAFPISSFVSYDHLSSSTC------------------------------------------------SFVASLDSISIPKTVHE

Query:  ALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAI-GCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSVRLFISMAASQCWPL
        AL    WRNAM  E+ A   N TWDLV  P     I GC+W+F  K N DGS+ R KARLVAKGY Q  G+DY +TFSPV K TS+R+ + +A  + WP+
Subjt:  ALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAI-GCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSVRLFISMAASQCWPL

Query:  HQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSD-----------------------------
         QLD+ NAFL G L ++VYM QPPGF+ +   + VC+LRK+LYGLKQ+PRAW+      L   G   S SD                             
Subjt:  HQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSD-----------------------------

Query:  ---HSVSTNELRVFHTKDLGTLKYFLGIEVMR----------------------------------SKKV--------------------------TRPD
           H+   N  + F  KD   L YFLGIE  R                                  S K+                          TRPD
Subjt:  ---HSVSTNELRVFHTKDLGTLKYFLGIEVMR----------------------------------SKKV--------------------------TRPD

Query:  IAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAM
        I+Y+V+ +SQFM  PT EH  A+++IL YL   P  GI  K    + +  +SDADWAG ++D  ST+GY V++G + +SW SKKQ  V RSS E+EYR++
Subjt:  IAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAM

Query:  AQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFTKALN
        A +  E+ W+  LL ELG  +T P  ++CDN  A ++ +NPVFH R KHI ID HF+R ++Q G +   +V T +QL D  TK L+
Subjt:  AQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFTKALN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 4.3e-105; % identity: 33.96)
Query:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRP-NLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDT-PFKSSL--PSTS
        +P+ +L+ Q P+  LF   P +  + K+FGC C+   +RP N  KL+ KS +C F+GYS  Q  Y C    + R   SR V F E   PF ++    STS
Subjt:  MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRP-NLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDT-PFKSSL--PSTS

Query:  QEEEDDLF----IYTLVPPTP-------------STDPSPLVSPTP--------SLDPSSSTLVPTCPPITQVYSRRQQPPGECLVPQDSSS-----SDP
        QE+  D       +T +P TP              T P P  SP+P        S  PSSS   P+    T       QP  +    Q+S+S     ++P
Subjt:  QEEEDDLF----IYTLVPPTP-------------STDPSPLVSPTP--------SLDPSSSTLVPTCPPITQVYSRRQQPPGECLVPQDSSS-----SDP

Query:  GPSDELPIALRKGKRSCAFPISS--------FVSYDHLSSSTC--------------------------------------------SFVASLDSISIPK
         P+   P +  +       PISS         +S  +  SS+                                             S+  SL + S P+
Subjt:  GPSDELPIALRKGKRSCAFPISS--------FVSYDHLSSSTC--------------------------------------------SFVASLDSISIPK

Query:  TVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAI-GCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSVRLFISMAASQ
        T  +A+    WR AM  E+ A   N TWDLV  P     I GC+W+F  K N DGS+ R KARLVAKGY Q  G+DY +TFSPV K TS+R+ + +A  +
Subjt:  TVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAI-GCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSVRLFISMAASQ

Query:  CWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHS--------------VSTNELRV-
         WP+ QLD+ NAFL G L +EVYM QPPGFV +   D VCRLRK++YGLKQ+PRAW+      L   G   S SD S              V  +++ + 
Subjt:  CWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHS--------------VSTNELRV-

Query:  -----------------FHTKDLGTLKYFLGIEVMR----------------------------------SKKV--------------------------
                         F  K+   L YFLGIE  R                                  S K+                          
Subjt:  -----------------FHTKDLGTLKYFLGIEVMR----------------------------------SKKV--------------------------

Query:  TRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESE
        TRPD++Y+V+ +SQ+M  PT +HW A++++L YL   P  GI  K    + +  +SDADWAG  +D  ST+GY V++G + +SW SKKQ  V RSS E+E
Subjt:  TRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESE

Query:  YRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFTKALNGV
        YR++A +  E+ W+  LL ELG  ++ P  ++CDN  A ++ +NPVFH R KHI +D HF+R ++Q G +   +V T +QL D  TK L+ V
Subjt:  YRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFTKALNGV

Arabidopsis top hits (e value, % identity, alignment)
AT3G03310.1 lecithin:cholesterol acyltransferase 3 (e value: 9.0e-90; % identity: 65.25)
Query:  APGCINDCLLTGLQFVEGFESQFFVSRWTFHQLLVECPSIYEMLANLGFKWHAQPQIQVWKKSSVNG-KTSVDLKSYDPIDSIALFEEALRNNEVKFHGK
        APGCIND +LTG+QFVEG ES FFVSRWT HQLLVECPSIYEM+AN  FKW  QP+I+VW+K S N   TSV+L+S+  I+SI LF +AL+NNE+ + G 
Subjt:  APGCINDCLLTGLQFVEGFESQFFVSRWTFHQLLVECPSIYEMLANLGFKWHAQPQIQVWKKSSVNG-KTSVDLKSYDPIDSIALFEEALRNNEVKFHGK

Query:  TIPLPFNFDILKWAAGTRQVIDSAKLPDGISFYNIYGTSFDTPFDVCYGSESLPIEDLSEICKTLPQYSYVDGDGTVPSESAKADIFEATERVGVAASHR
         I LPFNF IL WAA TR++++ A+LPDG+SFYNIYG S +TPFDVCYG+E+ PI+DLSEIC+T+P+Y+YVDGDGTVP+ESA A  F+A   VGV+ SHR
Subjt:  TIPLPFNFDILKWAAGTRQVIDSAKLPDGISFYNIYGTSFDTPFDVCYGSESLPIEDLSEICKTLPQYSYVDGDGTVPSESAKADIFEATERVGVAASHR

Query:  GLLKDEKVFQHIQKWLGVDQELS--KHLTTSKVVDA
        GLL+DE+VF+ IQ+WLGV+ + +  KHL T KVVD+
Subjt:  GLLKDEKVFQHIQKWLGVDQELS--KHLTTSKVVDA

AT4G19860.1 alpha/beta-Hydrolases superfamily protein (e value: 1.9e-63; % identity: 46.89)
Query:  DIFTKALN---GVRIDYLCAPGCINDCLLTGLQFVEGFESQFFVSRWTFHQLLVECPSIYEMLANLGFKWHAQPQIQVWKKSSVN---GKTSVDLKSYDP
        DIF K +     +   +  APG I   LL G+ FV G+E  FFVS+W+ HQLL+ECPSIYE++    FKW   P +++W++   N   G + V L+SY  
Subjt:  DIFTKALN---GVRIDYLCAPGCINDCLLTGLQFVEGFESQFFVSRWTFHQLLVECPSIYEMLANLGFKWHAQPQIQVWKKSSVN---GKTSVDLKSYDP

Query:  IDSIALFEEALRNNEVKFHGKTIPLPFNFDILKWAAGTRQVIDSAKLPDGISFYNIYGTSFDTPFDVCYGSESLPIEDLSEICKTLPQYSYVDGDGTVPS
        ++S+ +F ++L NN   + G++I LPFN+ I++WA  T+QV+ SAKLP  + FYNIYGT+ +TP  VCYG+E +P++DL+ +    P Y  VDGDGTVP 
Subjt:  IDSIALFEEALRNNEVKFHGKTIPLPFNFDILKWAAGTRQVIDSAKLPDGISFYNIYGTSFDTPFDVCYGSESLPIEDLSEICKTLPQYSYVDGDGTVPS

Query:  ESAKADIFEATERVGVAASHRGLLKDEKVFQHIQKWLGVDQ
        ESA AD  EA  RVGV   HRG+L D +VF+ ++KWL V +
Subjt:  ESAKADIFEATERVGVAASHRGLLKDEKVFQHIQKWLGVDQ

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 8.4e-104; % identity: 40.65)
Query:  ISSFVSYDHLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTY
        IS F+SY+ +S    SF+  +     P T +EA     W  AM +E+ A++   TW++ + P  KK IGCKWV+ IK N DG++ R KARLVAKGY Q  
Subjt:  ISSFVSYDHLSSSTCSFVASLDSISIPKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTY

Query:  GIDYFDTFSPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGEN---DKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGM
        GID+ +TFSPV KLTSV+L ++++A   + LHQLDI NAFL+GDL EE+YM+ PPG+ A QG++   + VC L+KS+YGLKQ+ R WF +FS  L  FG 
Subjt:  GIDYFDTFSPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGEN---DKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGM

Query:  KKSKSDHS----------------------VSTNELRV----------FHTKDLGTLKYFLGIEVMRSK-------------------------------
         +S SDH+                       S N+  V          F  +DLG LKYFLG+E+ RS                                
Subjt:  KKSKSDHS----------------------VSTNELRV----------FHTKDLGTLKYFLGIEVMRSK-------------------------------

Query:  -----------------------------KVTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRS
                                     ++TR DI+++V+ +SQF  +P + H  AV +IL Y+K   G+G+ Y     M+++ FSDA +   ++ RRS
Subjt:  -----------------------------KVTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRS

Query:  TSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREK-IQQG
        T+GYC+F+G +L+SWKSKKQ VVS+SSAE+EYRA++ +  E+MW+ Q   EL   ++ PT L+CDN AA+HIA+N VFHERTKHIE DCH VRE+ + Q 
Subjt:  TSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPTKLWCDNQAALHIASNPVFHERTKHIEIDCHFVREK-IQQG

Query:  LVATGYVKTGEQLGDIFTKALNGV
         ++  +    EQ  D FT+ L+ +
Subjt:  LVATGYVKTGEQLGDIFTKALNGV

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 2.9e-24; % identity: 42.98)
Query:  VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAES
        +TRPDI+Y+V+IV Q M  PT+  +  ++++L Y+K     G+    +  + ++ F D+DWAG    RRST+G+C F+G N++SW +K+Q  VSRSS E+
Subjt:  VTRPDIAYSVSIVSQFMSSPTVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAES

Query:  EYRAMAQSVCEIMW
        EYRA+A +  E+ W
Subjt:  EYRAMAQSVCEIMW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 1.0e-21; % identity: 48.98)
Query:  PKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSVRLFISMA
        PK+V  AL  PGW  AM EE+ AL  N TW LV  P  +  +GCKWVF  K++ DG++ RLKARLVAKG+ Q  GI + +T+SPV +  ++R  +++A
Subjt:  PKTVHEALSHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSVRLFISMA


Sequences
CDS sequence
ATGCCTTCATCTGTTCTTAAAGGTCAAATTCCTTATCATGTTCTATTCCCCACAAAACCGTTGTTTCCTATTGAGCCGAAAATATTTGGTTGTACTTGTTTTGTTAGAGA
TGTTCGTCCTAATCTAACAAAATTAGACCCCAAATCTTTAAAGTGCATTTTCTTAGGCTATTCTCGTGTTCAAAAAGGTTATAGATGTTATTGTCCTAGTTCGAATAGGT
ATCTTGTATCTCGTGATGTTACTTTCTTTGAGGACACGCCTTTCAAGTCATCTTTGCCTAGTACGAGTCAGGAGGAGGAGGATGATCTTTTTATTTATACGCTTGTACCT
CCTACACCCTCTACCGATCCATCTCCACTCGTGTCTCCTACACCCTCTCTCGATCCTTCTAGTTCGACACTTGTTCCTACTTGTCCGCCTATTACTCAAGTATACTCTAG
ACGGCAACAACCTCCAGGTGAATGTCTTGTACCACAAGATTCTTCGTCATCGGATCCAGGACCGAGTGATGAGCTTCCTATTGCTCTTCGTAAAGGTAAACGTTCTTGCG
CTTTTCCTATTTCTTCATTTGTCTCTTATGACCACTTGTCATCTTCTACATGTTCTTTTGTTGCATCTCTTGACTCTATCTCGATTCCTAAAACGGTTCATGAAGCCTTG
TCTCATCCTGGTTGGCGCAATGCGATGATAGAAGAGATGACTGCCTTAGATGATAATGGTACTTGGGATTTAGTATCTCGTCCAGAAGGAAAGAAAGCTATCGGGTGTAA
ATGGGTGTTTGCAATTAAGGTAAATCCTGATGGTTCAGTTGCCCGATTGAAAGCACGCCTAGTTGCCAAAGGCTATGCTCAGACATATGGGATTGATTATTTTGATACGT
TTTCTCCTGTTGCTAAATTAACTTCTGTAAGGTTATTCATTTCCATGGCTGCTTCTCAATGTTGGCCTTTGCATCAGCTTGACATTAAAAATGCTTTTCTCCATGGTGAT
CTTCAAGAGGAAGTCTATATGGAGCAACCACCTGGGTTTGTTGCTCAGGGGGAGAATGATAAAGTGTGTCGTCTTCGTAAATCTTTATATGGGTTGAAACAAAGTCCACG
AGCATGGTTTGGAAGATTTAGTCAGGCACTTGAGCAGTTTGGAATGAAGAAAAGCAAGTCAGATCACTCTGTTTCTACCAACGAACTAAGAGTGTTTCATACTAAAGACT
TGGGAACATTAAAATACTTCTTGGGTATCGAAGTAATGAGAAGCAAGAAAGTGACACGACCAGACATAGCTTATTCAGTAAGCATTGTGAGTCAATTTATGTCTTCACCC
ACAGTGGAACATTGGGCAGCAGTACAACAGATTTTATGTTATTTGAAGGTTGCACCTGGACGTGGGATCCTATATAAAGACCATGGTCATATGAAGATCGAGTGTTTTTC
AGATGCTGATTGGGCAGGATCAAGAGAAGATAGAAGATCAACTTCTGGCTATTGTGTTTTTGTTGGAGGTAATTTGGTTTCATGGAAAAGTAAGAAACAAAATGTGGTTT
CACGATCAAGTGCTGAATCAGAATACAGAGCTATGGCACAATCAGTATGTGAGATAATGTGGGTTCATCAACTCTTGATCGAATTAGGTTTCAATGTTACAGTTCCGACC
AAGTTGTGGTGTGACAACCAAGCTGCCCTTCATATTGCATCCAACCCAGTATTTCATGAGCGTACTAAACATATTGAAATCGATTGTCATTTTGTTCGGGAGAAAATACA
GCAAGGTTTGGTAGCTACTGGATATGTGAAGACCGGAGAGCAATTAGGAGATATTTTCACGAAAGCTTTAAATGGAGTCAGAATAGATTATCTTTGTGCACCAGGATGCA
TCAATGATTGTCTTTTGACTGGACTGCAATTTGTTGAAGGCTTTGAAAGCCAATTTTTTGTATCTAGATGGACATTCCACCAGCTGTTGGTTGAATGTCCTTCAATTTAT
GAGATGCTGGCAAATTTAGGATTCAAATGGCATGCACAACCACAGATCCAAGTTTGGAAAAAGAGTTCTGTCAACGGGAAAACTTCTGTTGATTTGAAGTCGTATGACCC
AATTGATAGTATTGCTTTATTTGAAGAAGCATTAAGAAATAACGAGGTAAAATTTCATGGAAAGACCATTCCACTGCCCTTTAACTTTGATATTCTCAAATGGGCTGCTG
GTACACGCCAAGTAATTGATAGCGCAAAACTACCAGATGGAATCTCCTTCTATAATATTTATGGAACATCATTTGATACACCTTTCGATGTGTGCTATGGCTCAGAGTCA
TTACCGATTGAGGACTTGTCTGAAATATGCAAAACTTTGCCTCAGTATTCTTATGTGGATGGAGATGGTACAGTTCCCAGTGAATCAGCCAAGGCTGATATTTTTGAAGC
AACTGAGAGAGTGGGAGTCGCAGCGTCTCATCGGGGACTTTTGAAGGACGAAAAAGTGTTTCAACATATCCAGAAGTGGTTGGGAGTTGATCAGGAGCTCAGTAAACACC
TTACAACTTCCAAAGTGGTCGATGCTTCTCTGTAG
mRNA sequence
ATGCCTTCATCTGTTCTTAAAGGTCAAATTCCTTATCATGTTCTATTCCCCACAAAACCGTTGTTTCCTATTGAGCCGAAAATATTTGGTTGTACTTGTTTTGTTAGAGA
TGTTCGTCCTAATCTAACAAAATTAGACCCCAAATCTTTAAAGTGCATTTTCTTAGGCTATTCTCGTGTTCAAAAAGGTTATAGATGTTATTGTCCTAGTTCGAATAGGT
ATCTTGTATCTCGTGATGTTACTTTCTTTGAGGACACGCCTTTCAAGTCATCTTTGCCTAGTACGAGTCAGGAGGAGGAGGATGATCTTTTTATTTATACGCTTGTACCT
CCTACACCCTCTACCGATCCATCTCCACTCGTGTCTCCTACACCCTCTCTCGATCCTTCTAGTTCGACACTTGTTCCTACTTGTCCGCCTATTACTCAAGTATACTCTAG
ACGGCAACAACCTCCAGGTGAATGTCTTGTACCACAAGATTCTTCGTCATCGGATCCAGGACCGAGTGATGAGCTTCCTATTGCTCTTCGTAAAGGTAAACGTTCTTGCG
CTTTTCCTATTTCTTCATTTGTCTCTTATGACCACTTGTCATCTTCTACATGTTCTTTTGTTGCATCTCTTGACTCTATCTCGATTCCTAAAACGGTTCATGAAGCCTTG
TCTCATCCTGGTTGGCGCAATGCGATGATAGAAGAGATGACTGCCTTAGATGATAATGGTACTTGGGATTTAGTATCTCGTCCAGAAGGAAAGAAAGCTATCGGGTGTAA
ATGGGTGTTTGCAATTAAGGTAAATCCTGATGGTTCAGTTGCCCGATTGAAAGCACGCCTAGTTGCCAAAGGCTATGCTCAGACATATGGGATTGATTATTTTGATACGT
TTTCTCCTGTTGCTAAATTAACTTCTGTAAGGTTATTCATTTCCATGGCTGCTTCTCAATGTTGGCCTTTGCATCAGCTTGACATTAAAAATGCTTTTCTCCATGGTGAT
CTTCAAGAGGAAGTCTATATGGAGCAACCACCTGGGTTTGTTGCTCAGGGGGAGAATGATAAAGTGTGTCGTCTTCGTAAATCTTTATATGGGTTGAAACAAAGTCCACG
AGCATGGTTTGGAAGATTTAGTCAGGCACTTGAGCAGTTTGGAATGAAGAAAAGCAAGTCAGATCACTCTGTTTCTACCAACGAACTAAGAGTGTTTCATACTAAAGACT
TGGGAACATTAAAATACTTCTTGGGTATCGAAGTAATGAGAAGCAAGAAAGTGACACGACCAGACATAGCTTATTCAGTAAGCATTGTGAGTCAATTTATGTCTTCACCC
ACAGTGGAACATTGGGCAGCAGTACAACAGATTTTATGTTATTTGAAGGTTGCACCTGGACGTGGGATCCTATATAAAGACCATGGTCATATGAAGATCGAGTGTTTTTC
AGATGCTGATTGGGCAGGATCAAGAGAAGATAGAAGATCAACTTCTGGCTATTGTGTTTTTGTTGGAGGTAATTTGGTTTCATGGAAAAGTAAGAAACAAAATGTGGTTT
CACGATCAAGTGCTGAATCAGAATACAGAGCTATGGCACAATCAGTATGTGAGATAATGTGGGTTCATCAACTCTTGATCGAATTAGGTTTCAATGTTACAGTTCCGACC
AAGTTGTGGTGTGACAACCAAGCTGCCCTTCATATTGCATCCAACCCAGTATTTCATGAGCGTACTAAACATATTGAAATCGATTGTCATTTTGTTCGGGAGAAAATACA
GCAAGGTTTGGTAGCTACTGGATATGTGAAGACCGGAGAGCAATTAGGAGATATTTTCACGAAAGCTTTAAATGGAGTCAGAATAGATTATCTTTGTGCACCAGGATGCA
TCAATGATTGTCTTTTGACTGGACTGCAATTTGTTGAAGGCTTTGAAAGCCAATTTTTTGTATCTAGATGGACATTCCACCAGCTGTTGGTTGAATGTCCTTCAATTTAT
GAGATGCTGGCAAATTTAGGATTCAAATGGCATGCACAACCACAGATCCAAGTTTGGAAAAAGAGTTCTGTCAACGGGAAAACTTCTGTTGATTTGAAGTCGTATGACCC
AATTGATAGTATTGCTTTATTTGAAGAAGCATTAAGAAATAACGAGGTAAAATTTCATGGAAAGACCATTCCACTGCCCTTTAACTTTGATATTCTCAAATGGGCTGCTG
GTACACGCCAAGTAATTGATAGCGCAAAACTACCAGATGGAATCTCCTTCTATAATATTTATGGAACATCATTTGATACACCTTTCGATGTGTGCTATGGCTCAGAGTCA
TTACCGATTGAGGACTTGTCTGAAATATGCAAAACTTTGCCTCAGTATTCTTATGTGGATGGAGATGGTACAGTTCCCAGTGAATCAGCCAAGGCTGATATTTTTGAAGC
AACTGAGAGAGTGGGAGTCGCAGCGTCTCATCGGGGACTTTTGAAGGACGAAAAAGTGTTTCAACATATCCAGAAGTGGTTGGGAGTTGATCAGGAGCTCAGTAAACACC
TTACAACTTCCAAAGTGGTCGATGCTTCTCTGTAG
Protein sequence
MPSSVLKGQIPYHVLFPTKPLFPIEPKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSSNRYLVSRDVTFFEDTPFKSSLPSTSQEEEDDLFIYTLVP
PTPSTDPSPLVSPTPSLDPSSSTLVPTCPPITQVYSRRQQPPGECLVPQDSSSSDPGPSDELPIALRKGKRSCAFPISSFVSYDHLSSSTCSFVASLDSISIPKTVHEAL
SHPGWRNAMIEEMTALDDNGTWDLVSRPEGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSVRLFISMAASQCWPLHQLDIKNAFLHGD
LQEEVYMEQPPGFVAQGENDKVCRLRKSLYGLKQSPRAWFGRFSQALEQFGMKKSKSDHSVSTNELRVFHTKDLGTLKYFLGIEVMRSKKVTRPDIAYSVSIVSQFMSSP
TVEHWAAVQQILCYLKVAPGRGILYKDHGHMKIECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIMWVHQLLIELGFNVTVPT
KLWCDNQAALHIASNPVFHERTKHIEIDCHFVREKIQQGLVATGYVKTGEQLGDIFTKALNGVRIDYLCAPGCINDCLLTGLQFVEGFESQFFVSRWTFHQLLVECPSIY
EMLANLGFKWHAQPQIQVWKKSSVNGKTSVDLKSYDPIDSIALFEEALRNNEVKFHGKTIPLPFNFDILKWAAGTRQVIDSAKLPDGISFYNIYGTSFDTPFDVCYGSES
LPIEDLSEICKTLPQYSYVDGDGTVPSESAKADIFEATERVGVAASHRGLLKDEKVFQHIQKWLGVDQELSKHLTTSKVVDASL