; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0019562 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0019562
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag-pol polyprotein
Genome locationchr01:24308234..24312347
RNA-Seq ExpressionIVF0019562
SyntenyIVF0019562
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026117.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.082.88Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDS-----RDDDG-----NINAFT----------VRITDKNTDDDSECSVESKNDELSIEKLETL
        CDIEANIDESIALLTKQITNALQNLKNPNVT      S     R+D+G     N N+            VRITDKNTDDDSECSVESKNDELSIEKLETL
Subjt:  CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDS-----RDDDG-----NINAFT----------VRITDKNTDDDSECSVESKNDELSIEKLETL

Query:  WKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRV
        WKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRV
Subjt:  WKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRV

Query:  EYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGY
        EYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIR      SYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVR                    
Subjt:  EYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGY

Query:  KVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHK
                                                             KLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQK     
Subjt:  KVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHK

Query:  SLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDE
                           SPMQTESLGGKRYVLVVVDDYSRYTW  +     G+     +N   +               +VINDLDSDIKQMNDEEDE
Subjt:  SLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDE

Query:  TPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM--VADLRNNVWTLVSKPEGVN
        TPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM  +   RNNVWTLVSKPEGVN
Subjt:  TPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM--VADLRNNVWTLVSKPEGVN

Query:  VIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHV
        VIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHV
Subjt:  VIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHV

Query:  YKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKND
        YKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLV QIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKND
Subjt:  YKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKND

Query:  DIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSD
        DIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSD
Subjt:  DIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSD

Query:  FRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEG
        FRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEG
Subjt:  FRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEG

KAA0035966.1 F5J5.1 [Cucumis melo var. makuwa]0.069.24Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        MTEDESVSDYNKRVL+IANESLLLGEKIPDSKIVRKVLRS+ RKFDMKVTAIE+AHDITTLKLDELFGSLLTFEMATADRESKKGKGI+FKSTHV++E D
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDESIALLTKQITNALQNLKNPNVT------------------DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWK
        CD +AN+DESIALLTKQ TNAL+NLKNPN T                  DEE VDSRDDDGNINAF +RITD+NTDD+SECSVESKNDELSIEKL+TLWK
Subjt:  CDIEANIDESIALLTKQITNALQNLKNPNVT------------------DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWK

Query:  EDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEY
        EDCEAR IQKE IQDLLEENE LMSVISSLKLKLREVQNENDQILKSVKMLN G +NLDSILK GHNGS RYGLGFV+SASSSKATSEIKFVPASMRVEY
Subjt:  EDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEY

Query:  DTIHTEAGIRTPVKYLGRTCYYCGRKGHIR----------------------------------------------------------HMTGNRSYFTNL
        DTIH E GIR  VK LGRT YYCG+KGHIR                                                          HMTGNRSYF NL
Subjt:  DTIHTEAGIRTPVKYLGRTCYYCGRKGHIR----------------------------------------------------------HMTGNRSYFTNL

Query:  NDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIR
        NDCV GHVTFGDGAK KIIAKGNI+KDDLPRLNDVRYVDGLKANLISI+QLCDQGYKVSFDDIG V MNKENQICMSGKRQADNCYHWNSNM DTC+LIR
Subjt:  NDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIR

Query:  SDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYT
        SDQTWLWHRKL H SMRGLEKVIKNKAVV I DLDVNGNFFCGDCQI                                     GKR+VLVVVDDY+RYT
Subjt:  SDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYT

Query:  WKWDARS------------------------------EQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVE
        W +  +                               EQGIFLGYSQNS+AYRVYNNRS SVM+TINVVINDLDS IKQMNDEEDETPNM EVRT SIVE
Subjt:  WKWDARS------------------------------EQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVE

Query:  ESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETG
        ESKADNSS+GP                             ASSIIG+PS GMQTRRKDKIDYLKMVADL    +    +P  V+ +     + N + E  
Subjt:  ESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETG

Query:  CVTK--NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAP
           +  N   LVAQGYTQVEGVDFDETFA VARL+AIRLLLGISCIQKFKLYQMDVKSAFLN YLN EVYVAQPKGFVD EHPKHVYKLNKALYGLKQAP
Subjt:  CVTK--NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAP

Query:  RAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMV
        RAWY+RLTVYLRGRGY R EIDKT FIHRKSDQLLV QIYVDDIIFGGFP DLVNNFINIMQSEFEMS VGELSCFLGLQIKQKND IFISQEK      
Subjt:  RAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMV

Query:  KKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLV
                                            SIVG+LLYLT SR DIAY VGICA YQ DPRITHLE VKRILKYVHG SDF MMYSY+TTPTLV
Subjt:  KKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLV

Query:  GYCDADWAGSANDRK
        GY D DWAGS +DRK
Subjt:  GYCDADWAGSANDRK

KAA0053200.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.063.36Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        M E+ESVSDYNKRVLEI NESLLL EKIPDSKIV+KVLRSLPRKFDMKV AIE+AHDITTLKLDELFG LLTFEMATAD E+KK KGIAFKSTHVSEE  
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDE----------SIALLTKQITNAL---------------------QNLKNPNVT--DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVES
        CD EAN+DE             L  K+    +                     +  KN  VT  DEE VDSRDD GNINAFT+RITD+NTDDDSECSVES
Subjt:  CDIEANIDE----------SIALLTKQITNAL---------------------QNLKNPNVT--DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVES

Query:  KNDELSIEKLETLWKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKA
        KNDEL IEKLETLWKEDCEAR IQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKS KMLN GTENLDSILKAGHNGSHR+GLGFVASASSSKA
Subjt:  KNDELSIEKLETLWKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKA

Query:  TSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKA
        TSEIKFVPASMRVEYDTIH E GIRTP+K  GRTCYYCGRKGHIR                            KIIAK NID DDLPRLNDVRYVDGLKA
Subjt:  TSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKA

Query:  NLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCG
        NLIS+SQLCDQGYKVSFDDIG                               C LIRSDQTWLWHRKLGHVS+RGL KVIKNKAVV I DLDVNGNFF G
Subjt:  NLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCG

Query:  DCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDL
        DCQIGK++                             ESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSR Y VYNNRS SVM+TINVVINDL
Subjt:  DCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDL

Query:  DSDIKQMNDEEDETPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNV
        DS IKQMND++        +RT    E          P   ++ +    +     LI       N    +++   S G   + +       ++   RN +
Subjt:  DSDIKQMNDEEDETPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNV

Query:  WTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPK
        WTLVSK EGVNVIGTKWIFKNKID  GCVTKNKARLVAQGYTQVEGVDFDETFAPVARL+ IRLLLGISCIQKFKLYQMD+KS FLN YLNEEVYVAQPK
Subjt:  WTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPK

Query:  GFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSC
        GFVDSEHPKHVYKLNKALYGLKQA +AWYDRLT+                                                      EFEMS VGELSC
Subjt:  GFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSC

Query:  FLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVK
        FLGLQIKQKND IFISQEKYA++MVKKF LEQARNKRT AATHVKLTKDTEG EVDHKLYR                            +PRITHLEAVK
Subjt:  FLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVK

Query:  RILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLEYGFDQDIM
        RILKYVH TSDF MMYSYDTT TLVGYCDADWAGSA+D KSTS G                                SGCTQLIWMKNML EYGFDQD M
Subjt:  RILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLEYGFDQDIM

TYK11575.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.085.6Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWKEDCEARAIQKERIQDLLE
        CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWKEDCEARAIQKERIQDLLE
Subjt:  CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWKEDCEARAIQKERIQDLLE

Query:  ENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGR
        ENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGR
Subjt:  ENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGR

Query:  TCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICM
        TCYYCGRKGHIR      SYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVR                                        
Subjt:  TCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICM

Query:  SGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMS
                                         KLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQK                        S
Subjt:  SGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMS

Query:  PMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVEESKADN
        PMQTESLGGKRYVLVVVDDYSRYTW  +     G+     +N   +               +VINDLDSDIKQMNDEEDETPNMYEVRTTSIVEESKADN
Subjt:  PMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVEESKADN

Query:  SSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM--VADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTK
        SSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM  +   RNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTK
Subjt:  SSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM--VADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTK

Query:  NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDR
        NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDR
Subjt:  NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDR

Query:  LTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLE
        LT YLRGRGYFREEIDKTFFIHRKSDQLLV QIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLE
Subjt:  LTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLE

Query:  QARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDAD
        QARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDAD
Subjt:  QARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDAD

Query:  WAGSANDRKSTSEG
        WAGSANDRKSTSEG
Subjt:  WAGSANDRKSTSEG

TYK30437.1 F5J5.1 [Cucumis melo var. makuwa]0.069.24Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        MTEDESVSDYNKRVL+IANESLLLGEKIPDSKIVRKVLRS+ RKFDMKVTAIE+AHDITTLKLDELFGSLLTFEMATADRESKKGKGI+FKSTHV++E D
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDESIALLTKQITNALQNLKNPNVT------------------DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWK
        CD +AN+DESIALLTKQ TNAL+NLKNPN T                  DEE VDSRDDDGNINAF +RITD+NTDD+SECSVESKNDELSIEKL+TLWK
Subjt:  CDIEANIDESIALLTKQITNALQNLKNPNVT------------------DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWK

Query:  EDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEY
        EDCEAR IQKE IQDLLEENE LMSVISSLKLKLREVQNENDQILKSVKMLN G +NLDSILK GHNGS RYGLGFV+SASSSKATSEIKFVPASMRVEY
Subjt:  EDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEY

Query:  DTIHTEAGIRTPVKYLGRTCYYCGRKGHIR----------------------------------------------------------HMTGNRSYFTNL
        DTIH E GIR  VK LGRT YYCG+KGHIR                                                          HMTGNRSYF NL
Subjt:  DTIHTEAGIRTPVKYLGRTCYYCGRKGHIR----------------------------------------------------------HMTGNRSYFTNL

Query:  NDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIR
        NDCV GHVTFGDGAK KIIAKGNI+KDDLPRLNDVRYVDGLKANLISI+QLCDQGYKVSFDDIG V MNKENQICMSGKRQADNCYHWNSNM DTC+LIR
Subjt:  NDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIR

Query:  SDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYT
        SDQTWLWHRKL H SMRGLEKVIKNKAVV I DLDVNGNFFCGDCQI                                     GKR+VLVVVDDY+RYT
Subjt:  SDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYT

Query:  WKWDARS------------------------------EQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVE
        W +  +                               EQGIFLGYSQNS+AYRVYNNRS SVM+TINVVINDLDS IKQMNDEEDETPNM EVRT SIVE
Subjt:  WKWDARS------------------------------EQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVE

Query:  ESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETG
        ESKADNSS+GP                             ASSIIG+PS GMQTRRKDKIDYLKMVADL    +    +P  V+ +     + N + E  
Subjt:  ESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETG

Query:  CVTK--NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAP
           +  N   LVAQGYTQVEGVDFDETFA VARL+AIRLLLGISCIQKFKLYQMDVKSAFLN YLN EVYVAQPKGFVD EHPKHVYKLNKALYGLKQAP
Subjt:  CVTK--NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAP

Query:  RAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMV
        RAWY+RLTVYLRGRGY R EIDKT FIHRKSDQLLV QIYVDDIIFGGFP DLVNNFINIMQSEFEMS VGELSCFLGLQIKQKND IFISQEK      
Subjt:  RAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMV

Query:  KKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLV
                                            SIVG+LLYLT SR DIAY VGICA YQ DPRITHLE VKRILKYVHG SDF MMYSY+TTPTLV
Subjt:  KKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLV

Query:  GYCDADWAGSANDRK
        GY D DWAGS +DRK
Subjt:  GYCDADWAGSANDRK

TrEMBL top hitse value%identityAlignment
A0A5A7SN07 Gag-pol polyprotein0.0e+0082.69Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDS-----RDDDG---------------NINAFTVRITDKNTDDDSECSVESKNDELSIEKLETL
        CDIEANIDESIALLTKQITNALQNLKNPNVT      S     R+D+G                +    VRITDKNTDDDSECSVESKNDELSIEKLETL
Subjt:  CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDS-----RDDDG---------------NINAFTVRITDKNTDDDSECSVESKNDELSIEKLETL

Query:  WKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRV
        WKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRV
Subjt:  WKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRV

Query:  EYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGY
        EYDTIHTEAGIRTPVKYLGRTCYYCGRKGHI      RSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDV                     
Subjt:  EYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGY

Query:  KVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHK
                                                            RKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQK     
Subjt:  KVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHK

Query:  SLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDE
                           SPMQTESLGGKRYVLVVVDDYSRYTW  +                      N      K   +VINDLDSDIKQMNDEEDE
Subjt:  SLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDE

Query:  TPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM--VADLRNNVWTLVSKPEGVN
        TPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM  +   RNNVWTLVSKPEGVN
Subjt:  TPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM--VADLRNNVWTLVSKPEGVN

Query:  VIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHV
        VIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHV
Subjt:  VIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHV

Query:  YKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKND
        YKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLV QIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKND
Subjt:  YKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKND

Query:  DIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSD
        DIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSD
Subjt:  DIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSD

Query:  FRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEG
        FRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEG
Subjt:  FRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEG

A0A5A7T169 F5J5.10.0e+0069.24Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        MTEDESVSDYNKRVL+IANESLLLGEKIPDSKIVRKVLRS+ RKFDMKVTAIE+AHDITTLKLDELFGSLLTFEMATADRESKKGKGI+FKSTHV++E D
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDESIALLTKQITNALQNLKNPNVT------------------DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWK
        CD +AN+DESIALLTKQ TNAL+NLKNPN T                  DEE VDSRDDDGNINAF +RITD+NTDD+SECSVESKNDELSIEKL+TLWK
Subjt:  CDIEANIDESIALLTKQITNALQNLKNPNVT------------------DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWK

Query:  EDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEY
        EDCEAR IQKE IQDLLEENE LMSVISSLKLKLREVQNENDQILKSVKMLN G +NLDSILK GHNGS RYGLGFV+SASSSKATSEIKFVPASMRVEY
Subjt:  EDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEY

Query:  DTIHTEAGIRTPVKYLGRTCYYCGRKGHI----------------------------------------------------------RHMTGNRSYFTNL
        DTIH E GIR  VK LGRT YYCG+KGHI                                                          RHMTGNRSYF NL
Subjt:  DTIHTEAGIRTPVKYLGRTCYYCGRKGHI----------------------------------------------------------RHMTGNRSYFTNL

Query:  NDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIR
        NDCV GHVTFGDGAK KIIAKGNI+KDDLPRLNDVRYVDGLKANLISI+QLCDQGYKVSFDDIG V MNKENQICMSGKRQADNCYHWNSNM DTC+LIR
Subjt:  NDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIR

Query:  SDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYT
        SDQTWLWHRKL H SMRGLEKVIKNKAVV I DLDVNGNFFCGDCQI                                     GKR+VLVVVDDY+RYT
Subjt:  SDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYT

Query:  WKWDARS------------------------------EQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVE
        W +  +                               EQGIFLGYSQNS+AYRVYNNRS SVM+TINVVINDLDS IKQMNDEEDETPNM EVRT SIVE
Subjt:  WKWDARS------------------------------EQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVE

Query:  ESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETG
        ESKADNSS+G                             PASSIIG+PS GMQTRRKDKIDYLKMVADL    +    +P  V+ +     + N + E  
Subjt:  ESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETG

Query:  CVTK--NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAP
           +  N   LVAQGYTQVEGVDFDETFA VARL+AIRLLLGISCIQKFKLYQMDVKSAFLN YLN EVYVAQPKGFVD EHPKHVYKLNKALYGLKQAP
Subjt:  CVTK--NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAP

Query:  RAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMV
        RAWY+RLTVYLRGRGY R EIDKT FIHRKSDQLLV QIYVDDIIFGGFP DLVNNFINIMQSEFEMS VGELSCFLGLQIKQKND IFISQEK      
Subjt:  RAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMV

Query:  KKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLV
                                            SIVG+LLYLT SR DIAY VGICA YQ DPRITHLE VKRILKYVHG SDF MMYSY+TTPTLV
Subjt:  KKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLV

Query:  GYCDADWAGSANDRK
        GY D DWAGS +DRK
Subjt:  GYCDADWAGSANDRK

A0A5A7UDB5 Gag-pol polyprotein0.0e+0063.45Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        M E+ESVSDYNKRVLEI NESLLL EKIPDSKIV+KVLRSLPRKFDMKV AIE+AHDITTLKLDELFG LLTFEMATAD E+KK KGIAFKSTHVSEE  
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDE----------SIALLTKQ---------------------ITNALQNLKNPNVT--DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVES
        CD EAN+DE             L  K+                      T   +  KN  VT  DEE VDSRDD GNINAFT+RITD+NTDDDSECSVES
Subjt:  CDIEANIDE----------SIALLTKQ---------------------ITNALQNLKNPNVT--DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVES

Query:  KNDELSIEKLETLWKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKA
        KNDEL IEKLETLWKEDCEAR IQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKS KMLN GTENLDSILKAGHNGSHR+GLGFVASASSSKA
Subjt:  KNDELSIEKLETLWKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKA

Query:  TSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKA
        TSEIKFVPASMRVEYDTIH E GIRTP+K  GRTCYYCGRKGHIR                            KIIAK NID DDLPRLNDVRYVDGLKA
Subjt:  TSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKA

Query:  NLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCG
        NLIS+SQLCDQGYKVSFDDIG                               C LIRSDQTWLWHRKLGHVS+RGL KVIKNKAVV I DLDVNGNFF G
Subjt:  NLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCG

Query:  DCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDL
        DCQIGK++                             ESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSR Y VYNNRS SVM+TINVVINDL
Subjt:  DCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDL

Query:  DSDIKQMNDEEDETPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNV
        DS IKQMND++        +RT    E          P   ++ +    +     LI       N    +++   S G   +        +++   RN +
Subjt:  DSDIKQMNDEEDETPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNV

Query:  WTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPK
        WTLVSK EGVNVIGTKWIFKNKID  GCVTKNKARLVAQGYTQVEGVDFDETFAPVARL+ IRLLLGISCIQKFKLYQMD+KS FLN YLNEEVYVAQPK
Subjt:  WTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPK

Query:  GFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSC
        GFVDSEHPKHVYKLNKALYGLKQA +AWYDRLT+                                                      EFEMS VGELSC
Subjt:  GFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSC

Query:  FLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVK
        FLGLQIKQKND IFISQEKYA++MVKKF LEQARNKRT AATHVKLTKDTEG EVDHKLYR                            +PRITHLEAVK
Subjt:  FLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVK

Query:  RILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLEYGFDQDIM
        RILKYVH TSDF MMYSYDTT TLVGYCDADWAGSA+D KSTS G                                SGCTQLIWMKNML EYGFDQD M
Subjt:  RILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLEYGFDQDIM

A0A5D3CJ17 Gag-pol polyprotein0.0e+0085.6Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWKEDCEARAIQKERIQDLLE
        CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWKEDCEARAIQKERIQDLLE
Subjt:  CDIEANIDESIALLTKQITNALQNLKNPNVTDEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWKEDCEARAIQKERIQDLLE

Query:  ENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGR
        ENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGR
Subjt:  ENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGR

Query:  TCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICM
        TCYYCGRKGHI      RSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDV                                         
Subjt:  TCYYCGRKGHIRHMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICM

Query:  SGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMS
                                        RKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQK                        S
Subjt:  SGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMS

Query:  PMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVEESKADN
        PMQTESLGGKRYVLVVVDDYSRYTW  +                      N      K   +VINDLDSDIKQMNDEEDETPNMYEVRTTSIVEESKADN
Subjt:  PMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVEESKADN

Query:  SSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM--VADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTK
        SSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM  +   RNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTK
Subjt:  SSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKM--VADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTK

Query:  NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDR
        NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDR
Subjt:  NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDR

Query:  LTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLE
        LT YLRGRGYFREEIDKTFFIHRKSDQLLV QIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLE
Subjt:  LTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLE

Query:  QARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDAD
        QARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDAD
Subjt:  QARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDAD

Query:  WAGSANDRKSTSEG
        WAGSANDRKSTSEG
Subjt:  WAGSANDRKSTSEG

A0A5D3E2Y4 F5J5.10.0e+0069.24Show/hide
Query:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD
        MTEDESVSDYNKRVL+IANESLLLGEKIPDSKIVRKVLRS+ RKFDMKVTAIE+AHDITTLKLDELFGSLLTFEMATADRESKKGKGI+FKSTHV++E D
Subjt:  MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVD

Query:  CDIEANIDESIALLTKQITNALQNLKNPNVT------------------DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWK
        CD +AN+DESIALLTKQ TNAL+NLKNPN T                  DEE VDSRDDDGNINAF +RITD+NTDD+SECSVESKNDELSIEKL+TLWK
Subjt:  CDIEANIDESIALLTKQITNALQNLKNPNVT------------------DEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWK

Query:  EDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEY
        EDCEAR IQKE IQDLLEENE LMSVISSLKLKLREVQNENDQILKSVKMLN G +NLDSILK GHNGS RYGLGFV+SASSSKATSEIKFVPASMRVEY
Subjt:  EDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQNENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEY

Query:  DTIHTEAGIRTPVKYLGRTCYYCGRKGHI----------------------------------------------------------RHMTGNRSYFTNL
        DTIH E GIR  VK LGRT YYCG+KGHI                                                          RHMTGNRSYF NL
Subjt:  DTIHTEAGIRTPVKYLGRTCYYCGRKGHI----------------------------------------------------------RHMTGNRSYFTNL

Query:  NDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIR
        NDCV GHVTFGDGAK KIIAKGNI+KDDLPRLNDVRYVDGLKANLISI+QLCDQGYKVSFDDIG V MNKENQICMSGKRQADNCYHWNSNM DTC+LIR
Subjt:  NDCVTGHVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIR

Query:  SDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYT
        SDQTWLWHRKL H SMRGLEKVIKNKAVV I DLDVNGNFFCGDCQI                                     GKR+VLVVVDDY+RYT
Subjt:  SDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYT

Query:  WKWDARS------------------------------EQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVE
        W +  +                               EQGIFLGYSQNS+AYRVYNNRS SVM+TINVVINDLDS IKQMNDEEDETPNM EVRT SIVE
Subjt:  WKWDARS------------------------------EQGIFLGYSQNSRAYRVYNNRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVE

Query:  ESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETG
        ESKADNSS+G                             PASSIIG+PS GMQTRRKDKIDYLKMVADL    +    +P  V+ +     + N + E  
Subjt:  ESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETG

Query:  CVTK--NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAP
           +  N   LVAQGYTQVEGVDFDETFA VARL+AIRLLLGISCIQKFKLYQMDVKSAFLN YLN EVYVAQPKGFVD EHPKHVYKLNKALYGLKQAP
Subjt:  CVTK--NKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAP

Query:  RAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMV
        RAWY+RLTVYLRGRGY R EIDKT FIHRKSDQLLV QIYVDDIIFGGFP DLVNNFINIMQSEFEMS VGELSCFLGLQIKQKND IFISQEK      
Subjt:  RAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMV

Query:  KKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLV
                                            SIVG+LLYLT SR DIAY VGICA YQ DPRITHLE VKRILKYVHG SDF MMYSY+TTPTLV
Subjt:  KKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLV

Query:  GYCDADWAGSANDRK
        GY D DWAGS +DRK
Subjt:  GYCDADWAGSANDRK

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.5e-6631.76Show/hide
Query:  DEEDETPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQT------RRKDKIDY------------L
        +E   + N  E R +   E  K +   D P    K    EIIN++SE + +  +   +   + + +      T         D+I Y             
Subjt:  DEEDETPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQT------RRKDKIDY------------L

Query:  KMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYL
        ++ A   NN WT+  +PE  N++ ++W+F  K +E G   + KARLVA+G+TQ   +D++ETFAPVAR+ + R +L +      K++QMDVK+AFLN  L
Subjt:  KMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYL

Query:  NEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKS--DQLLVDQIYVDDIIFGGFPYDLVNNFINIMQS
         EE+Y+  P+G   S +  +V KLNKA+YGLKQA R W++     L+   +    +D+  +I  K   ++ +   +YVDD++        +NNF   +  
Subjt:  NEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKS--DQLLVDQIYVDDIIFGGFPYDLVNNFINIMQS

Query:  EFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVD-HKLYRSIVGNLLY-LTTSRPDIAYVVGICAC
        +F M+ + E+  F+G++I+ + D I++SQ  Y K ++ KF +E      T   +  K+  +   ++ D +   RS++G L+Y +  +RPD+   V I + 
Subjt:  EFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVD-HKLYRSIVGNLLY-LTTSRPDIAYVVGICAC

Query:  YQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTT--PTLVGYCDADWAGSANDRKSTSEGCFFLGN-NLISWLSKKQNRVSLSTAEAEYIAASSGCTQL
        Y +       + +KR+L+Y+ GT D ++++  +      ++GY D+DWAGS  DRKST+   F + + NLI W +K+QN V+ S+ EAEY+A      + 
Subjt:  YQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTT--PTLVGYCDADWAGSANDRKSTSEGCFFLGN-NLISWLSKKQNRVSLSTAEAEYIAASSGCTQL

Query:  IWMKNML
        +W+K +L
Subjt:  IWMKNML

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.7e-7525.68Show/hide
Query:  HMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNI----DKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADN
        H T  R  F        G V  G+ +  KI   G+I    +      L DVR+V  L+ NLIS   L   GY+  F +  +  + K + +   G  +   
Subjt:  HMTGNRSYFTNLNDCVTGHVTFGDGAKEKIIAKGNI----DKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADN

Query:  CYHWNSNMFDTCQLIRSDQ--TWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTES
         Y  N+ +         D+    LWH+++GH+S +GL+ + K   +       V     C  C  GKQ R S ++  E   N +L+L++ D+  PM+ ES
Subjt:  CYHWNSNMFDTCQLIRSDQ--TWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTES

Query:  LGGKRYVLVVVDDYSRYTW---------------------------------------------------------------------------------
        +GG +Y +  +DD SR  W                                                                                 
Subjt:  LGGKRYVLVVVDDYSRYTW---------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------KWDARSEQGIFLGYSQNSRAYRVYNNRSSSV
                                                                             K D +S   IF+GY      YR+++     V
Subjt:  ---------------------------------------------------------------------KWDARSEQGIFLGYSQNSRAYRVYNNRSSSV

Query:  MKTINVVINDLDSDIKQMNDEEDET-----PNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASS------IIGEPSAG
        +++ +VV    +S+++   D  ++      PN   + +TS    + A++++D   +  ++  E I  ++ E +    E+  HP         +       
Subjt:  MKTINVVINDLDSDIKQMNDEEDET-----PNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASS------IIGEPSAG

Query:  MQTRRKDKIDYL-----------------------------KMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDF
        +++RR    +Y+                             +M +  +N  + LV  P+G   +  KW+FK K D    + + KARLV +G+ Q +G+DF
Subjt:  MQTRRKDKIDYL-----------------------------KMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDF

Query:  DETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKT
        DE F+PV ++ +IR +L ++     ++ Q+DVK+AFL+  L EE+Y+ QP+GF  +     V KLNK+LYGLKQAPR WY +   +++ + Y +   D  
Subjt:  DETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKT

Query:  FFIHRKSD-QLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQI--KQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKL
         +  R S+   ++  +YVDD++  G    L+      +   F+M  +G     LG++I  ++ +  +++SQEKY + ++++F ++ A+   T  A H+KL
Subjt:  FFIHRKSD-QLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQI--KQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKL

Query:  TKDTEGAEVDHK------LYRSIVGNLLY-LTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDR
        +K      V+ K       Y S VG+L+Y +  +RPDIA+ VG+ + +  +P   H EAVK IL+Y+ GT+   + +   + P L GY DAD AG  ++R
Subjt:  TKDTEGAEVDHK------LYRSIVGNLLY-LTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDR

Query:  KSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLEYGFDQ
        KS++   F      ISW SK Q  V+LST EAEYIAA+    ++IW+K  L E G  Q
Subjt:  KSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLEYGFDQ

P25600 Putative transposon Ty5-1 protein YCL074W7.0e-3631.79Show/hide
Query:  MDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDL
        MDV +AFLN  ++E +YV QP GFV+  +P +V++L   +YGLKQAP  W + +   L+  G+ R E +   +    SD  +   +YVDD++       +
Subjt:  MDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDL

Query:  VNNFINIMQSEFEMSKVGELSCFLGLQIKQ-KNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLY-LTTSRPD
         +     +   + M  +G++  FLGL I Q  N DI +S + Y      +  +   +  +T       L + T     D   Y+SIVG LL+   T RPD
Subjt:  VNNFINIMQSEFEMSKVGELSCFLGLQIKQ-KNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLY-LTTSRPD

Query:  IAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKK-QNRVSLSTAEAEYIA
        I+Y V + + +  +PR  HLE+ +R+L+Y++ T    + Y   +   L  YCDA      +   ST      L    ++W SKK +  + + + EAEYI 
Subjt:  IAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKK-QNRVSLSTAEAEYIA

Query:  AS
        AS
Subjt:  AS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-7436.73Show/hide
Query:  ADLRNNVWTLVSKPEG-VNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNE
        A + N+ W LV  P   V ++G +WIF  K +  G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR++LG++  + + + Q+DV +AFL   L +
Subjt:  ADLRNNVWTLVSKPEG-VNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNE

Query:  EVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEM
        +VY++QP GF+D + P +V KL KALYGLKQAPRAWY  L  YL   G+     D + F+ ++   ++   +YVDDI+  G    L++N ++ +   F +
Subjt:  EVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEM

Query:  SKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPR
            EL  FLG++ K+    + +SQ +Y  +++ +  +  A+   T  A   KL+  +     D   YR IVG+L YL  +RPDI+Y V   + +   P 
Subjt:  SKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPR

Query:  ITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLE
          HL+A+KRIL+Y+ GT +  +      T +L  Y DADWAG  +D  ST+    +LG++ ISW SKKQ  V  S+ EAEY + ++  +++ W+ ++L E
Subjt:  ITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLE

Query:  YGF---DQDIMMCCVVEVNVLC
         G       ++ C  V    LC
Subjt:  YGF---DQDIMMCCVVEVNVLC

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-7535.73Show/hide
Query:  EPSAGMQTRRKDKIDYL---KMVADLRNNVWTLV-SKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGIS
        EP   +Q  + D+       ++ A + N+ W LV   P  V ++G +WIF  K +  G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR++LG++
Subjt:  EPSAGMQTRRKDKIDYL---KMVADLRNNVWTLV-SKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGIS

Query:  CIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDI
          + + + Q+DV +AFL   L +EVY++QP GFVD + P +V +L KA+YGLKQAPRAWY  L  YL   G+     D + F+ ++   ++   +YVDDI
Subjt:  CIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDI

Query:  IFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLY
        +  G    L+ + ++ +   F + +  +L  FLG++ K+    + +SQ +Y  +++ +  +  A+   T  AT  KLT  +     D   YR IVG+L Y
Subjt:  IFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLY

Query:  LTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTA
        L  +RPD++Y V   + Y   P   H  A+KR+L+Y+ GT D  +      T +L  Y DADWAG  +D  ST+    +LG++ ISW SKKQ  V  S+ 
Subjt:  LTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTA

Query:  EAEYIAASSGCTQLIWMKNMLLEYGFD---QDIMMCCVVEVNVLC
        EAEY + ++  ++L W+ ++L E G       ++ C  V    LC
Subjt:  EAEYIAASSGCTQLIWMKNMLLEYGFD---QDIMMCCVVEVNVLC

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.4e-7137.69Show/hide
Query:  NVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQ
        + W + + P     IG KW++K K +  G + + KARLVA+GYTQ EG+DF ETF+PV +L +++L+L IS I  F L+Q+D+ +AFLN  L+EE+Y+  
Subjt:  NVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQ

Query:  PKGFV----DSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSK
        P G+     DS  P  V  L K++YGLKQA R W+ + +V L G G+ +   D T+F+   +   L   +YVDDII        V+   + ++S F++  
Subjt:  PKGFV----DSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSK

Query:  VGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRIT
        +G L  FLGL+I +    I I Q KYA +++ +  L   +         V  +  + G  VD K YR ++G L+YL  +R DI++ V   + +   PR+ 
Subjt:  VGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRIT

Query:  HLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLE
        H +AV +IL Y+ GT    + YS      L  + DA +    + R+ST+  C FLG +LISW SKKQ  VS S+AEAEY A S    +++W+     E
Subjt:  HLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLE

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.4e-0731.82Show/hide
Query:  LYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGC-----FFLG
        +YLT +RPD+ + V   + + +  R   ++AV ++L YV GT    + YS  +   L  + D+DWA   + R+S +  C     +FLG
Subjt:  LYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGC-----FFLG

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.9e-0435.29Show/hide
Query:  SNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDL
        SN+ +T +    D+T LWH +L H+S RG+E ++K K  ++     V+   FC DC  GK  R +  S  +  T   L+ +H DL
Subjt:  SNMFDTCQLIRSDQTWLWHRKLGHVSMRGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDL

ATMG00810.1 DNA/RNA polymerases superfamily protein1.5e-3336.77Show/hide
Query:  IYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEV-DHKLYRS
        +YVDDI+  G    L+N  I  + S F M  +G +  FLG+QIK     +F+SQ KYA+ ++    +   +   T     +KL      A+  D   +RS
Subjt:  IYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEV-DHKLYRS

Query:  IVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQN
        IVG L YLT +RPDI+Y V I      +P +   + +KR+L+YV GT    +    ++   +  +CD+DWAG  + R+ST+  C FLG N+ISW +K+Q 
Subjt:  IVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPTLVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQN

Query:  RVSLSTAEAEYIAASSGCTQLIW
         VS S+ E EY A +    +L W
Subjt:  RVSLSTAEAEYIAASSGCTQLIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.6e-1437.74Show/hide
Query:  KNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIR
        K  P S I      G     ++++D L      RN  W LV  P   N++G KW+FK K+   G + + KARLVA+G+ Q EG+ F ET++PV R   IR
Subjt:  KNHPASSIIGEPSAGMQTRRKDKIDYLKMVADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIR

Query:  LLLGIS
         +L ++
Subjt:  LLLGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGAGGATGAATCAGTGTCTGATTATAATAAGAGGGTTCTTGAAATCGCAAATGAATCCTTACTGCTCGGTGAAAAGATACCTGATTCTAAGATAGTTCGGAAAGT
ACTTCGATCCTTGCCCAGAAAATTTGATATGAAAGTTACTGCGATTGAGAAAGCTCATGATATTACAACGCTGAAACTTGATGAATTGTTTGGCTCGTTGCTTACGTTTG
AGATGGCCACTGCTGATAGAGAAAGTAAGAAAGGCAAGGGAATTGCTTTTAAATCCACACATGTAAGTGAGGAGGTTGATTGTGATATTGAAGCAAACATAGATGAGTCA
ATAGCTTTATTAACCAAACAAATCACAAACGCCCTTCAGAATTTAAAAAATCCGAATGTCACAGATGAAGAATTTGTTGATAGTAGAGATGATGATGGCAACATAAATGC
TTTCACAGTACGAATTACTGATAAGAATACTGATGATGATAGTGAATGTTCGGTAGAAAGTAAAAATGATGAATTGTCAATTGAAAAGCTTGAAACTCTATGGAAAGAAG
ATTGTGAAGCTAGGGCAATACAAAAGGAAAGGATCCAAGATCTTTTAGAAGAAAATGAACGATTGATGTCTGTAATATCTTCTCTAAAGTTAAAATTGAGAGAGGTTCAA
AATGAGAATGATCAGATTTTAAAATCCGTTAAAATGCTAAATTTAGGAACGGAGAATCTAGATTCAATACTAAAAGCTGGACATAATGGCTCTCATAGATATGGGTTGGG
ATTTGTTGCCTCTGCAAGTAGTTCTAAAGCTACATCAGAAATCAAGTTTGTCCCTGCCTCAATGAGAGTTGAATATGACACGATTCATACAGAGGCTGGCATCAGGACTC
CAGTTAAATATCTTGGGAGAACTTGTTACTATTGTGGTCGAAAAGGTCATATCAGACATATGACTGGAAACAGATCCTACTTTACGAACTTAAACGACTGTGTCACCGGA
CATGTTACCTTTGGTGATGGTGCAAAAGAAAAAATTATAGCTAAAGGTAACATAGACAAGGATGATCTACCACGATTAAATGATGTTAGGTATGTGGATGGATTAAAAGC
AAACTTGATCAGTATAAGTCAACTGTGTGATCAAGGTTACAAAGTTAGTTTTGATGATATTGGTCGTGTTTTCATGAATAAAGAAAATCAAATTTGCATGAGTGGTAAAC
GACAAGCTGATAATTGCTACCATTGGAATTCAAATATGTTTGACACTTGTCAGTTGATAAGATCCGATCAAACATGGCTGTGGCACAGAAAGCTGGGGCATGTCAGCATG
AGAGGATTGGAAAAAGTTATTAAAAATAAAGCAGTTGTGGAAATTCTTGATTTAGACGTAAATGGAAACTTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGAAAAGGTC
TAGTCATAAAAGTTTGAAAGAATGTTATACCAATAGAGTCTTAGAACTGTTACATATGGATCTCATGAGTCCAATGCAAACAGAAAGTCTTGGAGGTAAGAGGTATGTGC
TGGTTGTTGTTGATGATTACTCAAGATACACTTGGAAATGGGATGCAAGGTCGGAACAAGGAATTTTTCTTGGGTACTCTCAAAACAGTCGGGCCTATAGAGTCTACAAT
AACAGATCCAGTAGTGTTATGAAAACAATCAATGTAGTTATAAATGATCTCGATTCTGATATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTATGAAGT
CAGAACTACGAGTATTGTGGAAGAGTCTAAAGCTGATAATTCATCCGACGGTCCAGGCAAAAGTTTGAAAAAATCATCAGAAGAAATTATCAATAAAAAATCTGAACTAA
TTCCATCAGCTCATGAAAAGAAAAATCATCCAGCAAGCTCTATCATAGGTGAACCATCAGCTGGGATGCAGACCAGAAGGAAAGATAAGATCGACTATTTGAAGATGGTT
GCTGACTTGCGAAACAATGTCTGGACATTAGTCTCAAAGCCAGAAGGTGTAAACGTTATTGGCACCAAATGGATATTTAAAAATAAGATTGATGAAACTGGATGTGTGAC
GAAAAATAAAGCCAGATTAGTAGCTCAAGGTTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCTGTTGCTCGACTTCAAGCCATTCGACTTTTATTGG
GCATATCATGCATACAGAAATTTAAATTGTATCAGATGGATGTAAAGAGTGCCTTCTTAAATGAGTACTTGAATGAGGAGGTTTATGTTGCCCAACCAAAAGGTTTTGTA
GATTCCGAGCACCCGAAGCATGTGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATGACCGGCTAACTGTGTACTTGAGAGGTAGAGG
ATATTTTAGAGAAGAAATTGACAAGACCTTCTTCATACACAGGAAATCTGACCAACTGTTGGTGGATCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCTTATG
ATCTAGTAAATAATTTCATTAATATTATGCAGTCAGAATTCGAAATGAGCAAGGTTGGAGAGCTTTCATGTTTTTTGGGACTTCAAATTAAGCAAAAGAATGATGACATT
TTTATATCACAAGAAAAGTACGCCAAGAATATGGTTAAAAAGTTTTGTTTGGAACAGGCTCGAAATAAGCGGACTCTAGCTGCGACACATGTTAAACTTACAAAAGACAC
TGAAGGTGCTGAAGTTGATCACAAACTTTACAGGAGTATAGTAGGCAACCTATTATACTTAACAACAAGTCGACCTGACATAGCTTATGTTGTGGGAATATGTGCTTGTT
ATCAGGCAGATCCCCGCATCACTCACCTAGAAGCTGTTAAACGAATTCTTAAATATGTTCATGGGACCAGTGACTTTAGAATGATGTATTCCTATGATACCACTCCCACT
CTAGTTGGATATTGTGATGCTGACTGGGCAGGTTCAGCTAATGATCGTAAAAGTACATCTGAAGGTTGCTTCTTTTTGGGAAACAATTTAATCTCTTGGTTAAGTAAGAA
GCAAAACCGTGTTTCTTTATCTACAGCTGAAGCTGAATATATAGCTGCAAGTAGTGGTTGTACACAATTGATTTGGATGAAAAACATGCTGCTTGAATATGGCTTCGATC
AGGACATTATGATGTGCTGTGTTGTTGAAGTGAATGTGCTGTGTATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCGAGGATGAATCAGTGTCTGATTATAATAAGAGGGTTCTTGAAATCGCAAATGAATCCTTACTGCTCGGTGAAAAGATACCTGATTCTAAGATAGTTCGGAAAGT
ACTTCGATCCTTGCCCAGAAAATTTGATATGAAAGTTACTGCGATTGAGAAAGCTCATGATATTACAACGCTGAAACTTGATGAATTGTTTGGCTCGTTGCTTACGTTTG
AGATGGCCACTGCTGATAGAGAAAGTAAGAAAGGCAAGGGAATTGCTTTTAAATCCACACATGTAAGTGAGGAGGTTGATTGTGATATTGAAGCAAACATAGATGAGTCA
ATAGCTTTATTAACCAAACAAATCACAAACGCCCTTCAGAATTTAAAAAATCCGAATGTCACAGATGAAGAATTTGTTGATAGTAGAGATGATGATGGCAACATAAATGC
TTTCACAGTACGAATTACTGATAAGAATACTGATGATGATAGTGAATGTTCGGTAGAAAGTAAAAATGATGAATTGTCAATTGAAAAGCTTGAAACTCTATGGAAAGAAG
ATTGTGAAGCTAGGGCAATACAAAAGGAAAGGATCCAAGATCTTTTAGAAGAAAATGAACGATTGATGTCTGTAATATCTTCTCTAAAGTTAAAATTGAGAGAGGTTCAA
AATGAGAATGATCAGATTTTAAAATCCGTTAAAATGCTAAATTTAGGAACGGAGAATCTAGATTCAATACTAAAAGCTGGACATAATGGCTCTCATAGATATGGGTTGGG
ATTTGTTGCCTCTGCAAGTAGTTCTAAAGCTACATCAGAAATCAAGTTTGTCCCTGCCTCAATGAGAGTTGAATATGACACGATTCATACAGAGGCTGGCATCAGGACTC
CAGTTAAATATCTTGGGAGAACTTGTTACTATTGTGGTCGAAAAGGTCATATCAGACATATGACTGGAAACAGATCCTACTTTACGAACTTAAACGACTGTGTCACCGGA
CATGTTACCTTTGGTGATGGTGCAAAAGAAAAAATTATAGCTAAAGGTAACATAGACAAGGATGATCTACCACGATTAAATGATGTTAGGTATGTGGATGGATTAAAAGC
AAACTTGATCAGTATAAGTCAACTGTGTGATCAAGGTTACAAAGTTAGTTTTGATGATATTGGTCGTGTTTTCATGAATAAAGAAAATCAAATTTGCATGAGTGGTAAAC
GACAAGCTGATAATTGCTACCATTGGAATTCAAATATGTTTGACACTTGTCAGTTGATAAGATCCGATCAAACATGGCTGTGGCACAGAAAGCTGGGGCATGTCAGCATG
AGAGGATTGGAAAAAGTTATTAAAAATAAAGCAGTTGTGGAAATTCTTGATTTAGACGTAAATGGAAACTTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGAAAAGGTC
TAGTCATAAAAGTTTGAAAGAATGTTATACCAATAGAGTCTTAGAACTGTTACATATGGATCTCATGAGTCCAATGCAAACAGAAAGTCTTGGAGGTAAGAGGTATGTGC
TGGTTGTTGTTGATGATTACTCAAGATACACTTGGAAATGGGATGCAAGGTCGGAACAAGGAATTTTTCTTGGGTACTCTCAAAACAGTCGGGCCTATAGAGTCTACAAT
AACAGATCCAGTAGTGTTATGAAAACAATCAATGTAGTTATAAATGATCTCGATTCTGATATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTATGAAGT
CAGAACTACGAGTATTGTGGAAGAGTCTAAAGCTGATAATTCATCCGACGGTCCAGGCAAAAGTTTGAAAAAATCATCAGAAGAAATTATCAATAAAAAATCTGAACTAA
TTCCATCAGCTCATGAAAAGAAAAATCATCCAGCAAGCTCTATCATAGGTGAACCATCAGCTGGGATGCAGACCAGAAGGAAAGATAAGATCGACTATTTGAAGATGGTT
GCTGACTTGCGAAACAATGTCTGGACATTAGTCTCAAAGCCAGAAGGTGTAAACGTTATTGGCACCAAATGGATATTTAAAAATAAGATTGATGAAACTGGATGTGTGAC
GAAAAATAAAGCCAGATTAGTAGCTCAAGGTTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCTGTTGCTCGACTTCAAGCCATTCGACTTTTATTGG
GCATATCATGCATACAGAAATTTAAATTGTATCAGATGGATGTAAAGAGTGCCTTCTTAAATGAGTACTTGAATGAGGAGGTTTATGTTGCCCAACCAAAAGGTTTTGTA
GATTCCGAGCACCCGAAGCATGTGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATGACCGGCTAACTGTGTACTTGAGAGGTAGAGG
ATATTTTAGAGAAGAAATTGACAAGACCTTCTTCATACACAGGAAATCTGACCAACTGTTGGTGGATCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCTTATG
ATCTAGTAAATAATTTCATTAATATTATGCAGTCAGAATTCGAAATGAGCAAGGTTGGAGAGCTTTCATGTTTTTTGGGACTTCAAATTAAGCAAAAGAATGATGACATT
TTTATATCACAAGAAAAGTACGCCAAGAATATGGTTAAAAAGTTTTGTTTGGAACAGGCTCGAAATAAGCGGACTCTAGCTGCGACACATGTTAAACTTACAAAAGACAC
TGAAGGTGCTGAAGTTGATCACAAACTTTACAGGAGTATAGTAGGCAACCTATTATACTTAACAACAAGTCGACCTGACATAGCTTATGTTGTGGGAATATGTGCTTGTT
ATCAGGCAGATCCCCGCATCACTCACCTAGAAGCTGTTAAACGAATTCTTAAATATGTTCATGGGACCAGTGACTTTAGAATGATGTATTCCTATGATACCACTCCCACT
CTAGTTGGATATTGTGATGCTGACTGGGCAGGTTCAGCTAATGATCGTAAAAGTACATCTGAAGGTTGCTTCTTTTTGGGAAACAATTTAATCTCTTGGTTAAGTAAGAA
GCAAAACCGTGTTTCTTTATCTACAGCTGAAGCTGAATATATAGCTGCAAGTAGTGGTTGTACACAATTGATTTGGATGAAAAACATGCTGCTTGAATATGGCTTCGATC
AGGACATTATGATGTGCTGTGTTGTTGAAGTGAATGTGCTGTGTATGTGA
Protein sequenceShow/hide protein sequence
MTEDESVSDYNKRVLEIANESLLLGEKIPDSKIVRKVLRSLPRKFDMKVTAIEKAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVDCDIEANIDES
IALLTKQITNALQNLKNPNVTDEEFVDSRDDDGNINAFTVRITDKNTDDDSECSVESKNDELSIEKLETLWKEDCEARAIQKERIQDLLEENERLMSVISSLKLKLREVQ
NENDQILKSVKMLNLGTENLDSILKAGHNGSHRYGLGFVASASSSKATSEIKFVPASMRVEYDTIHTEAGIRTPVKYLGRTCYYCGRKGHIRHMTGNRSYFTNLNDCVTG
HVTFGDGAKEKIIAKGNIDKDDLPRLNDVRYVDGLKANLISISQLCDQGYKVSFDDIGRVFMNKENQICMSGKRQADNCYHWNSNMFDTCQLIRSDQTWLWHRKLGHVSM
RGLEKVIKNKAVVEILDLDVNGNFFCGDCQIGKQKRSSHKSLKECYTNRVLELLHMDLMSPMQTESLGGKRYVLVVVDDYSRYTWKWDARSEQGIFLGYSQNSRAYRVYN
NRSSSVMKTINVVINDLDSDIKQMNDEEDETPNMYEVRTTSIVEESKADNSSDGPGKSLKKSSEEIINKKSELIPSAHEKKNHPASSIIGEPSAGMQTRRKDKIDYLKMV
ADLRNNVWTLVSKPEGVNVIGTKWIFKNKIDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLQAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFV
DSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYFREEIDKTFFIHRKSDQLLVDQIYVDDIIFGGFPYDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDI
FISQEKYAKNMVKKFCLEQARNKRTLAATHVKLTKDTEGAEVDHKLYRSIVGNLLYLTTSRPDIAYVVGICACYQADPRITHLEAVKRILKYVHGTSDFRMMYSYDTTPT
LVGYCDADWAGSANDRKSTSEGCFFLGNNLISWLSKKQNRVSLSTAEAEYIAASSGCTQLIWMKNMLLEYGFDQDIMMCCVVEVNVLCM