; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0006833 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0006833
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag-pol polyprotein
Genome locationchr01:10480336..10484787
RNA-Seq ExpressionIVF0006833
SyntenyIVF0006833
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035966.1 F5J5.1 [Cucumis melo var. makuwa]1.42e-31477.91Show/hide
Query:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH-KKEGD--------------KKIF
        NESLLLGEKIPDSKIVRKVLRS+ RKFD+KV AIE+AHDITTLKLDELFG L TF+MA  DRESKKGKGI+FKSTH  KE D               K F
Subjt:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH-KKEGD--------------KKIF

Query:  RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLE
                     EC TFLRKQKKNFRVTLSDEESVDSRDDDGNINAFA+RI DENTDD+SECSVESKNDELSIEKL+TLWKEDCEA+ IQKE IQDLLE
Subjt:  RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLE

Query:  ENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLER
        ENEWLMSVISSLKLKLR+VQNEND ILKSVKMLNSG +NLDSILK GHNGS RYGLGFV+SASSSKATSEIKFV AS+RV+YDTIH ETGIR  VKSL R
Subjt:  ENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLER

Query:  ACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREI
          YYCG+KGHIRSICYKLRQDQLRQQKHWNRS  QPC+VWRIK++ERCKIAFTSVQT DD WYFDSGC RHMTGN+SYF NLNDCV GHVTFGDGAK +I
Subjt:  ACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREI

Query:  IAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRG
        IAKGNI+KDDLPRLNDVRYVDGLK NLISI+QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTC+LIR DQTWLWHRKL H SMRG
Subjt:  IAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRG

Query:  LEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNL
        LEKVIKNKAVVGIPDLDVNGNFFCGDCQI                                     GKR+VLVVVDDY+RYTW+ FLKGK D VEICKNL
Subjt:  LEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNL

Query:  GLKLQREKRRR
         LKLQREK ++
Subjt:  GLKLQREKRRR

KAA0048228.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.097.78Show/hide
Query:  ECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLK
        E +  LRK+K   R   SDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLK
Subjt:  ECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLK

Query:  LKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRS
        LKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRS
Subjt:  LKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRS

Query:  ICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPR
        ICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPR
Subjt:  ICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPR

Query:  LNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGI
        LNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGI
Subjt:  LNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGI

Query:  PDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNLGLKLQREK
        PDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNLGLKLQREK
Subjt:  PDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNLGLKLQREK

KAA0053561.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.05e-31274.88Show/hide
Query:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTHKKEG---------DKKIF------
        NESLLLGEKIPDSKIVRKVLRSLPRKFD+KV AIE+AHDITTLKLDELFG L TF+MA  DRESKKGKGIAFK TH  E          D+ I       
Subjt:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTHKKEG---------DKKIF------

Query:  ---------------------RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLET
                             +YR     G  +     F R+    +     D+ESVDSRDDDGNINAFA+RI DENTDDDSECSVESKND+LSIEKLET
Subjt:  ---------------------RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLET

Query:  LWKEDCEAKAIQKERIQDLLEENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVR
        LWKEDCEA+AIQK+RIQDLLEENE LMSVISSLKLKLR+VQNEND ILKSVKMLNSGT+NLDSILK  HNGSHRYGLGFVASASSS+ATSEIKFV AS+R
Subjt:  LWKEDCEAKAIQKERIQDLLEENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVR

Query:  VKYDTIHTETGIRTPVKSLERACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYF
        V+YDTIH+ETGIRTPVKSL R CYYCGRKGHI                                   RCKIA TS Q  DDAWYFDSGC RHMTGN+SYF
Subjt:  VKYDTIHTETGIRTPVKSLERACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYF

Query:  TNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQ
        TNLNDCVT HVT GDGAK +IIAKGNIDK DLPRLNDVRYVDGLK NLISISQLCDQGYKVSFDDIG VVMNKENQICMSGKRQADNCYHWNSNMSDTCQ
Subjt:  TNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQ

Query:  LIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYS
        LIR DQTWLWHRKL HVSMRGLEK+IKNKAVVGIPDLDVNGNFFCGDCQIGKQTR THK+LKECYTNRVL+LLHMDLMGPM  ESL GKRYVLVVVDDYS
Subjt:  LIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYS

Query:  RYTWLCFLKGKIDTVEICKNLGLKLQREK
        RYTW+CFLK K DTVE CKNL LKLQREK
Subjt:  RYTWLCFLKGKIDTVEICKNLGLKLQREK

TYK06527.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.083.83Show/hide
Query:  KVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH------------------------------KKEGDKKIFRYRECGGVGHYQVE
        KV AI++AHDITTLKLDELFG L TF+M   DRESKKGKGIAFKSTH                              KK+G++KIFR RECGGVGHYQ E
Subjt:  KVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH------------------------------KKEGDKKIFRYRECGGVGHYQVE

Query:  CLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLKL
        CLTFLRKQKKN RVTLSDEE VDSRDDDGNINAF +RI ++NTDDDSECSVESKNDELSIEKLETLWKEDCEA+AIQKERIQDLLEE E LMSVISSLKL
Subjt:  CLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLKL

Query:  KLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRSI
        KLR+VQNEND ILK VKMLNSGTENLDSILK GHNGSHRYGLGFVASASSSKATSEIKFV AS+RV+YDTIHT+TGIRTPVKSL R CYY GRKGHIRSI
Subjt:  KLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRSI

Query:  CYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPRL
        CYKL++DQLRQQKHWNRSR QP +V RIK  ERCK+AFTSVQT DDAWYFDSGCFRHM GN+SYFTNLNDCV  HVTFGDGAK +IIAKGNIDKDDLPRL
Subjt:  CYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPRL

Query:  NDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIP
        NDVRYVDGLK NLI+ISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIR DQTWLWH KL HVSMRGLEKVIKNKAVVGIP
Subjt:  NDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIP

Query:  DLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTW
        +LDVNGNFFC DCQIGKQTRST+KSLKECYTNRVL+LLHMDLMGPMQT+SLGGKRYVLVV DDYSRYTW
Subjt:  DLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTW

TYK30437.1 F5J5.1 [Cucumis melo var. makuwa]1.19e-31477.91Show/hide
Query:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH-KKEGD--------------KKIF
        NESLLLGEKIPDSKIVRKVLRS+ RKFD+KV AIE+AHDITTLKLDELFG L TF+MA  DRESKKGKGI+FKSTH  KE D               K F
Subjt:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH-KKEGD--------------KKIF

Query:  RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLE
                     EC TFLRKQKKNFRVTLSDEESVDSRDDDGNINAFA+RI DENTDD+SECSVESKNDELSIEKL+TLWKEDCEA+ IQKE IQDLLE
Subjt:  RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLE

Query:  ENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLER
        ENEWLMSVISSLKLKLR+VQNEND ILKSVKMLNSG +NLDSILK GHNGS RYGLGFV+SASSSKATSEIKFV AS+RV+YDTIH ETGIR  VKSL R
Subjt:  ENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLER

Query:  ACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREI
          YYCG+KGHIRSICYKLRQDQLRQQKHWNRS  QPC+VWRIK++ERCKIAFTSVQT DD WYFDSGC RHMTGN+SYF NLNDCV GHVTFGDGAK +I
Subjt:  ACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREI

Query:  IAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRG
        IAKGNI+KDDLPRLNDVRYVDGLK NLISI+QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTC+LIR DQTWLWHRKL H SMRG
Subjt:  IAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRG

Query:  LEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNL
        LEKVIKNKAVVGIPDLDVNGNFFCGDCQI                                     GKR+VLVVVDDY+RYTW+ FLKGK D VEICKNL
Subjt:  LEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNL

Query:  GLKLQREKRRR
         LKLQREK ++
Subjt:  GLKLQREKRRR

TrEMBL top hitse value%identityAlignment
A0A5A7T169 F5J5.14.4e-26677.91Show/hide
Query:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH-KKEGD--------------KKIF
        NESLLLGEKIPDSKIVRKVLRS+ RKFD+KV AIE+AHDITTLKLDELFG L TF+MA  DRESKKGKGI+FKSTH  KE D               K F
Subjt:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH-KKEGD--------------KKIF

Query:  RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLE
                     EC TFLRKQKKNFRVTLSDEESVDSRDDDGNINAFA+RI DENTDD+SECSVESKNDELSIEKL+TLWKEDCEA+ IQKE IQDLLE
Subjt:  RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLE

Query:  ENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLER
        ENEWLMSVISSLKLKLR+VQNEND ILKSVKMLNSG +NLDSILK GHNGS RYGLGFV+SASSSKATSEIKFV AS+RV+YDTIH ETGIR  VKSL R
Subjt:  ENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLER

Query:  ACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREI
          YYCG+KGHIRSICYKLRQDQLRQQKHWNRS  QPC+VWRIK++ERCKIAFTSVQT DD WYFDSGC RHMTGN+SYF NLNDCV GHVTFGDGAK +I
Subjt:  ACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREI

Query:  IAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRG
        IAKGNI+KDDLPRLNDVRYVDGLK NLISI+QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTC+LIR DQTWLWHRKL H SMRG
Subjt:  IAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRG

Query:  LEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNL
        LEKVIKNKAVVGIPDLDVNGNFFCGDCQI                                     GKR+VLVVVDDY+RYTW+ FLKGK D VEICKNL
Subjt:  LEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNL

Query:  GLKLQREKRRR
         LKLQREK ++
Subjt:  GLKLQREKRRR

A0A5A7UJA4 Gag-pol polyprotein3.5e-25574.53Show/hide
Query:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTHKKEG---------DKKIF------
        NESLLLGEKIPDSKIVRKVLRSLPRKFD+KV AIE+AHDITTLKLDELFG L TF+MA  DRESKKGKGIAFK TH  E          D+ I       
Subjt:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTHKKEG---------DKKIF------

Query:  ---------------------RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLET
                             +YR     G  +     F R+    +     D+ESVDSRDDDGNINAFA+RI DENTDDDSECSVESKND+LSIEKLET
Subjt:  ---------------------RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLET

Query:  LWKEDCEAKAIQKERIQDLLEENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVR
        LWKEDCEA+AIQK+RIQDLLEENE LMSVISSLKLKLR+VQNEND ILKSVKMLNSGT+NLDSILK  HNGSHRYGLGFVASASSS+ATSEIKFV AS+R
Subjt:  LWKEDCEAKAIQKERIQDLLEENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVR

Query:  VKYDTIHTETGIRTPVKSLERACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYF
        V+YDTIH+ETGIRTPVKSL R CYYCGRKGHI                                   RCKIA TS Q  DDAWYFDSGC RHMTGN+SYF
Subjt:  VKYDTIHTETGIRTPVKSLERACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYF

Query:  TNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQ
        TNLNDCVT HVT GDGAK +IIAKGNIDK DLPRLNDVRYVDGLK NLISISQLCDQGYKVSFDDIG VVMNKENQICMSGKRQADNCYHWNSNMSDTCQ
Subjt:  TNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQ

Query:  LIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYS
        LIR DQTWLWHRKL HVSMRGLEK+IKNKAVVGIPDLDVNGNFFCGDCQIGKQTR THK+LKECYTNRVL+LLHMDLMGPM  ESL GKRYVLVVVDDYS
Subjt:  LIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYS

Query:  RYTWLCFLKGKIDTVEICKNLGLKLQREKRRR
        RYTW+CFLK K DTVE CKNL LKLQREK ++
Subjt:  RYTWLCFLKGKIDTVEICKNLGLKLQREKRRR

A0A5D3C3W3 Gag-pol polyprotein1.2e-27483.68Show/hide
Query:  IKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH------------------------------KKEGDKKIFRYRECGGVGHYQV
        +KV AI++AHDITTLKLDELFG L TF+M   DRESKKGKGIAFKSTH                              KK+G++KIFR RECGGVGHYQ 
Subjt:  IKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH------------------------------KKEGDKKIFRYRECGGVGHYQV

Query:  ECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLK
        ECLTFLRKQKKN RVTLSDEE VDSRDDDGNINAF +RI ++NTDDDSECSVESKNDELSIEKLETLWKEDCEA+AIQKERIQDLLEE E LMSVISSLK
Subjt:  ECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLK

Query:  LKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRS
        LKLR+VQNEND ILK VKMLNSGTENLDSILK GHNGSHRYGLGFVASASSSKATSEIKFV AS+RV+YDTIHT+TGIRTPVKSL R CYY GRKGHIRS
Subjt:  LKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRS

Query:  ICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPR
        ICYKL++DQLRQQKHWNRSR QP +V RIK  ERCK+AFTSVQT DDAWYFDSGCFRHM GN+SYFTNLNDCV  HVTFGDGAK +IIAKGNIDKDDLPR
Subjt:  ICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPR

Query:  LNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGI
        LNDVRYVDGLK NLI+ISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIR DQTWLWH KL HVSMRGLEKVIKNKAVVGI
Subjt:  LNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGI

Query:  PDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTW
        P+LDVNGNFFC DCQIGKQTRST+KSLKECYTNRVL+LLHMDLMGPMQT+SLGGKRYVLVV DDYSRYTW
Subjt:  PDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTW

A0A5D3C7T5 Gag-pol polyprotein1.4e-28397.19Show/hide
Query:  ECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLK
        E +  LRK+K   R   SDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLK
Subjt:  ECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEENEWLMSVISSLK

Query:  LKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRS
        LKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRS
Subjt:  LKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERACYYCGRKGHIRS

Query:  ICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPR
        ICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPR
Subjt:  ICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDLPR

Query:  LNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGI
        LNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGI
Subjt:  LNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGI

Query:  PDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNLGLKLQREKRRR
        PDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNLGLKLQREK ++
Subjt:  PDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNLGLKLQREKRRR

A0A5D3E2Y4 F5J5.14.4e-26677.91Show/hide
Query:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH-KKEGD--------------KKIF
        NESLLLGEKIPDSKIVRKVLRS+ RKFD+KV AIE+AHDITTLKLDELFG L TF+MA  DRESKKGKGI+FKSTH  KE D               K F
Subjt:  NESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTH-KKEGD--------------KKIF

Query:  RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLE
                     EC TFLRKQKKNFRVTLSDEESVDSRDDDGNINAFA+RI DENTDD+SECSVESKNDELSIEKL+TLWKEDCEA+ IQKE IQDLLE
Subjt:  RYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLE

Query:  ENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLER
        ENEWLMSVISSLKLKLR+VQNEND ILKSVKMLNSG +NLDSILK GHNGS RYGLGFV+SASSSKATSEIKFV AS+RV+YDTIH ETGIR  VKSL R
Subjt:  ENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLER

Query:  ACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREI
          YYCG+KGHIRSICYKLRQDQLRQQKHWNRS  QPC+VWRIK++ERCKIAFTSVQT DD WYFDSGC RHMTGN+SYF NLNDCV GHVTFGDGAK +I
Subjt:  ACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREI

Query:  IAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRG
        IAKGNI+KDDLPRLNDVRYVDGLK NLISI+QLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTC+LIR DQTWLWHRKL H SMRG
Subjt:  IAKGNIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRG

Query:  LEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNL
        LEKVIKNKAVVGIPDLDVNGNFFCGDCQI                                     GKR+VLVVVDDY+RYTW+ FLKGK D VEICKNL
Subjt:  LEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNL

Query:  GLKLQREKRRR
         LKLQREK ++
Subjt:  GLKLQREKRRR

SwissProt top hitse value%identityAlignment
P0C2J7 Transposon Ty4-H Gag-Pol polyprotein4.9e-0430.21Show/hide
Query:  HRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGN-FFCGDCQIGKQTRSTH--KSLKECYTNRVL-KLLHMDLMGPMQTESLGGKRYVLVVVDDYSRY
        H+++ H  ++ +E  IK+       DL    N F+C  C+I K T+  H   S+    T+        MD+ GP+ + +   KRY+L++VD+ +RY
Subjt:  HRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGN-FFCGDCQIGKQTRSTH--KSLKECYTNRVL-KLLHMDLMGPMQTESLGGKRYVLVVVDDYSRY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-1725.91Show/hide
Query:  KSLERACYYCGRKGHIRSICYKLR--QDQLRQQKHWNRS----RGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGH
        KS  R CY C + GH +  C   R  + +   QK+ + +    +    +V  I   E C      +   +  W  D+    H T  +  F        G 
Subjt:  KSLERACYYCGRKGHIRSICYKLR--QDQLRQQKHWNRS----RGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGH

Query:  VTFGDGAKREIIAKGNI----DKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQ
        V  G+ +  +I   G+I    +      L DVR+V  L+ NLIS   L   GY+  F +     + K + +   G  +    Y  N+ +         D+
Subjt:  VTFGDGAKREIIAKGNI----DKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQ

Query:  --TWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTW
            LWH+++ H+S +GL+ + K   +       V     C  C  GKQ R + ++  E   N +L L++ D+ GPM+ ES+GG +Y +  +DD SR  W
Subjt:  --TWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTW

Query:  LCFLKGKIDTVEICKNLGLKLQREKRRR
        +  LK K    ++ +     ++RE  R+
Subjt:  LCFLKGKIDTVEICKNLGLKLQREKRRR

P93293 Uncharacterized mitochondrial protein AtMg003003.7e-0434.48Show/hide
Query:  SNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMG
        SN+++T +    D+T LWH +L H+S RG+E ++K      +    V+   FC DC  GK T   + S  +  T   L  +H DL G
Subjt:  SNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-1226.23Show/hide
Query:  CYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTG--HVTFGDGAKRE
        C  CG +GH    C +L Q  L      ++    P   W+     R  +A  S  + ++ W  DSG   H+T + +   +L+   TG   V   DG+   
Subjt:  CYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTG--HVTFGDGAKRE

Query:  IIAKGNIDKDDLPR---LNDVRYVDGLKGNLISISQLCD-QGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWL----WHR
        I   G+       R   L+++ YV  +  NLIS+ +LC+  G  V F      V +    + +   +  D  Y W    S    L     +      WH 
Subjt:  IIAKGNIDKDDLPR---LNDVRYVDGLKGNLISISQLCD-QGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWL----WHR

Query:  KLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFF-CGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGK
        +L H +   L  VI N     +  L+ +  F  C DC I K  +    S     + R L+ ++ D+       S    RY ++ VD ++RYTWL  LK K
Subjt:  KLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFF-CGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGK

Query:  IDTVE
            E
Subjt:  IDTVE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-1428.35Show/hide
Query:  RTPVKSLERACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNL--NDCVTG-
        R P   L R C  C  +GH    C +L Q Q    +  + S   P   W+     R  +A  S    ++ W  DSG   H+T   S F NL  +   TG 
Subjt:  RTPVKSLERACYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNL--NDCVTG-

Query:  -HVTFGDGAKREIIAKGNIDKDDLPR---LNDVRYVDGLKGNLISISQLCDQG-YKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRP
          V   DG+   I   G+       R   LN V YV  +  NLIS+ +LC+     V F      V +    + +   +  D  Y W    S    +   
Subjt:  -HVTFGDGAKREIIAKGNIDKDDLPR---LNDVRYVDGLKGNLISISQLCDQG-YKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRP

Query:  DQTWL----WHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFF-CGDCQIGKQTRSTHK---SLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVV
          +      WH +L H S+  L  VI N +   +P L+ +     C DC I K    +HK   S     +++ L+ ++ D+       S+   RY ++ V
Subjt:  DQTWL----WHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFF-CGDCQIGKQTRSTHK---SLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVV

Query:  DDYSRYTWLCFLKGKI---DTVEICKNL
        D ++RYTWL  LK K    DT  I K+L
Subjt:  DDYSRYTWLCFLKGKI---DTVEICKNL

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein3.8e-0430.1Show/hide
Query:  DDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKRE---IIAKG-----NIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFD-DIGCVV
        ++ W   S    HMT +  +FT L+      V F  G K E    + +G      I  +    + +V YV G++GN +S+SQL   G++VS +   GC V
Subjt:  DDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKRE---IIAKG-----NIDKDDLPRLNDVRYVDGLKGNLISISQLCDQGYKVSFD-DIGCVV

Query:  MNK
         ++
Subjt:  MNK

AT4G05360.1 Zinc knuckle (CCHC-type) family protein1.6e-0523.18Show/hide
Query:  LGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTHKKEGDKKIFRYRECGGVGHYQVECLTFLR
        LGE +P+  IV+K+++ + +    +++ I +         +E    L  FK+          K + +KS+  K       R  EC G  H   EC   ++
Subjt:  LGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKKGKGIAFKSTHKKEGDKKIFRYRECGGVGHYQVECLTFLR

Query:  KQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSEC------------------SVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEEN
        +++K F   +SD E +DS D +   N  A    + +    S                    +  S ND+   + L    +E  E      E    ++EEN
Subjt:  KQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSEC------------------SVESKNDELSIEKLETLWKEDCEAKAIQKERIQDLLEEN

Query:  EWL--------MSVISSLKLKLRKVQ--NENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSS-----------KATSEIKFVLASVRVK
          L          V+ +LK    K +  ++ +   K+++MLN+GT+ L  IL +G   + + GLGF  + S S            A+  +K       + 
Subjt:  EWL--------MSVISSLKLKLRKVQ--NENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSS-----------KATSEIKFVLASVRVK

Query:  YDT-------IHTETGIR-------TPVKSLERACYYCGRKGHIRSICYKLRQDQLRQQKHWN-RSRGQPC
         DT         T +G R        P +     C++CG  GHIR  C++L +++ R    ++ R  G  C
Subjt:  YDT-------IHTETGIR-------TPVKSLERACYYCGRKGHIRSICYKLRQDQLRQQKHWN-RSRGQPC

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.7e-0534.48Show/hide
Query:  SNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMG
        SN+++T +    D+T LWH +L H+S RG+E ++K      +    V+   FC DC  GK T   + S  +  T   L  +H DL G
Subjt:  SNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGNFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCTGAAGTTGATCACAAACTTTACAAAAGACACTGATGGTGCTGAAGTCGAGGAGTCTGCTGCTCCTGTCACTCCTGTTGTGCATGCACATCGAGCTTCTGAGGC
CACTGTATCGAATATGGATTCAGATGATCAGGATAATGTTCCGTTAATCAGTTCCATTCACTCTCAAGAGAGCTCATCAACCGAAGGGGTATTCATTCCAACTCCAATAG
ACCCTCGACGCTCACCAGCTCTACCTTCAGGTCATTCTCCTTCTATTCACCCTTCTGAATCAAGACTACCTACTTCGCAACCAGATGCGACTGAGATACCTCCTGAAGAC
ATTCCTCCTCCTACCGATGATCCCATTGCACCATCTTCTGAAGGCCAAAGTCTCCTAAAGGAAAGCGTTCAATGCTGGAAGTTTTTGATGCAGAGGAGAATTGTTGATGA
GAGAATTATTGGGTTTAATGATCCAAGTAGTGCCGACTATCAGACAGTGCACATTAGAGGGTTCAAATTTGTGATTTCACCTTCTGTGATAAATGAGTTTCTTGGAAATA
CTGTTGATATTGACTGCTCTCCATCATGTCCTATTATTGAGGTTTTAGCTAATGTCTTATCTGGAGGGACCTTGTCCACATGGCCTGTGAATGGAATAACTGCAGCTGCT
CCTAGCATCAAGTATGCCATTCTGCACAAGATTGGCATTGCAAAATGGTTCTCTTCCTCGCATGCCTCAAGCATATCTGCGGCACTAGATACGGGTGCCTTCATTTACAA
TCAACTGTTGAGGCATGTTGGGTCGTTTGGGATCAAGATTCCCATTGCTTTTTCGAGGTTGTTCTCCAGTCTGCTACTTCATCTAAATGGAGCGGTGCTTACTGCATTTG
ATGCTCCTGGACTTGAACCTAAGACAATTGCACTTAGCTACAGACTCTTCCAAGGCAGTCATGTTCCTGATATTGACCATGATGTGCATCCAACTCGGGGCCCACGTATT
TTTGACACTACTGACTGGGATGATTCTGCTGAAGGCTTCTATGGGGAGCAGGTTGATGCTCTCATTCGACATTTGAAGTCTTCCACACCATCTACTAGTCGTCAGCAGCC
ACCATCTGAGAAGGACCATCGCAATGAATCCTTGCTGCTCGGTGAAAAGATACCTGATTCTAAGATAGTTCGGAAAGTACTTCGATCCTTGCCCAGAAAATTTGATATAA
AAGTTATTGCAATTGAGAAAGCTCATGATATTACAACGCTAAAACTTGATGAATTGTTTGGCTTGTTGTTTACGTTTAAGATGGCCATTGTTGATAGAGAAAGTAAGAAA
GGCAAGGGAATTGCTTTTAAATCCACACATAAAAAGGAAGGTGACAAAAAGATTTTCAGGTATAGAGAATGTGGAGGAGTTGGTCATTATCAGGTAGAATGTCTTACATT
CTTGAGAAAACAGAAGAAAAACTTTCGTGTCACACTGTCAGATGAAGAATCTGTTGATAGTAGAGATGATGATGGCAACATAAATGCTTTTGCAGTACGAATTATTGATG
AGAACACTGATGACGATAGTGAATGTTCGGTAGAAAGTAAAAATGATGAATTGTCAATTGAGAAGCTTGAAACTCTATGGAAAGAAGATTGTGAAGCTAAGGCAATACAA
AAGGAAAGGATCCAAGATCTTTTAGAAGAAAATGAATGGTTGATGTCTGTAATATCTTCTCTAAAGTTAAAATTGAGAAAGGTTCAAAATGAGAATGATAATATTTTAAA
ATCCGTTAAAATGCTAAATTCAGGAACGGAGAATCTAGATTCAATACTAAAAGTTGGACATAATGGCTCTCATAGATATGGGTTGGGATTTGTGGCCTCTGCAAGTAGTT
CTAAAGCTACATCAGAAATCAAGTTTGTCCTTGCCTCAGTGAGAGTTAAATATGACACGATTCATACAGAGACTGGCATCAGGACTCCAGTTAAATCTCTTGAGAGAGCT
TGTTACTATTGTGGTCGAAAAGGTCATATCAGGTCAATTTGTTATAAATTAAGGCAAGACCAGTTACGTCAACAGAAACACTGGAATAGAAGCCGTGGTCAACCTTGCTT
GGTTTGGAGAATTAAATTTGTTGAAAGATGTAAGATTGCCTTTACATCCGTTCAAACCGTAGATGATGCGTGGTATTTTGATAGTGGGTGCTTCAGACATATGACTGGAA
ACAAATCCTACTTTACGAACTTAAACGACTGTGTCACCGGACATGTTACCTTTGGTGATGGTGCAAAAAGAGAAATTATAGCTAAAGGTAACATAGACAAGGATGATCTA
CCACGACTAAACGATGTTAGGTATGTGGATGGACTAAAAGGAAACTTGATCAGTATAAGTCAACTGTGTGATCAAGGTTACAAAGTTAGTTTTGATGATATTGGTTGTGT
TGTCATGAATAAAGAAAATCAAATTTGCATGAGTGGTAAACGACAAGCTGATAATTGCTACCATTGGAATTCAAATATGTCTGACACCTGTCAGCTGATAAGACCCGATC
AAACATGGTTGTGGCACAGAAAGCTGAGGCATGTCAGCATGAGAGGATTGGAAAAAGTTATTAAAAATAAAGCAGTTGTGGGAATTCCTGATTTAGACGTAAATGGAAAC
TTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGACAAGGTCTACTCATAAAAGTTTAAAAGAATGTTATACCAATAGAGTCTTGAAACTGTTACATATGGATCTCATGGG
TCCAATGCAAACAGAAAGTCTTGGAGGTAAGAGGTATGTGTTGGTTGTTGTTGATGATTACTCAAGATACACTTGGCTTTGTTTTCTCAAAGGAAAAATAGATACGGTTG
AAATATGCAAAAATCTGGGTTTGAAGCTACAACGTGAAAAGCGAAGAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCTGAAGTTGATCACAAACTTTACAAAAGACACTGATGGTGCTGAAGTCGAGGAGTCTGCTGCTCCTGTCACTCCTGTTGTGCATGCACATCGAGCTTCTGAGGC
CACTGTATCGAATATGGATTCAGATGATCAGGATAATGTTCCGTTAATCAGTTCCATTCACTCTCAAGAGAGCTCATCAACCGAAGGGGTATTCATTCCAACTCCAATAG
ACCCTCGACGCTCACCAGCTCTACCTTCAGGTCATTCTCCTTCTATTCACCCTTCTGAATCAAGACTACCTACTTCGCAACCAGATGCGACTGAGATACCTCCTGAAGAC
ATTCCTCCTCCTACCGATGATCCCATTGCACCATCTTCTGAAGGCCAAAGTCTCCTAAAGGAAAGCGTTCAATGCTGGAAGTTTTTGATGCAGAGGAGAATTGTTGATGA
GAGAATTATTGGGTTTAATGATCCAAGTAGTGCCGACTATCAGACAGTGCACATTAGAGGGTTCAAATTTGTGATTTCACCTTCTGTGATAAATGAGTTTCTTGGAAATA
CTGTTGATATTGACTGCTCTCCATCATGTCCTATTATTGAGGTTTTAGCTAATGTCTTATCTGGAGGGACCTTGTCCACATGGCCTGTGAATGGAATAACTGCAGCTGCT
CCTAGCATCAAGTATGCCATTCTGCACAAGATTGGCATTGCAAAATGGTTCTCTTCCTCGCATGCCTCAAGCATATCTGCGGCACTAGATACGGGTGCCTTCATTTACAA
TCAACTGTTGAGGCATGTTGGGTCGTTTGGGATCAAGATTCCCATTGCTTTTTCGAGGTTGTTCTCCAGTCTGCTACTTCATCTAAATGGAGCGGTGCTTACTGCATTTG
ATGCTCCTGGACTTGAACCTAAGACAATTGCACTTAGCTACAGACTCTTCCAAGGCAGTCATGTTCCTGATATTGACCATGATGTGCATCCAACTCGGGGCCCACGTATT
TTTGACACTACTGACTGGGATGATTCTGCTGAAGGCTTCTATGGGGAGCAGGTTGATGCTCTCATTCGACATTTGAAGTCTTCCACACCATCTACTAGTCGTCAGCAGCC
ACCATCTGAGAAGGACCATCGCAATGAATCCTTGCTGCTCGGTGAAAAGATACCTGATTCTAAGATAGTTCGGAAAGTACTTCGATCCTTGCCCAGAAAATTTGATATAA
AAGTTATTGCAATTGAGAAAGCTCATGATATTACAACGCTAAAACTTGATGAATTGTTTGGCTTGTTGTTTACGTTTAAGATGGCCATTGTTGATAGAGAAAGTAAGAAA
GGCAAGGGAATTGCTTTTAAATCCACACATAAAAAGGAAGGTGACAAAAAGATTTTCAGGTATAGAGAATGTGGAGGAGTTGGTCATTATCAGGTAGAATGTCTTACATT
CTTGAGAAAACAGAAGAAAAACTTTCGTGTCACACTGTCAGATGAAGAATCTGTTGATAGTAGAGATGATGATGGCAACATAAATGCTTTTGCAGTACGAATTATTGATG
AGAACACTGATGACGATAGTGAATGTTCGGTAGAAAGTAAAAATGATGAATTGTCAATTGAGAAGCTTGAAACTCTATGGAAAGAAGATTGTGAAGCTAAGGCAATACAA
AAGGAAAGGATCCAAGATCTTTTAGAAGAAAATGAATGGTTGATGTCTGTAATATCTTCTCTAAAGTTAAAATTGAGAAAGGTTCAAAATGAGAATGATAATATTTTAAA
ATCCGTTAAAATGCTAAATTCAGGAACGGAGAATCTAGATTCAATACTAAAAGTTGGACATAATGGCTCTCATAGATATGGGTTGGGATTTGTGGCCTCTGCAAGTAGTT
CTAAAGCTACATCAGAAATCAAGTTTGTCCTTGCCTCAGTGAGAGTTAAATATGACACGATTCATACAGAGACTGGCATCAGGACTCCAGTTAAATCTCTTGAGAGAGCT
TGTTACTATTGTGGTCGAAAAGGTCATATCAGGTCAATTTGTTATAAATTAAGGCAAGACCAGTTACGTCAACAGAAACACTGGAATAGAAGCCGTGGTCAACCTTGCTT
GGTTTGGAGAATTAAATTTGTTGAAAGATGTAAGATTGCCTTTACATCCGTTCAAACCGTAGATGATGCGTGGTATTTTGATAGTGGGTGCTTCAGACATATGACTGGAA
ACAAATCCTACTTTACGAACTTAAACGACTGTGTCACCGGACATGTTACCTTTGGTGATGGTGCAAAAAGAGAAATTATAGCTAAAGGTAACATAGACAAGGATGATCTA
CCACGACTAAACGATGTTAGGTATGTGGATGGACTAAAAGGAAACTTGATCAGTATAAGTCAACTGTGTGATCAAGGTTACAAAGTTAGTTTTGATGATATTGGTTGTGT
TGTCATGAATAAAGAAAATCAAATTTGCATGAGTGGTAAACGACAAGCTGATAATTGCTACCATTGGAATTCAAATATGTCTGACACCTGTCAGCTGATAAGACCCGATC
AAACATGGTTGTGGCACAGAAAGCTGAGGCATGTCAGCATGAGAGGATTGGAAAAAGTTATTAAAAATAAAGCAGTTGTGGGAATTCCTGATTTAGACGTAAATGGAAAC
TTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGACAAGGTCTACTCATAAAAGTTTAAAAGAATGTTATACCAATAGAGTCTTGAAACTGTTACATATGGATCTCATGGG
TCCAATGCAAACAGAAAGTCTTGGAGGTAAGAGGTATGTGTTGGTTGTTGTTGATGATTACTCAAGATACACTTGGCTTTGTTTTCTCAAAGGAAAAATAGATACGGTTG
AAATATGCAAAAATCTGGGTTTGAAGCTACAACGTGAAAAGCGAAGAAGATAA
Protein sequenceShow/hide protein sequence
MVLKLITNFTKDTDGAEVEESAAPVTPVVHAHRASEATVSNMDSDDQDNVPLISSIHSQESSSTEGVFIPTPIDPRRSPALPSGHSPSIHPSESRLPTSQPDATEIPPED
IPPPTDDPIAPSSEGQSLLKESVQCWKFLMQRRIVDERIIGFNDPSSADYQTVHIRGFKFVISPSVINEFLGNTVDIDCSPSCPIIEVLANVLSGGTLSTWPVNGITAAA
PSIKYAILHKIGIAKWFSSSHASSISAALDTGAFIYNQLLRHVGSFGIKIPIAFSRLFSSLLLHLNGAVLTAFDAPGLEPKTIALSYRLFQGSHVPDIDHDVHPTRGPRI
FDTTDWDDSAEGFYGEQVDALIRHLKSSTPSTSRQQPPSEKDHRNESLLLGEKIPDSKIVRKVLRSLPRKFDIKVIAIEKAHDITTLKLDELFGLLFTFKMAIVDRESKK
GKGIAFKSTHKKEGDKKIFRYRECGGVGHYQVECLTFLRKQKKNFRVTLSDEESVDSRDDDGNINAFAVRIIDENTDDDSECSVESKNDELSIEKLETLWKEDCEAKAIQ
KERIQDLLEENEWLMSVISSLKLKLRKVQNENDNILKSVKMLNSGTENLDSILKVGHNGSHRYGLGFVASASSSKATSEIKFVLASVRVKYDTIHTETGIRTPVKSLERA
CYYCGRKGHIRSICYKLRQDQLRQQKHWNRSRGQPCLVWRIKFVERCKIAFTSVQTVDDAWYFDSGCFRHMTGNKSYFTNLNDCVTGHVTFGDGAKREIIAKGNIDKDDL
PRLNDVRYVDGLKGNLISISQLCDQGYKVSFDDIGCVVMNKENQICMSGKRQADNCYHWNSNMSDTCQLIRPDQTWLWHRKLRHVSMRGLEKVIKNKAVVGIPDLDVNGN
FFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMGPMQTESLGGKRYVLVVVDDYSRYTWLCFLKGKIDTVEICKNLGLKLQREKRRR