; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013786 (gene) of Snake gourd v1 genome

Gene IDTan0013786
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:52251682..52254342
RNA-Seq ExpressionTan0013786
SyntenyTan0013786
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054839.1 gag/pol protein [Cucumis melo var. makuwa]9.0e-7952.35Show/hide
Query:  KPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSP
        K ++ +P    EDPLTY+QAM DVD D+WIK+MD EMESMY NSVW LV+  + VKPIGCKWIYKRKR   GK QTFKARLVA G+TQ EG+DYEETFSP
Subjt:  KPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSP

Query:  VAM----------------------------------------GI-------------VGDIGPIRVRSSGG----------------------------
        VAM                                        GI             V D+  I   S+ G                            
Subjt:  VAM----------------------------------------GI-------------VGDIGPIRVRSSGG----------------------------

Query:  ------VKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSGK---------FMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKH
              VK ILKYL+R ++Y LVYGS DLILTGYTDSDFQTDK  RK T G          F+ DLEVVPNM LPITL+CDNSGA ANSREPRSHKRGKH
Subjt:  ------VKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSGK---------FMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKH

Query:  IERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV
        IERKYHLIREIVHRGDV +T+I+SE+N+ +PFTK L AKV
Subjt:  IERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV

KAA0062799.1 gag/pol protein [Cucumis melo var. makuwa]9.3e-8461.72Show/hide
Query:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-GI-----
        EDPLT+ +A+ DVDKDEWIKAM+ E+ESMYFNSVW+LVD LDGVKPIGCKW YKRKRG DGK+QTFKA L+AKG+TQVEGVDYEETFSPVAM G+     
Subjt:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-GI-----

Query:  --------VGDIGPIRVRSSGGVKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG---------------------------------
                V ++  I   S+ G       L R R+Y LVYGS DLILT Y DSDFQTD+ SRKSTSG                                 
Subjt:  --------VGDIGPIRVRSSGGVKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG---------------------------------

Query:  ----------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV
                  KF+IDLEVVPNM  PITL+CDNSGAV NSREPRSHKRGKHIERKYHLIREI HRGDV VTQIA   NV DPFTKPLTAKV
Subjt:  ----------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV

KAA0067450.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]9.0e-7945.43Show/hide
Query:  SARVADGASTSTSVVDPSTSSQI-CSQVLGMPRVVGG-CETTDRYMGLNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVD
        S RV +    S+ V + +TS Q   SQ L MPR  G      +RY+GL  + ++ +P     DPL+Y QAM DVDKD+W+K++D +MESMYFNSVWELVD
Subjt:  SARVADGASTSTSVVDPSTSSQI-CSQVLGMPRVVGG-CETTDRYMGLNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVD

Query:  HLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-----------------------------------------------
          +GVKPIGCKWIYKRKR   GKVQT K +LVAKG+TQ EGVD+EETFSPVAM                                               
Subjt:  HLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-----------------------------------------------

Query:  ----------------------------------------------GIVGDIGPI-RVRSSGGV----KAILKYLKRMRNYSLVYGSGDLILTGYTDSDF
                                                       I   +G + R +S+ GV    K ILKYL+R R+Y LVYG+ DLILTGYTDSDF
Subjt:  ----------------------------------------------GIVGDIGPI-RVRSSGGV----KAILKYLKRMRNYSLVYGSGDLILTGYTDSDF

Query:  QTDKHSRKSTSG-------------------------------------------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKY
        QTDK SRKS SG                                           KF+ D EVVPNMNLPITL+CDNSGAVANS+EPRSHKRGKHIERKY
Subjt:  QTDKHSRKSTSG-------------------------------------------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKY

Query:  HLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKVEVV
        HLIREIV +GDV VT+IASE+N+ DPFTK LTAKV  V
Subjt:  HLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKVEVV

TYJ96755.1 gag/pol protein [Cucumis melo var. makuwa]9.3e-8461.72Show/hide
Query:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-GI-----
        EDPLT+ +A+ DVDKDEWIKAM+ E+ESMYFNSVW+LVD LDGVKPIGCKW YKRKRG DGK+QTFKA L+AKG+TQVEGVDYEETFSPVAM G+     
Subjt:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-GI-----

Query:  --------VGDIGPIRVRSSGGVKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG---------------------------------
                V ++  I   S+ G       L R R+Y LVYGS DLILT Y DSDFQTD+ SRKSTSG                                 
Subjt:  --------VGDIGPIRVRSSGGVKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG---------------------------------

Query:  ----------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV
                  KF+IDLEVVPNM  PITL+CDNSGAV NSREPRSHKRGKHIERKYHLIREI HRGDV VTQIA   NV DPFTKPLTAKV
Subjt:  ----------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-7646Show/hide
Query:  SARVADGASTSTSVVDPSTSSQI-CSQVLGMPRVVGG-CETTDRYMGLNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVD
        S RV D    S+ V + +TS Q   SQ L MPR  G      +RY+GL  + ++ +P    EDPL+Y QAM DVDKD+W+KAMD EMESMYFNSVWELVD
Subjt:  SARVADGASTSTSVVDPSTSSQI-CSQVLGMPRVVGG-CETTDRYMGLNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVD

Query:  HLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-----------------------------------------------
          +GVKPIGCKWIYKRKR   GKVQTFKARLVAKG+TQ EGVDYEETFSPVAM                                               
Subjt:  HLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-----------------------------------------------

Query:  ---------------------------GI-------------VGDIGPIRVRSSGG-------------VKAILKYLKRMRNYSLVYGSGDLILTGYTDS
                                   G+             V D+  I   S+ G               A+    +R R Y LVYG  DLIL  YTDS
Subjt:  ---------------------------GI-------------VGDIGPIRVRSSGG-------------VKAILKYLKRMRNYSLVYGSGDLILTGYTDS

Query:  DFQTDKHSRKSTSG-------------------------------------------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIER
        DFQTDK SRKS SG                                           KF+ DLEVVPNMNLPITL+CDNSGAVANS+EPRSHKRGKHIER
Subjt:  DFQTDKHSRKSTSG-------------------------------------------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIER

Query:  KYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV
        KYHLIREIV RGDV VT+I SE N+ DPFTK LTAKV
Subjt:  KYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV

TrEMBL top hitse value%identityAlignment
A0A5A7SRW5 Gag/pol protein1.4e-7248.08Show/hide
Query:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM--------
        EDPLTY QAM DVD D+ IKAMD EMESMY NSVW LVD    V+ IGCKWIYKRKR   GKVQTFKARLVAKG+TQ EG+DYEETFSPVAM        
Subjt:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM--------

Query:  -------------------------------------------------------------------------GIVGDIGPI-RVRSSGG------VKAI
                                                                                  I   +G + R +S+ G      VK I
Subjt:  -------------------------------------------------------------------------GIVGDIGPI-RVRSSGG------VKAI

Query:  LKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG-------------------------------------------KFMIDLEVVPNMNLPI
        LKYL++ ++Y LVYGS DLILTG+TDSDFQ DK +RKSTSG                                           KF+ DLEVVPNM+L I
Subjt:  LKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG-------------------------------------------KFMIDLEVVPNMNLPI

Query:  TLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV
        TL+CDNSGAVANSREPRSHKRGKHIERKYHLIREI+H+GDV VT+I+SE+N+ DPFTK L AKV
Subjt:  TLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV

A0A5A7VB15 Gag/pol protein4.5e-8461.72Show/hide
Query:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-GI-----
        EDPLT+ +A+ DVDKDEWIKAM+ E+ESMYFNSVW+LVD LDGVKPIGCKW YKRKRG DGK+QTFKA L+AKG+TQVEGVDYEETFSPVAM G+     
Subjt:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-GI-----

Query:  --------VGDIGPIRVRSSGGVKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG---------------------------------
                V ++  I   S+ G       L R R+Y LVYGS DLILT Y DSDFQTD+ SRKSTSG                                 
Subjt:  --------VGDIGPIRVRSSGGVKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG---------------------------------

Query:  ----------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV
                  KF+IDLEVVPNM  PITL+CDNSGAV NSREPRSHKRGKHIERKYHLIREI HRGDV VTQIA   NV DPFTKPLTAKV
Subjt:  ----------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV

A0A5A7VJV4 Retrovirus-related pol polyprotein from transposon tnt 1-944.3e-7945.43Show/hide
Query:  SARVADGASTSTSVVDPSTSSQI-CSQVLGMPRVVGG-CETTDRYMGLNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVD
        S RV +    S+ V + +TS Q   SQ L MPR  G      +RY+GL  + ++ +P     DPL+Y QAM DVDKD+W+K++D +MESMYFNSVWELVD
Subjt:  SARVADGASTSTSVVDPSTSSQI-CSQVLGMPRVVGG-CETTDRYMGLNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVD

Query:  HLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-----------------------------------------------
          +GVKPIGCKWIYKRKR   GKVQT K +LVAKG+TQ EGVD+EETFSPVAM                                               
Subjt:  HLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-----------------------------------------------

Query:  ----------------------------------------------GIVGDIGPI-RVRSSGGV----KAILKYLKRMRNYSLVYGSGDLILTGYTDSDF
                                                       I   +G + R +S+ GV    K ILKYL+R R+Y LVYG+ DLILTGYTDSDF
Subjt:  ----------------------------------------------GIVGDIGPI-RVRSSGGV----KAILKYLKRMRNYSLVYGSGDLILTGYTDSDF

Query:  QTDKHSRKSTSG-------------------------------------------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKY
        QTDK SRKS SG                                           KF+ D EVVPNMNLPITL+CDNSGAVANS+EPRSHKRGKHIERKY
Subjt:  QTDKHSRKSTSG-------------------------------------------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKY

Query:  HLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKVEVV
        HLIREIV +GDV VT+IASE+N+ DPFTK LTAKV  V
Subjt:  HLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKVEVV

A0A5D3BE74 Gag/pol protein4.5e-8461.72Show/hide
Query:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-GI-----
        EDPLT+ +A+ DVDKDEWIKAM+ E+ESMYFNSVW+LVD LDGVKPIGCKW YKRKRG DGK+QTFKA L+AKG+TQVEGVDYEETFSPVAM G+     
Subjt:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAM-GI-----

Query:  --------VGDIGPIRVRSSGGVKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG---------------------------------
                V ++  I   S+ G       L R R+Y LVYGS DLILT Y DSDFQTD+ SRKSTSG                                 
Subjt:  --------VGDIGPIRVRSSGGVKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSG---------------------------------

Query:  ----------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV
                  KF+IDLEVVPNM  PITL+CDNSGAV NSREPRSHKRGKHIERKYHLIREI HRGDV VTQIA   NV DPFTKPLTAKV
Subjt:  ----------KFMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV

A0A5D3DWF1 Gag/pol protein4.3e-7952.35Show/hide
Query:  KPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSP
        K ++ +P    EDPLTY+QAM DVD D+WIK+MD EMESMY NSVW LV+  + VKPIGCKWIYKRKR   GK QTFKARLVA G+TQ EG+DYEETFSP
Subjt:  KPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSP

Query:  VAM----------------------------------------GI-------------VGDIGPIRVRSSGG----------------------------
        VAM                                        GI             V D+  I   S+ G                            
Subjt:  VAM----------------------------------------GI-------------VGDIGPIRVRSSGG----------------------------

Query:  ------VKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSGK---------FMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKH
              VK ILKYL+R ++Y LVYGS DLILTGYTDSDFQTDK  RK T G          F+ DLEVVPNM LPITL+CDNSGA ANSREPRSHKRGKH
Subjt:  ------VKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSGK---------FMIDLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKH

Query:  IERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV
        IERKYHLIREIVHRGDV +T+I+SE+N+ +PFTK L AKV
Subjt:  IERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKV

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-1034.83Show/hide
Query:  PLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVA
        P ++D+     DK  W +A++ E+ +   N+ W +    +    +  +W++  K    G    +KARLVA+GFTQ   +DYEETF+PVA
Subjt:  PLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVA

P04146 Copia protein4.6e-0928.57Show/hide
Query:  VKAILKYLKRMRNYSLVYGSG---DLILTGYTDSDFQTDKHSRKSTSG----------------------------KFMIDLEVVP------------NM
        +K +L+YLK   +  L++      +  + GY DSD+   +  RKST+G                            ++M   E V             N+
Subjt:  VKAILKYLKRMRNYSLVYGSG---DLILTGYTDSDFQTDKHSRKSTSG----------------------------KFMIDLEVVP------------NM

Query:  NL--PITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTA
         L  PI ++ DN G ++ +  P  HKR KHI+ KYH  RE V    + +  I +E  + D FTKPL A
Subjt:  NL--PITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.3e-1541.57Show/hide
Query:  DPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV
        +P +  + +   +K++ +KAM +EMES+  N  ++LV+   G +P+ CKW++K K+  D K+  +KARLV KGF Q +G+D++E FSPV
Subjt:  DPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-0829.56Show/hide
Query:  VKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKST------------------------------------SGKFMIDLE-VVPNMNL---P
        VK IL+YL+      L +G  D IL GYTD+D   D  +RKS+                                    +GK MI L+  +  + L    
Subjt:  VKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKST------------------------------------SGKFMIDLE-VVPNMNL---P

Query:  ITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTK
          ++CD+  A+  S+    H R KHI+ +YH IRE+V    + V +I++  N  D  TK
Subjt:  ITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTK

P92520 Uncharacterized mitochondrial protein AtMg008201.2e-1239.42Show/hide
Query:  LNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEET
        LN K  L++   I ++P +   A+ D     W +AM +E++++  N  W LV        +GCKW++K K   DG +   KARLVAKGF Q EG+ + ET
Subjt:  LNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEET

Query:  FSPV
        +SPV
Subjt:  FSPV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.6e-1233.33Show/hide
Query:  STSARVADGASTSTSVV--DPSTSSQICSQVLGMPRVVGGCETTDRYMGLNLKPRLSLPMMIC--EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVW
        +TSA  +  + T  S++   P   +QI +     P       T  +   +   P+ SL + +    +P T  QA+ D   + W  AM  E+ +   N  W
Subjt:  STSARVADGASTSTSVV--DPSTSSQICSQVLGMPRVVGGCETTDRYMGLNLKPRLSLPMMIC--EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVW

Query:  ELVDHLDG-VKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV
        +LV      V  +GC+WI+ +K   DG +  +KARLVAKG+ Q  G+DY ETFSPV
Subjt:  ELVDHLDG-VKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.0e-1336.3Show/hide
Query:  ASTSTSVVDPSTSSQICSQVLGMPRVVGGCETTDRYMGL---NLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELV-DHLDGV
        +STST  + P   +    QV     V      T    G+   N K   +  +    +P T  QAM D   D W +AM  E+ +   N  W+LV      V
Subjt:  ASTSTSVVDPSTSSQICSQVLGMPRVVGGCETTDRYMGL---NLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELV-DHLDGV

Query:  KPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV
          +GC+WI+ +K   DG +  +KARLVAKG+ Q  G+DY ETFSPV
Subjt:  KPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.5e-1847.78Show/hide
Query:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV
        ++P TY++A   +    W  AMD E+ +M     WE+       KPIGCKW+YK K   DG ++ +KARLVAKG+TQ EG+D+ ETFSPV
Subjt:  EDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.2e-1439.42Show/hide
Query:  LNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEET
        LN K  L++   I ++P +   A+ D     W +AM +E++++  N  W LV        +GCKW++K K   DG +   KARLVAKGF Q EG+ + ET
Subjt:  LNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEET

Query:  FSPV
        +SPV
Subjt:  FSPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGTACATCAGCAAGAGTTGCTGATGGGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCACGTCTAGTCAAATCTGTTCCCAAGTGTTGGGAATGCCTCGCGT
AGTGGGAGGTTGTGAGACGACTGATCGCTACATGGGTTTAAATTTGAAACCTCGGTTGTCGCTCCCGATGATGATCTGTGAGGATCCATTGACCTATGATCAGGCAATGG
TTGATGTTGACAAAGACGAGTGGATTAAAGCTATGGACCAGGAAATGGAGTCTATGTACTTCAATTCTGTATGGGAGCTTGTGGATCATCTAGATGGGGTAAAACCTATT
GGTTGCAAATGGATCTACAAGCGTAAACGTGGCGTAGATGGGAAGGTGCAGACCTTCAAAGCCCGACTAGTGGCAAAGGGTTTTACCCAGGTTGAAGGGGTCGACTATGA
GGAGACCTTTTCACCTGTTGCCATGGGAATTGTCGGTGATATCGGTCCAATCCGAGTTAGATCATCGGGCGGTGTAAAGGCAATCCTCAAGTATCTTAAGAGAATGAGGA
ACTACAGCTTAGTGTATGGTAGTGGGGATTTGATCCTTACGGGATACACGGATTCTGATTTTCAGACCGATAAGCATTCTAGAAAATCCACCTCGGGGAAGTTCATGATA
GATTTGGAAGTTGTTCCAAATATGAACTTGCCAATCACACTGTTCTGTGATAACAGTGGTGCAGTAGCCAACTCACGTGAGCCTCGGAGCCATAAGAGAGGCAAACACAT
TGAGCGCAAGTATCATTTGATACGGGAGATTGTGCACCGCGGAGACGTGACGGTCACGCAGATAGCCTCGGAGCGCAACGTAGTTGATCCATTTACAAAGCCCCTCACGG
CTAAGGTCGAGGTGGTCACGATTACTCGCCCAATATGCCTATCTTCTCGGGACGAGAACCAAGTAAGGAATTGGGAACTTAACTACGAGATGGAATTCACTCGTTCCACT
TTAGGGAAACTAGAGGGTTGTTCCCTTGAGTGTTGTCTCCGTGGCTTGGACATGGCGCGCCACCTTCTCACCGGCCGAGAAGGTGTTGTGTATGGTTGGACCATCATAAT
ATGTTGTTCATTAGAGGAGCAGTGGGGACTTAAGGAACAAGAGGTAGACATAGGGGGTAAAAGCGGTAATTTGACCCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACAGTACATCAGCAAGAGTTGCTGATGGGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCACGTCTAGTCAAATCTGTTCCCAAGTGTTGGGAATGCCTCGCGT
AGTGGGAGGTTGTGAGACGACTGATCGCTACATGGGTTTAAATTTGAAACCTCGGTTGTCGCTCCCGATGATGATCTGTGAGGATCCATTGACCTATGATCAGGCAATGG
TTGATGTTGACAAAGACGAGTGGATTAAAGCTATGGACCAGGAAATGGAGTCTATGTACTTCAATTCTGTATGGGAGCTTGTGGATCATCTAGATGGGGTAAAACCTATT
GGTTGCAAATGGATCTACAAGCGTAAACGTGGCGTAGATGGGAAGGTGCAGACCTTCAAAGCCCGACTAGTGGCAAAGGGTTTTACCCAGGTTGAAGGGGTCGACTATGA
GGAGACCTTTTCACCTGTTGCCATGGGAATTGTCGGTGATATCGGTCCAATCCGAGTTAGATCATCGGGCGGTGTAAAGGCAATCCTCAAGTATCTTAAGAGAATGAGGA
ACTACAGCTTAGTGTATGGTAGTGGGGATTTGATCCTTACGGGATACACGGATTCTGATTTTCAGACCGATAAGCATTCTAGAAAATCCACCTCGGGGAAGTTCATGATA
GATTTGGAAGTTGTTCCAAATATGAACTTGCCAATCACACTGTTCTGTGATAACAGTGGTGCAGTAGCCAACTCACGTGAGCCTCGGAGCCATAAGAGAGGCAAACACAT
TGAGCGCAAGTATCATTTGATACGGGAGATTGTGCACCGCGGAGACGTGACGGTCACGCAGATAGCCTCGGAGCGCAACGTAGTTGATCCATTTACAAAGCCCCTCACGG
CTAAGGTCGAGGTGGTCACGATTACTCGCCCAATATGCCTATCTTCTCGGGACGAGAACCAAGTAAGGAATTGGGAACTTAACTACGAGATGGAATTCACTCGTTCCACT
TTAGGGAAACTAGAGGGTTGTTCCCTTGAGTGTTGTCTCCGTGGCTTGGACATGGCGCGCCACCTTCTCACCGGCCGAGAAGGTGTTGTGTATGGTTGGACCATCATAAT
ATGTTGTTCATTAGAGGAGCAGTGGGGACTTAAGGAACAAGAGGTAGACATAGGGGGTAAAAGCGGTAATTTGACCCAGTAG
Protein sequenceShow/hide protein sequence
MDSTSARVADGASTSTSVVDPSTSSQICSQVLGMPRVVGGCETTDRYMGLNLKPRLSLPMMICEDPLTYDQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDHLDGVKPI
GCKWIYKRKRGVDGKVQTFKARLVAKGFTQVEGVDYEETFSPVAMGIVGDIGPIRVRSSGGVKAILKYLKRMRNYSLVYGSGDLILTGYTDSDFQTDKHSRKSTSGKFMI
DLEVVPNMNLPITLFCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVTVTQIASERNVVDPFTKPLTAKVEVVTITRPICLSSRDENQVRNWELNYEMEFTRST
LGKLEGCSLECCLRGLDMARHLLTGREGVVYGWTIIICCSLEEQWGLKEQEVDIGGKSGNLTQ