CuGenDBv2

Lag0021036 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021036
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr7:4136436..4142197
RNA-Seq Expression: Lag0021036
Synteny: Lag0021036
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
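
The genome location above is a `chromosome:start..end` string. As a minimal sketch of how such a field might be read programmatically, the Python below splits it into its parts and computes the locus span. The function names `parse_location` and `span_length` are hypothetical, and the span assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr7:4136436..4142197")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 5762 bp on chr7.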


Homology
GenBank top hits: e value, % identity, alignment
KAA0052371.1 putative mitochondrial protein [Cucumis melo var. makuwa] (e value: 4.2e-123; % identity: 67.67)
Query:  CCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDV
        CCY L+YVDD+II GSS   +  L+  LN +FALKDLG LSYFLG+EVSYP++G +FLSQSKYI DLL +T M  AKP++ PM+SG   SA  GE   DV
Subjt:  CCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDV

Query:  RFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWG
           RSVVGALQYAT T PEISYSVNKACQFMH P  THWQLVKRILRYLKG + HGL LS     SL GFADADWASDPDDRKSTSG C++FG NL+SWG
Subjt:  RFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWG

Query:  SKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADIL
        SKKQSIISRSSTEAEY CLA  A E++W+ SL  DL + L   P L CDNLSAVH SANP+LHS+TKHVE+DIYFVRDL+ +++L ++HLPAT+Q+ADIL
Subjt:  SKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADIL

Query:  TKPLSASSFLRLTSKLNVRDPLTIGLRGVLR
        TKPLSA SF +L + + V D   IGL+GVLR
Subjt:  TKPLSASSFLRLTSKLNVRDPLTIGLRGVLR

PNY12955.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] (e value: 3.1e-102; % identity: 59.88)
Query:  QGECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEAL
        QG C Y+L+YVDDI+ITGS+   I  LI  LN +FALK LG + YFLGIEV +  SGG+ L+QSKYI DLLC+TNM+  KP+  PM+S    S    +A+
Subjt:  QGECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEAL

Query:  SDVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRC---LSLQGFADADWASDPDDRKSTSGFCIFFGG
        SD   YRS VGALQYAT T P+I++SVNK CQFM  P  THW+ VKRILRYLKGT +HGLLL+        SL+ ++DADWA+D DDR+STSG CI+FG 
Subjt:  SDVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRC---LSLQGFADADWASDPDDRKSTSGFCIFFGG

Query:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATD
        NLISWGSKKQ +++RSSTEAEY  +A   A+L+W+ SL  +L VS    PTL CDNLSAV L+ NPVLHSRTKH+E+DI+FVR+ VL K+L + H+PATD
Subjt:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATD

Query:  QVADILTKPLSASSFLRLTSKLNV
        Q+AD LTKPLS S++  + +KL V
Subjt:  QVADILTKPLSASSFLRLTSKLNV

PNY12957.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] (e value: 3.1e-102; % identity: 59.88)
Query:  QGECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEAL
        QG C Y+L+YVDDI+ITGS+   I  LI  LN +FALK LG + YFLGIEV +  SGG+ L+QSKYI DLLC+TNM+  KP+  PM+S    S    +A+
Subjt:  QGECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEAL

Query:  SDVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRC---LSLQGFADADWASDPDDRKSTSGFCIFFGG
        SD   YRS VGALQYAT T P+I++SVNK CQFM  P  THW+ VKRILRYLKGT +HGLLL+        SL+ ++DADWA+D DDR+STSG CI+FG 
Subjt:  SDVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRC---LSLQGFADADWASDPDDRKSTSGFCIFFGG

Query:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATD
        NLISWGSKKQ +++RSSTEAEY  +A   A+L+W+ SL  +L VS    PTL CDNLSAV L+ NPVLHSRTKH+E+DI+FVR+ VL K+L + H+PATD
Subjt:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATD

Query:  QVADILTKPLSASSFLRLTSKLNV
        Q+AD LTKPLS S++  + +KL V
Subjt:  QVADILTKPLSASSFLRLTSKLNV

XP_016902754.1 PREDICTED: uncharacterized mitochondrial protein AtMg00810-like [Cucumis melo] (e value: 6.3e-103; % identity: 62.94)
Query:  SSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFYRSVVGALQYATF
        SS   +  L+  LN +FALKDLG LSYFLG+EVSYP++GG+FLSQSKYI DLL +T M  AKP++ PM                            YAT 
Subjt:  SSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFYRSVVGALQYATF

Query:  TPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIISRSSTEAE
        T PEISYSVNKACQFMH P  THWQLVKRILRYLKG + HGL       +SL GFADADWASDPDDRKSTSGF ++FG NL+SWGSKK SIISRSSTEAE
Subjt:  TPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIISRSSTEAE

Query:  YTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTKPLSASSFLRLTSK
        Y CLA  A EL+W+ SL  DL + L   P LWCDNLSAVHLSANP+LHS+TKHVE+DIYFVRDL+ + +L ++HLPAT+Q+ADILTKPLSA SF  L +K
Subjt:  YTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTKPLSASSFLRLTSK

Query:  LNVRDPLTIGLRG
        L V D   IGL+G
Subjt:  LNVRDPLTIGLRG

XP_016903397.1 PREDICTED: uncharacterized mitochondrial protein AtMg00810-like [Cucumis melo] (e value: 5.2e-113; % identity: 65.93)
Query:  GSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFYRSVVGALQYAT
        G+S   +  L+  LN +FALKDLG LSYFL +EVSYP++GG+FLSQSKYI DLL +T M  AKP++  M+SG   SA  GE   DV  YRS+VGALQYAT
Subjt:  GSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFYRSVVGALQYAT

Query:  FTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIISRSSTEA
         T PEISYSVNKACQF+H P  THWQLVK+ILRYLKG + H L LS    +SL GF DADWASDPDDRKSTSGFC++FG NL+SWGSKKQSII RSST+A
Subjt:  FTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIISRSSTEA

Query:  EYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTKPLSASSFLRLTS
        EY CL   A EL+W+ SL  DL + L   P LWCDNLSAVHLSAN +LHS TKHVE+DIYFVRDL+ + +L I+HL AT+Q+ADILTKPLSA S  +L +
Subjt:  EYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTKPLSASSFLRLTS

Query:  KLNVRDPLTIGLRGVLR
        KL V D  +IGL+GVLR
Subjt:  KLNVRDPLTIGLRGVLR

TrEMBL top hits: e value, % identity, alignment
A0A1S4E3F1 uncharacterized mitochondrial protein AtMg00810-like (e value: 3.1e-103; % identity: 62.94)
Query:  SSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFYRSVVGALQYATF
        SS   +  L+  LN +FALKDLG LSYFLG+EVSYP++GG+FLSQSKYI DLL +T M  AKP++ PM                            YAT 
Subjt:  SSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFYRSVVGALQYATF

Query:  TPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIISRSSTEAE
        T PEISYSVNKACQFMH P  THWQLVKRILRYLKG + HGL       +SL GFADADWASDPDDRKSTSGF ++FG NL+SWGSKK SIISRSSTEAE
Subjt:  TPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIISRSSTEAE

Query:  YTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTKPLSASSFLRLTSK
        Y CLA  A EL+W+ SL  DL + L   P LWCDNLSAVHLSANP+LHS+TKHVE+DIYFVRDL+ + +L ++HLPAT+Q+ADILTKPLSA SF  L +K
Subjt:  YTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTKPLSASSFLRLTSK

Query:  LNVRDPLTIGLRG
        L V D   IGL+G
Subjt:  LNVRDPLTIGLRG

A0A1S4E598 uncharacterized mitochondrial protein AtMg00810-like (e value: 2.5e-113; % identity: 65.93)
Query:  GSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFYRSVVGALQYAT
        G+S   +  L+  LN +FALKDLG LSYFL +EVSYP++GG+FLSQSKYI DLL +T M  AKP++  M+SG   SA  GE   DV  YRS+VGALQYAT
Subjt:  GSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFYRSVVGALQYAT

Query:  FTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIISRSSTEA
         T PEISYSVNKACQF+H P  THWQLVK+ILRYLKG + H L LS    +SL GF DADWASDPDDRKSTSGFC++FG NL+SWGSKKQSII RSST+A
Subjt:  FTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIISRSSTEA

Query:  EYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTKPLSASSFLRLTS
        EY CL   A EL+W+ SL  DL + L   P LWCDNLSAVHLSAN +LHS TKHVE+DIYFVRDL+ + +L I+HL AT+Q+ADILTKPLSA S  +L +
Subjt:  EYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTKPLSASSFLRLTS

Query:  KLNVRDPLTIGLRGVLR
        KL V D  +IGL+GVLR
Subjt:  KLNVRDPLTIGLRGVLR

A0A2K3PCD7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.5e-102; % identity: 59.88)
Query:  QGECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEAL
        QG C Y+L+YVDDI+ITGS+   I  LI  LN +FALK LG + YFLGIEV +  SGG+ L+QSKYI DLLC+TNM+  KP+  PM+S    S    +A+
Subjt:  QGECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEAL

Query:  SDVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRC---LSLQGFADADWASDPDDRKSTSGFCIFFGG
        SD   YRS VGALQYAT T P+I++SVNK CQFM  P  THW+ VKRILRYLKGT +HGLLL+        SL+ ++DADWA+D DDR+STSG CI+FG 
Subjt:  SDVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRC---LSLQGFADADWASDPDDRKSTSGFCIFFGG

Query:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATD
        NLISWGSKKQ +++RSSTEAEY  +A   A+L+W+ SL  +L VS    PTL CDNLSAV L+ NPVLHSRTKH+E+DI+FVR+ VL K+L + H+PATD
Subjt:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATD

Query:  QVADILTKPLSASSFLRLTSKLNV
        Q+AD LTKPLS S++  + +KL V
Subjt:  QVADILTKPLSASSFLRLTSKLNV

A0A2K3PCG9 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.5e-102; % identity: 59.88)
Query:  QGECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEAL
        QG C Y+L+YVDDI+ITGS+   I  LI  LN +FALK LG + YFLGIEV +  SGG+ L+QSKYI DLLC+TNM+  KP+  PM+S    S    +A+
Subjt:  QGECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEAL

Query:  SDVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRC---LSLQGFADADWASDPDDRKSTSGFCIFFGG
        SD   YRS VGALQYAT T P+I++SVNK CQFM  P  THW+ VKRILRYLKGT +HGLLL+        SL+ ++DADWA+D DDR+STSG CI+FG 
Subjt:  SDVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRC---LSLQGFADADWASDPDDRKSTSGFCIFFGG

Query:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATD
        NLISWGSKKQ +++RSSTEAEY  +A   A+L+W+ SL  +L VS    PTL CDNLSAV L+ NPVLHSRTKH+E+DI+FVR+ VL K+L + H+PATD
Subjt:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATD

Query:  QVADILTKPLSASSFLRLTSKLNV
        Q+AD LTKPLS S++  + +KL V
Subjt:  QVADILTKPLSASSFLRLTSKLNV

A0A5A7UFS3 Putative mitochondrial protein (e value: 2.0e-123; % identity: 67.67)
Query:  CCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDV
        CCY L+YVDD+II GSS   +  L+  LN +FALKDLG LSYFLG+EVSYP++G +FLSQSKYI DLL +T M  AKP++ PM+SG   SA  GE   DV
Subjt:  CCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDV

Query:  RFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWG
           RSVVGALQYAT T PEISYSVNKACQFMH P  THWQLVKRILRYLKG + HGL LS     SL GFADADWASDPDDRKSTSG C++FG NL+SWG
Subjt:  RFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWG

Query:  SKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADIL
        SKKQSIISRSSTEAEY CLA  A E++W+ SL  DL + L   P L CDNLSAVH SANP+LHS+TKHVE+DIYFVRDL+ +++L ++HLPAT+Q+ADIL
Subjt:  SKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADIL

Query:  TKPLSASSFLRLTSKLNVRDPLTIGLRGVLR
        TKPLSA SF +L + + V D   IGL+GVLR
Subjt:  TKPLSASSFLRLTSKLNVRDPLTIGLRGVLR

SwissProt top hits: e value, % identity, alignment
P04146 Copia protein (e value: 6.7e-47; % identity: 34.25)
Query:  ECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSD
        E  Y+L+YVDD++I       +      L  KF + DL  + +F+GI +       I+LSQS Y+  +L K NM+    ++ P+     PS  + E L+ 
Subjt:  ECCYILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSD

Query:  VRF----YRSVVGALQYATF-TPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCL--SLQGFADADWASDPDDRKSTSGFCI-F
                RS++G L Y    T P+++ +VN   ++     S  WQ +KR+LRYLKGTI   L+          + G+ D+DWA    DRKST+G+    
Subjt:  VRF----YRSVVGALQYATF-TPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCL--SLQGFADADWASDPDDRKSTSGFCI-F

Query:  FGGNLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLP
        F  NLI W +K+Q+ ++ SSTEAEY  L  A  E +WL  L   + + L     ++ DN   + ++ NP  H R KH++I  +F R+ V    + ++++P
Subjt:  FGGNLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLP

Query:  ATDQVADILTKPLSASSFLRLTSKLNV
          +Q+ADI TKPL A+ F+ L  KL +
Subjt:  ATDQVADILTKPLSASSFLRLTSKLNV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.1e-47; % identity: 36.71)
Query:  ILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIE-VSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSG-------SPPSARHGE
        +L+YVDD++I G     I KL  DL+  F +KDLG     LG++ V   +S  ++LSQ KYI  +L + NM  AKP++ P+           P +     
Subjt:  ILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIE-VSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSG-------SPPSARHGE

Query:  ALSDVRFYRSVVGALQYA-TFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGG
         ++ V  Y S VG+L YA   T P+I+++V    +F+  P   HW+ VK ILRYL+GT    L       + L+G+ DAD A D D+RKS++G+   F G
Subjt:  ALSDVRFYRSVVGALQYA-TFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGG

Query:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKP-TLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPAT
          ISW SK Q  ++ S+TEAEY        E+IWL     +L   L QK   ++CD+ SA+ LS N + H+RTKH+++  +++R++V  + L +  +   
Subjt:  NLISWGSKKQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKP-TLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPAT

Query:  DQVADILTKPLSASSF
        +  AD+LTK +  + F
Subjt:  DQVADILTKPLSASSF

P92519 Uncharacterized mitochondrial protein AtMg00810 (e value: 3.0e-55; % identity: 49.13)
Query:  YILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEV-SYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPM---MSGSPPSARHGEALS
        Y+L+YVDDI++TGSS+  +  LI  L+  F++KDLG + YFLGI++ ++PS  G+FLSQ+KY   +L    M   KPM+ P+   ++ S  +A++     
Subjt:  YILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEV-SYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPM---MSGSPPSARHGEALS

Query:  DVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLIS
        D   +RS+VGALQY T T P+ISY+VN  CQ MH PT   + L+KR+LRY+KGTI HGL +     L++Q F D+DWA     R+ST+GFC F G N+IS
Subjt:  DVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLIS

Query:  WGSKKQSIISRSSTEAEYTCLATAAAELIW
        W +K+Q  +SRSSTE EY  LA  AAEL W
Subjt:  WGSKKQSIISRSSTEAEYTCLATAAAELIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.3e-78; % identity: 47.78)
Query:  YILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRF
        Y+LVYVDDI+ITG+    +   + +L+ +F++KD   L YFLGIE     + G+ LSQ +YI DLL +TNM  AKP+  PM      S   G  L+D   
Subjt:  YILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRF

Query:  YRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSK
        YR +VG+LQY  FT P+ISY+VN+  QFMH PT  H Q +KRILRYL GT +HG+ L     LSL  ++DADWA D DD  ST+G+ ++ G + ISW SK
Subjt:  YRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSK

Query:  KQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTK
        KQ  + RSSTEAEY  +A  ++E+ W+ SL  +L + L + P ++CDN+ A +L ANPV HSR KH+ ID +F+R+ V    L + H+   DQ+AD LTK
Subjt:  KQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTK

Query:  PLSASSFLRLTSKLNV
        PLS ++F    SK+ V
Subjt:  PLSASSFLRLTSKLNV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 6.8e-76; % identity: 45.45)
Query:  YILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRF
        Y+LVYVDDI+ITG+ +  +   +  L+ +F++K+   L YFLGIE       G+ LSQ +Y  DLL +TNM  AKP+  PM +    +   G  L D   
Subjt:  YILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRF

Query:  YRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSK
        YR +VG+LQY  FT P++SY+VN+  Q+MH PT  HW  +KR+LRYL GT  HG+ L     LSL  ++DADWA D DD  ST+G+ ++ G + ISW SK
Subjt:  YRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSK

Query:  KQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTK
        KQ  + RSSTEAEY  +A  ++EL W+ SL  +L + L   P ++CDN+ A +L ANPV HSR KH+ +D +F+R+ V    L + H+   DQ+AD LTK
Subjt:  KQSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTK

Query:  PLSASSFLRLTSKLNV-RDPLTIGLRGVLR
        PLS  +F   + K+ V + P + G  GVLR
Subjt:  PLSASSFLRLTSKLNV-RDPLTIGLRGVLR

Arabidopsis top hits: e value, % identity, alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 7.5e-62; % identity: 42.81)
Query:  ILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFY
        +LVYVDDIII  ++   + +L + L   F L+DLG L YFLG+E++  S+ GI + Q KY  DLL +T + G KP ++PM      SA  G    D + Y
Subjt:  ILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFY

Query:  RSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKK
        R ++G L Y   T  +IS++VNK  QF  AP   H Q V +IL Y+KGT+  GL  S    + LQ F+DA + S  D R+ST+G+C+F G +LISW SKK
Subjt:  RSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKK

Query:  QSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQ
        Q ++S+SS EAEY  L+ A  E++WL   F +L + L +   L+CDN +A+H++ N V H RTKH+E D + VR+  + +  L     A D+
Subjt:  QSIISRSSTEAEYTCLATAAAELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQ

ATMG00240.1 Gag-Pol-related retrotransposon family protein (e value: 4.0e-15; % identity: 41.24)
Query:  YATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIIS
        Y T T P+++++VN+  QF  A  +   Q V ++L Y+KGT+  GL  S    L L+ FAD+DWAS PD R+S +GFC      L   G+ ++SI+S
Subjt:  YATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIIS

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 2.1e-56; % identity: 49.13)
Query:  YILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEV-SYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPM---MSGSPPSARHGEALS
        Y+L+YVDDI++TGSS+  +  LI  L+  F++KDLG + YFLGI++ ++PS  G+FLSQ+KY   +L    M   KPM+ P+   ++ S  +A++     
Subjt:  YILVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEV-SYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPM---MSGSPPSARHGEALS

Query:  DVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLIS
        D   +RS+VGALQY T T P+ISY+VN  CQ MH PT   + L+KR+LRY+KGTI HGL +     L++Q F D+DWA     R+ST+GFC F G N+IS
Subjt:  DVRFYRSVVGALQYATFTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLIS

Query:  WGSKKQSIISRSSTEAEYTCLATAAAELIW
        W +K+Q  +SRSSTE EY  LA  AAEL W
Subjt:  WGSKKQSIISRSSTEAEYTCLATAAAELIW


Sequences
CDS sequence
ATGTACAAGGAGCTAACGAGGACAACCGGGGAGAAATCGGATTGGGAGATGGACCCAAGAGGCGAAACCGGCAAGTGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTTG
GCCCGAGCCCGTCCGACTCTGCTTGGTCCCTACCGTCTTTGGGTGCCCCGGTTCCGTCTGGTTTGTCCCGAAGCACCTCCAAATTCCTAAAAACCCTAAGAGCATGAGCA
GGAAGTCGGAGGATCTCTCGAAAGTCAAGAATATTTTGGAAGCCATAGCAACTTTGGCTCCGAAGCTGGGTAGAACTTCGTTCAAGCACTGCAAATGTGAGGAGAATCGA
GTCGCTCACCTCCTTGCCAGCGAAGCAGTGTTCGATACCCACGATGTTTTTGGCTTCATTAGTGAGCTTTTGGACCCTGGGAAAATGGAAAATGACTCTATTTTTTGCCA
GAGCTCCCATGGAGGTTTGAGCTTTGGATCCCTCTACTCCATCACTAGCCTCAACGCCAATTCTGTTGAAATCCAGCAGAGTTTGGCTGTGATATCACAGAATTATATGA
AAAAACGATTTCAAAATGATGTAAATTGTACAACACTTGGTTCAGGAATTTCGTCGAACAGTTTGGATGCAACTACTTCCAATGAACCACAAGGTGAATGCTGCTATATA
CTAGTGTACGTAGACGATATTATTATTACTGGTAGCTCGTCTGATACTATTACCAAACTCATTGCTGATCTAAATGGTAAGTTTGCGCTAAAAGATCTAGGAACCTTAAG
CTACTTTCTTGGGATTGAGGTTTCCTATCCTTCCAGTGGAGGCATATTCTTGTCTCAGTCTAAATACATCAATGACCTGCTATGTAAAACAAATATGGATGGTGCTAAAC
CCATGAACATTCCTATGATGAGTGGTTCTCCTCCTTCTGCCAGACATGGTGAAGCGTTGTCTGATGTTCGATTTTATAGAAGTGTTGTGGGAGCACTTCAATATGCCACT
TTTACCCCGCCTGAAATTTCATATAGTGTGAATAAAGCATGTCAATTTATGCATGCTCCAACGTCAACTCATTGGCAACTTGTCAAGCGGATCCTTAGATATCTAAAAGG
AACAATCTCACATGGGTTACTTCTTTCTGTTCCTCGATGTCTATCTCTTCAGGGCTTTGCGGATGCTGACTGGGCTTCGGACCCAGATGATCGCAAATCGACATCAGGCT
TCTGTATTTTCTTTGGTGGCAACTTAATCTCATGGGGCTCCAAGAAGCAGTCGATAATCTCTCGCTCGAGCACTGAAGCAGAGTACACATGTTTGGCTACTGCAGCTGCT
GAGCTTATATGGTTGAATTCACTTTTTTGTGACTTATGTGTTTCCTTGTTGCAGAAACCCACTCTATGGTGTGACAATCTCAGTGCCGTGCATCTCAGCGCTAATCCAGT
GCTTCATTCACGGACCAAACATGTTGAAATCGACATCTACTTTGTACGTGATTTAGTTCTCCAAAAACGTCTGCTTATTCAACATCTTCCAGCTACTGATCAAGTGGCTG
ATATATTGACCAAACCACTGTCTGCTTCATCGTTTCTCCGACTAACCTCCAAGCTCAATGTTCGAGACCCACTGACCATTGGCTTGCGGGGGGTGTTAAGGCAGCTCATT
GAGCACGAGGGTTGCACGTGGTCCTTCAGTTACTTTATGATAAATCTATTTCAGCAATTTCTTAATTGGCACGATGAATCAGTAACGAGACAATCGAAGAAACCTAAATG
TTTATATTGGAGGCTAAGGATTTGGGAGACTAATTACAACTTACGATCAAGATTTGCAAACATTCCTTATCGTTATCGGAGCTCCGCCACGGCTGCGCTGCCTTGA
mRNA sequence
ATGTACAAGGAGCTAACGAGGACAACCGGGGAGAAATCGGATTGGGAGATGGACCCAAGAGGCGAAACCGGCAAGTGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTTG
GCCCGAGCCCGTCCGACTCTGCTTGGTCCCTACCGTCTTTGGGTGCCCCGGTTCCGTCTGGTTTGTCCCGAAGCACCTCCAAATTCCTAAAAACCCTAAGAGCATGAGCA
GGAAGTCGGAGGATCTCTCGAAAGTCAAGAATATTTTGGAAGCCATAGCAACTTTGGCTCCGAAGCTGGGTAGAACTTCGTTCAAGCACTGCAAATGTGAGGAGAATCGA
GTCGCTCACCTCCTTGCCAGCGAAGCAGTGTTCGATACCCACGATGTTTTTGGCTTCATTAGTGAGCTTTTGGACCCTGGGAAAATGGAAAATGACTCTATTTTTTGCCA
GAGCTCCCATGGAGGTTTGAGCTTTGGATCCCTCTACTCCATCACTAGCCTCAACGCCAATTCTGTTGAAATCCAGCAGAGTTTGGCTGTGATATCACAGAATTATATGA
AAAAACGATTTCAAAATGATGTAAATTGTACAACACTTGGTTCAGGAATTTCGTCGAACAGTTTGGATGCAACTACTTCCAATGAACCACAAGGTGAATGCTGCTATATA
CTAGTGTACGTAGACGATATTATTATTACTGGTAGCTCGTCTGATACTATTACCAAACTCATTGCTGATCTAAATGGTAAGTTTGCGCTAAAAGATCTAGGAACCTTAAG
CTACTTTCTTGGGATTGAGGTTTCCTATCCTTCCAGTGGAGGCATATTCTTGTCTCAGTCTAAATACATCAATGACCTGCTATGTAAAACAAATATGGATGGTGCTAAAC
CCATGAACATTCCTATGATGAGTGGTTCTCCTCCTTCTGCCAGACATGGTGAAGCGTTGTCTGATGTTCGATTTTATAGAAGTGTTGTGGGAGCACTTCAATATGCCACT
TTTACCCCGCCTGAAATTTCATATAGTGTGAATAAAGCATGTCAATTTATGCATGCTCCAACGTCAACTCATTGGCAACTTGTCAAGCGGATCCTTAGATATCTAAAAGG
AACAATCTCACATGGGTTACTTCTTTCTGTTCCTCGATGTCTATCTCTTCAGGGCTTTGCGGATGCTGACTGGGCTTCGGACCCAGATGATCGCAAATCGACATCAGGCT
TCTGTATTTTCTTTGGTGGCAACTTAATCTCATGGGGCTCCAAGAAGCAGTCGATAATCTCTCGCTCGAGCACTGAAGCAGAGTACACATGTTTGGCTACTGCAGCTGCT
GAGCTTATATGGTTGAATTCACTTTTTTGTGACTTATGTGTTTCCTTGTTGCAGAAACCCACTCTATGGTGTGACAATCTCAGTGCCGTGCATCTCAGCGCTAATCCAGT
GCTTCATTCACGGACCAAACATGTTGAAATCGACATCTACTTTGTACGTGATTTAGTTCTCCAAAAACGTCTGCTTATTCAACATCTTCCAGCTACTGATCAAGTGGCTG
ATATATTGACCAAACCACTGTCTGCTTCATCGTTTCTCCGACTAACCTCCAAGCTCAATGTTCGAGACCCACTGACCATTGGCTTGCGGGGGGTGTTAAGGCAGCTCATT
GAGCACGAGGGTTGCACGTGGTCCTTCAGTTACTTTATGATAAATCTATTTCAGCAATTTCTTAATTGGCACGATGAATCAGTAACGAGACAATCGAAGAAACCTAAATG
TTTATATTGGAGGCTAAGGATTTGGGAGACTAATTACAACTTACGATCAAGATTTGCAAACATTCCTTATCGTTATCGGAGCTCCGCCACGGCTGCGCTGCCTTGA
Protein sequence
MYKELTRTTGEKSDWEMDPRGETGKWDGPRPKGSGFWPEPVRLCLVPTVFGCPGSVWFVPKHLQIPKNPKSMSRKSEDLSKVKNILEAIATLAPKLGRTSFKHCKCEENR
VAHLLASEAVFDTHDVFGFISELLDPGKMENDSIFCQSSHGGLSFGSLYSITSLNANSVEIQQSLAVISQNYMKKRFQNDVNCTTLGSGISSNSLDATTSNEPQGECCYI
LVYVDDIIITGSSSDTITKLIADLNGKFALKDLGTLSYFLGIEVSYPSSGGIFLSQSKYINDLLCKTNMDGAKPMNIPMMSGSPPSARHGEALSDVRFYRSVVGALQYAT
FTPPEISYSVNKACQFMHAPTSTHWQLVKRILRYLKGTISHGLLLSVPRCLSLQGFADADWASDPDDRKSTSGFCIFFGGNLISWGSKKQSIISRSSTEAEYTCLATAAA
ELIWLNSLFCDLCVSLLQKPTLWCDNLSAVHLSANPVLHSRTKHVEIDIYFVRDLVLQKRLLIQHLPATDQVADILTKPLSASSFLRLTSKLNVRDPLTIGLRGVLRQLI
EHEGCTWSFSYFMINLFQQFLNWHDESVTRQSKKPKCLYWRLRIWETNYNLRSRFANIPYRYRSSATAALP