; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G008350 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G008350
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCmo_Chr07:3913276..3917801
RNA-Seq ExpressionCmoCh07G008350
SyntenyCmoCh07G008350
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEV96962.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Tanacetum cinerariifolium]1.1e-5940.28Show/hide
Query:  LFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFENV
        LF+++DESGFEKI  ATI+KEA D LEKV+KG DRVK+V L+TL  ELE++KMKE+E + D+I  +QTV N+L RNGE L ++R VEKIL SLTD FENV
Subjt:  LFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFENV

Query:  VCATEESKDLAKFTVNELAGSLKAHEQRK-KKKEETLDQALQIKASIKDEK-DLATGKMIGSGKQFGGLYHISLSSIKSSANQVSQSSHLWHLRLVPLRR
        VCA           +++L+ SL+ HEQRK K K+E+ D ALQ    + D K + A     G G+   G  H+  + + ++   V + +  W+L      +
Subjt:  VCATEESKDLAKFTVNELAGSLKAHEQRK-KKKEETLDQALQIKASIKDEK-DLATGKMIGSGKQFGGLYHISLSSIKSSANQVSQSSHLWHLRLVPLRR

Query:  STSTKRPP-----AYYNDNEMSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKG
        + + +  P      Y + N+M    +      V     + +  W+ AM++EI ++E N T  LV LP  H+ IG  WV+  K N+ G VE +KAR +A G
Subjt:  STSTKRPP-----AYYNDNEMSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKG

Query:  YTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPP
        Y Q   +D  E F+   ++ ++R L++     KW  +  DV++  L+G L EEVY+  PP
Subjt:  YTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPP

GEZ08408.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Tanacetum cinerariifolium]1.5e-5340.38Show/hide
Query:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
        +LF++VDESGFEKI  A+ +K+A DTLEK +KG DRVKQV LQTL GELE+MKMKE+E VSD+   +QTV NQL RNGE LP++R++EKILRSL   FEN
Subjt:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN

Query:  VVCATEESKDLAKFTVNELAGSLKAHEQRK-KKKEETLDQALQIKASIKDEKDL--ATGKMIGSGKQFGGLYHISLSSIKSSANQVSQSSHLWHLRLVPL
        VVC  EE+KDL + T+ ELAGSL+AHEQ K +KK+E+LD+ALQ KA+IK+EK L       I      G   +      +  ++Q  Q S     R    
Subjt:  VVCATEESKDLAKFTVNELAGSLKAHEQRK-KKKEETLDQALQIKASIKDEKDL--ATGKMIGSGKQFGGLYHISLSSIKSSANQVSQSSHLWHLRLVPL

Query:  RRSTSTKRPPAYYNDN-EMSSGANH----LTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHY-KARRV
         R     R    ++ N +  +   H         V       N + +  + +  V L  +        P S   + W       H+  G    + +   V
Subjt:  RRSTSTKRPPAYYNDN-EMSSGANH----LTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHY-KARRV

Query:  AKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG
         +G+    G+D+ E F+P  ++ T+R L+++    +W  +Q+DV++AFL+G LDE VY   PPG
Subjt:  AKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG

KAE8721174.1 hypothetical protein F3Y22_tig00016637pilonHSYRG00095 [Hibiscus syriacus]2.0e-5375Show/hide
Query:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
        MLFRAVDESGFEKI  A  SKEA D L KVFKG DRVKQV LQTL GELES+KMKE E+VSD+I  +QTV NQLNRNGE L E RVVEKILRSLTDNFEN
Subjt:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN

Query:  VVCATEESKDLAKFTVNELAGSLKAHEQR-KKKKEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG
        VVCA EESKDLA  TVNEL GSL+AHEQR KKKKEETL+QALQ KA IKDEK   +    G G+  GG
Subjt:  VVCATEESKDLAKFTVNELAGSLKAHEQR-KKKKEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG

KAE8721174.1 hypothetical protein F3Y22_tig00016637pilonHSYRG00095 [Hibiscus syriacus]1.6e-2640.82Show/hide
Query:  YYNDNEMSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFS
        YY+ NE++          +     + +  W+ AMN+EI A++ N+TW L  LP   + IG  WV+K K N+ G +E YKAR VAKGY Q  G+DY E F+
Subjt:  YYNDNEMSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFS

Query:  PTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG
        P  ++ T+R L++     KW   Q+DV++AFL+G L+E++Y+  PPG
Subjt:  PTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG

KAE8721174.1 hypothetical protein F3Y22_tig00016637pilonHSYRG00095 [Hibiscus syriacus]3.4e-5339.37Show/hide
Query:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
        +LF++VDESGFEKI  A+ +K+A D LEK +KG DRVKQV LQTL GELE+MKMKE+E VSD+I  +QTV NQL RNGE LP++R++EK LRSL   FEN
Subjt:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN

Query:  VVCATEESKDLAKFTVNELAGSLKAHEQRK-KKKEETLDQALQIKASIKDEKDL------------ATGKMIGSGKQFGGLYHISLSSIKSSANQVSQSS
        VVCA EE+KDL + T+ ELAGSL+AHEQRK +KK+E+LD+ALQ KA+IK+EK L              G+    G+  G          +  ++Q  Q S
Subjt:  VVCATEESKDLAKFTVNELAGSLKAHEQRK-KKKEETLDQALQIKASIKDEKDL------------ATGKMIGSGKQFGGLYHISLSSIKSSANQVSQSS

Query:  HLWHLRLVPLRRSTSTKRPPAYYNDN-EMSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVA---LEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSV
             R     R     R    Y  N +  +   H   A+    A       +   N  +V    +E      +    P  + +   W        D   
Subjt:  HLWHLRLVPLRRSTSTKRPPAYYNDN-EMSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVA---LEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSV

Query:  EHYKARR---------VAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG
         H+ +R+         V +G+    G+D+ E F+   ++ T+R L+++    +W  +Q+DV++AFL+G LDE VY+  PPG
Subjt:  EHYKARR---------VAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG

XP_023521026.1 putative WEB family protein At1g65010, chloroplastic [Cucurbita pepo subsp. pepo]1.4e-5779.14Show/hide
Query:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
        ML  AVDE GFEKI RAT SKE  DTLEKVFKGTD+VK V LQTL GELESMKMKESENV D+I  IQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
Subjt:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN

Query:  VVCATEESKDLAKFTVNELAGSLKAHEQRKKKKEETLDQALQIKASIKDEKDLATGKMIGSGK
        V+C  EESK+LAKFTV+ELAGSL+AHEQRKKKK+ETLDQALQ KAS+KDEK   +  + G G+
Subjt:  VVCATEESKDLAKFTVNELAGSLKAHEQRKKKKEETLDQALQIKASIKDEKDLATGKMIGSGK

TrEMBL top hitse value%identityAlignment
A0A6A3AD07 Uncharacterized protein5.1e-2647.46Show/hide
Query:  WQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQN
        W+ AMN+EI A++ N+TW L  LP   + IG  WV+K K N+ G +E YKAR VAKGY Q  G+DY E F+P  ++ T+R L++     KW   Q+DV++
Subjt:  WQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQN

Query:  AFLHGNLDEEVYMSLPPG
        AFL+G L+E++Y+  PPG
Subjt:  AFLHGNLDEEVYMSLPPG

A0A6A3AD07 Uncharacterized protein1.1e-5273.81Show/hide
Query:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
        MLFRAVDESGFEKI  A  SKEA D L KVFKG DRVKQV LQTL GELES+KMKE E+VSD+I  +QTV NQLN NGE L E RVVEKILRSLTDNFEN
Subjt:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN

Query:  VVCATEESKDLAKFTVNELAGSLKAHEQRKKK-KEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG
        VVCA EESKDLA  T+NEL GSL+AHEQRKKK KEETL+QALQ KA IKDEK   +    G G+  GG
Subjt:  VVCATEESKDLAKFTVNELAGSLKAHEQRKKK-KEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG

A0A6A3ASE2 Uncharacterized protein8.4e-1334.75Show/hide
Query:  WQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQN
        W+ AMN+EI A++ N+TW L  LP   + I                          GY Q  G+DY E F+P  ++ T+R L++     KW   Q+DV++
Subjt:  WQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQN

Query:  AFLHGNLDEEVYMSLPPG
        AFL+G L+E++Y+  PPG
Subjt:  AFLHGNLDEEVYMSLPPG

A0A6A3ASE2 Uncharacterized protein7.8e-5170.73Show/hide
Query:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
        +L++A+DESGFEKI  AT SKEA +TLEKVFKG DRVKQV LQTL GELE MKMKESE VSD+I  +QTV NQL RNGEML + RVVEKILRSLT+NFEN
Subjt:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN

Query:  VVCATEESKDLAKFTVNELAGSLKAHE-QRKKKKEETLDQALQIKASIKDEKDLATGKMIGSGK
        VVCA EESKDL   TV ELAGSL+AHE Q+KKKKEE L++ LQ KA+IKD+K L +    G G+
Subjt:  VVCATEESKDLAKFTVNELAGSLKAHE-QRKKKKEETLDQALQIKASIKDEKDLATGKMIGSGK

A0A6A3BX58 Uncharacterized protein9.8e-5475Show/hide
Query:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
        MLFRAVDESGFEKI  A  SKEA D L KVFKG DRVKQV LQTL GELES+KMKE E+VSD+I  +QTV NQLNRNGE L E RVVEKILRSLTDNFEN
Subjt:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN

Query:  VVCATEESKDLAKFTVNELAGSLKAHEQR-KKKKEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG
        VVCA EESKDLA  TVNEL GSL+AHEQR KKKKEETL+QALQ KA IKDEK   +    G G+  GG
Subjt:  VVCATEESKDLAKFTVNELAGSLKAHEQR-KKKKEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG

A0A6A3BX58 Uncharacterized protein7.8e-2740.82Show/hide
Query:  YYNDNEMSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFS
        YY+ NE++          +     + +  W+ AMN+EI A++ N+TW L  LP   + IG  WV+K K N+ G +E YKAR VAKGY Q  G+DY E F+
Subjt:  YYNDNEMSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFS

Query:  PTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG
        P  ++ T+R L++     KW   Q+DV++AFL+G L+E++Y+  PPG
Subjt:  PTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG

A0A6A3BX58 Uncharacterized protein8.3e-5373.81Show/hide
Query:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
        MLFRAVDESGFEKI  A  SKEA D L KVFKG +RVKQV LQTL GELES+K+KE E+VSD+I  +QTV NQLNRNGE L E RVVEKILRSLTDNFEN
Subjt:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN

Query:  VVCATEESKDLAKFTVNELAGSLKAHEQR-KKKKEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG
        VVCA EESKDLA  TVNEL GSL+AHEQR KKKKEETL+QALQ KA IKDEK   +    G G+  GG
Subjt:  VVCATEESKDLAKFTVNELAGSLKAHEQR-KKKKEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG

A0A6A3CZT6 Uncharacterized protein1.5e-2546.61Show/hide
Query:  WQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQN
        W+  MN+EI A++ N+TW L  LP   + IG  WV+K K N+ G +E YKAR VAKGY Q  G+DY E F+P  ++ T+R L++     KW   Q+DV++
Subjt:  WQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQN

Query:  AFLHGNLDEEVYMSLPPG
        AFL+G L+E++Y+  PPG
Subjt:  AFLHGNLDEEVYMSLPPG

A0A6A3CZT6 Uncharacterized protein1.4e-5273.81Show/hide
Query:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN
        MLFRAVDESGFEKI  A  SKEA D L KVFKG DRVKQV LQTL  ELES+KMKE E+VSD+I  +QTV NQLNRNGE L E RV+EKILRSLTDNFEN
Subjt:  MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFEN

Query:  VVCATEESKDLAKFTVNELAGSLKAHEQR-KKKKEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG
        VVCA EESKDLA  TVNEL GSL+AHEQR KKKKEETL+QALQ KA IKDEK   +    G G+  GG
Subjt:  VVCATEESKDLAKFTVNELAGSLKAHEQR-KKKKEETLDQALQIKASIKDEKDLATGKMIGSGKQFGG

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-2541.18Show/hide
Query:  WQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQN
        W++A+N E+ A + N+TW++   P +   +   WV+ +K+N  G+   YKAR VA+G+TQ   +DY+ETF+P  ++++ R++L++V+      HQ+DV+ 
Subjt:  WQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQN

Query:  AFLHGNLDEEVYMSLPPGI
        AFL+G L EE+YM LP GI
Subjt:  AFLHGNLDEEVYMSLPPGI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-2338.04Show/hide
Query:  PLRRSTSTKRPPAYYNDNE---MSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVA
        PLRRS   +     Y   E   +S      +L +V +  P  N L  +AM +E+ +L+ N T+ LV LP   + +   WV+K+K + D  +  YKAR V 
Subjt:  PLRRSTSTKRPPAYYNDNE---MSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVA

Query:  KGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG
        KG+ Q +G+D+ E FSP  K+T++R +L++  +      QLDV+ AFLHG+L+EE+YM  P G
Subjt:  KGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG

P92520 Uncharacterized mitochondrial protein AtMg008202.7e-1645.45Show/hide
Query:  NPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTI
        +P W QAM +E+ AL  N TW LVP P +   +G  WV+K K +SDG+++  KAR VAKG+ Q EG+ + ET+SP  +  T+R +L +
Subjt:  NPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-3243.55Show/hide
Query:  LSSIKSSANQVSQSSHLWHLRLVPLRRSTSTKRPPAYYNDNEMSSGANHLTLAQVPTLA--PIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWC
        L+ I ++ NQ   ++H           S  T+        N   S A  L     P  A   + +  W+ AM  EI A   NHTW LVP PPSH  I  C
Subjt:  LSSIKSSANQVSQSSHLWHLRLVPLRRSTSTKRPPAYYNDNEMSSGANHLTLAQVPTLA--PIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWC

Query:  -WVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG
         W++  K+NSDGS+  YKAR VAKGY Q  G+DY ETFSP  K T++R +L + V R W   QLDV NAFL G L ++VYMS PPG
Subjt:  -WVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.9e-3247.8Show/hide
Query:  STSTKRPPAYYNDNEMSSGANHLTLAQVPTLA--PIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWC-WVYKIKHNSDGSVEHYKARRVAKGYT
        S +T+        N+  S A  L     P  A   + +  W+QAM  EI A   NHTW LVP PP    I  C W++  K NSDGS+  YKAR VAKGY 
Subjt:  STSTKRPPAYYNDNEMSSGANHLTLAQVPTLA--PIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWC-WVYKIKHNSDGSVEHYKARRVAKGYT

Query:  QVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG
        Q  G+DY ETFSP  K T++R +L + V R W   QLDV NAFL G L +EVYMS PPG
Subjt:  QVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.0e-3859.66Show/hide
Query:  LWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQ
        +W  AM+DEI A+E  HTW +  LPP+ K IG  WVYKIK+NSDG++E YKAR VAKGYTQ EG+D+ ETFSP  KLT+++ +L I     +  HQLD+ 
Subjt:  LWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTIVVARKWFTHQLDVQ

Query:  NAFLHGNLDEEVYMSLPPG
        NAFL+G+LDEE+YM LPPG
Subjt:  NAFLHGNLDEEVYMSLPPG

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.9e-1745.45Show/hide
Query:  NPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTI
        +P W QAM +E+ AL  N TW LVP P +   +G  WV+K K +SDG+++  KAR VAKG+ Q EG+ + ET+SP  +  T+R +L +
Subjt:  NPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTTLRYLLTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTTCCGGGCGGTTGACGAGTCGGGCTTTGAGAAGATTGTCAGGGCAACTATTTCAAAAGAAGCGTCGGACACTTTAGAAAAGGTGTTCAAAGGAACTGAC
CGAGTCAAGCAAGTGCTTCTCCAAACTCTGTGTGGCGAGTTGGAGAGCATGAAGATGAAGGAGTCAGAAAATGTATCTGACCACATTGCGCACATACAGACCGTG
GCAAATCAATTAAATCGAAACGGAGAAATGTTACCCGAGACGCGAGTTGTGGAGAAGATCTTGAGGTCGTTAACCGACAACTTCGAGAATGTTGTATGTGCGACA
GAAGAGTCAAAGGACCTAGCGAAGTTCACAGTCAATGAGCTTGCCGGTTCTCTTAAGGCACACGAGCAACGTAAGAAAAAGAAGGAGGAGACACTCGATCAAGCG
CTTCAGATTAAGGCATCAATAAAGGATGAAAAGGACTTGGCTACGGGGAAGATGATTGGCTCGGGTAAACAATTCGGAGGTCTCTATCATATTTCTTTATCTTCT
ATCAAATCTTCAGCAAATCAAGTATCTCAGTCATCTCATTTGTGGCATTTACGCCTAGTTCCACTTCGACGTTCTACTTCTACCAAACGACCTCCAGCTTATTAT
AATGATAATGAGATGTCTTCTGGAGCCAATCATTTAACTCTAGCTCAAGTCCCAACTCTGGCACCAATTGGCAACCCATTATGGCAGCAGGCTATGAATGATGAA
ATTGTAGCTTTGGAACATAATCATACTTGGTCTCTCGTTCCTCTACCACCTAGCCATAAAGCTATTGGTTGGTGTTGGGTGTACAAGATTAAACACAACTCTGAT
GGTTCTGTTGAACATTACAAAGCTCGACGGGTAGCAAAGGGATACACTCAGGTTGAAGGTGTTGATTACAAAGAGACATTTTCTCCTACAACGAAACTTACTACA
CTTCGTTACTTACTCACTATTGTCGTTGCTCGAAAATGGTTCACTCATCAATTGGATGTTCAAAATGCCTTTCTCCATGGTAATCTAGACGAGGAAGTTTATATG
TCTTTACCACCAGGTATTCGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGTTCCGGGCGGTTGACGAGTCGGGCTTTGAGAAGATTGTCAGGGCAACTATTTCAAAAGAAGCGTCGGACACTTTAGAAAAGGTGTTCAAAGGAACTGAC
CGAGTCAAGCAAGTGCTTCTCCAAACTCTGTGTGGCGAGTTGGAGAGCATGAAGATGAAGGAGTCAGAAAATGTATCTGACCACATTGCGCACATACAGACCGTG
GCAAATCAATTAAATCGAAACGGAGAAATGTTACCCGAGACGCGAGTTGTGGAGAAGATCTTGAGGTCGTTAACCGACAACTTCGAGAATGTTGTATGTGCGACA
GAAGAGTCAAAGGACCTAGCGAAGTTCACAGTCAATGAGCTTGCCGGTTCTCTTAAGGCACACGAGCAACGTAAGAAAAAGAAGGAGGAGACACTCGATCAAGCG
CTTCAGATTAAGGCATCAATAAAGGATGAAAAGGACTTGGCTACGGGGAAGATGATTGGCTCGGGTAAACAATTCGGAGGTCTCTATCATATTTCTTTATCTTCT
ATCAAATCTTCAGCAAATCAAGTATCTCAGTCATCTCATTTGTGGCATTTACGCCTAGTTCCACTTCGACGTTCTACTTCTACCAAACGACCTCCAGCTTATTAT
AATGATAATGAGATGTCTTCTGGAGCCAATCATTTAACTCTAGCTCAAGTCCCAACTCTGGCACCAATTGGCAACCCATTATGGCAGCAGGCTATGAATGATGAA
ATTGTAGCTTTGGAACATAATCATACTTGGTCTCTCGTTCCTCTACCACCTAGCCATAAAGCTATTGGTTGGTGTTGGGTGTACAAGATTAAACACAACTCTGAT
GGTTCTGTTGAACATTACAAAGCTCGACGGGTAGCAAAGGGATACACTCAGGTTGAAGGTGTTGATTACAAAGAGACATTTTCTCCTACAACGAAACTTACTACA
CTTCGTTACTTACTCACTATTGTCGTTGCTCGAAAATGGTTCACTCATCAATTGGATGTTCAAAATGCCTTTCTCCATGGTAATCTAGACGAGGAAGTTTATATG
TCTTTACCACCAGGTATTCGCTGA
Protein sequenceShow/hide protein sequence
MLFRAVDESGFEKIVRATISKEASDTLEKVFKGTDRVKQVLLQTLCGELESMKMKESENVSDHIAHIQTVANQLNRNGEMLPETRVVEKILRSLTDNFENVVCAT
EESKDLAKFTVNELAGSLKAHEQRKKKKEETLDQALQIKASIKDEKDLATGKMIGSGKQFGGLYHISLSSIKSSANQVSQSSHLWHLRLVPLRRSTSTKRPPAYY
NDNEMSSGANHLTLAQVPTLAPIGNPLWQQAMNDEIVALEHNHTWSLVPLPPSHKAIGWCWVYKIKHNSDGSVEHYKARRVAKGYTQVEGVDYKETFSPTTKLTT
LRYLLTIVVARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGIR