; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh12G009200 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh12G009200
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationCma_Chr12:7072178..7073309
RNA-Seq ExpressionCmaCh12G009200
SyntenyCmaCh12G009200
Gene Ontology termsNA
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ABA97666.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]5.8e-7447.94Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFM
        MPG FWGE V TAV+LLNRSPT+ L  KT Y AWY ++PAVH  R FGC+ ++            +   +  L     +KA  + ++ V   D  P  F 
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFM

Query:  VKYLITESEE------GGAQHQQPSPPPAGATPE--------PVEFTTPRTADSTLDADHDTD-LEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISA
        V++ ++ ++E            Q  P P   TPE         VE  +P + DS LD D D +  E RYR +D+++G   P G+A  +L E  ELHA+SA
Subjt:  VKYLITESEE------GGAQHQQPSPPPAGATPE--------PVEFTTPRTADSTLDADHDTD-LEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISA

Query:  DEPNTFVEVEK--NPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLA
        +EP++  E E   +P W  AM++EM SI EN TWSL D+P GH+AIGLKWV+KLKR E+G VV HKARLVAKGYVQ+QGVDF+EVF PVARLESV  LLA
Subjt:  DEPNTFVEVEK--NPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLA

Query:  ITAHYSWEVHHMDVKSAFLNGELRRPSISDNHLASWTTTT
        + AH +WEVHHMDVKSAFLN EL+     +  L +W+  T
Subjt:  ITAHYSWEVHHMDVKSAFLNGELRRPSISDNHLASWTTTT

CAA2616957.1 unnamed protein product [Spirodela intermedia]1.7e-7346.51Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKALSKQ---------WNDVI-
        +PG FWGE V TAVY+LNR PT+S+DG T +E W+ KKPAVHH +VFGC+AY+            +G ++  +     +KA               DV+ 
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKALSKQ---------WNDVI-

Query:  -----EADHNPNKFMVKYLI-----------TESEEGGAQHQQPSPPPA-----GATP-----EPVEFTTPRTAD-STLDADHDTDLEARYRRMDDLVGG
             +A  + + F ++Y +            E  +  A   +P  PP+     GA P     E VEF +P +     LDA+HD D   R+R++D++VG 
Subjt:  -----EADHNPNKFMVKYLI-----------TESEEGGAQHQQPSPPPA-----GATP-----EPVEFTTPRTAD-STLDADHDTDLEARYRRMDDLVGG

Query:  GEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGV
          P GLA+  L    ELHA+S+DEP +FVE E +P W KAM+EEM SI EN+TWSL D+P G +AIGLKWV+K+KR E G V K+KARLV KGY Q+QG+
Subjt:  GEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGV

Query:  DFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELR
        D++EVFAPVARL++VR L+A+ AH  WEVHHMDVKSAFLNG+L+
Subjt:  DFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELR

CAE03692.2 OSJNBb0026E15.10 [Oryza sativa Japonica Group]1.0e-7848.26Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMK------------GGELTCLPTSSSTKA-------------------
        +PGRFWGE + TAV+LLNRSPT+SLD +T YEAWY + PAVH  R FGCV ++K               +  L     +KA                   
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMK------------GGELTCLPTSSSTKA-------------------

Query:  ---LSKQWNDVI-EADHNPNKFMVKYLITESEEGGAQHQQPSP---------------PPAGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVG
           ++  W  V  +       F V+ ++T +  G A    P+P               PP+  +PE VEF TP T DS LDAD D D+  RYR +D+L+G
Subjt:  ---LSKQWNDVI-EADHNPNKFMVKYLITESEEGGAQHQQPSP---------------PPAGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVG

Query:  GGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQG
           PPG A   L+++ ELH +SADEP +  E E +P W  AMQ+E+ +I +N TWSL D+P GH+AIGLKWV+KLKR E+G +V++KARLVAKGYVQ+QG
Subjt:  GGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQG

Query:  VDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL
        VDF+EVFA VARLESVR LLA+ AH  W+VHHMDVKSAFLNGEL
Subjt:  VDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL

XP_023521510.1 uncharacterized protein LOC111785335 [Cucurbita pepo subsp. pepo]5.6e-7746.84Show/hide
Query:  LNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMK------------GGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGGAQHQ
        +NRSPTR L GKTSYEAWYNKK AVHHFRVF C+AYMK            G ++  +     +K      NDVIE D NPN+F V+YLITE  EGGAQH+
Subjt:  LNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMK------------GGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGGAQHQ

Query:  QPSPPPAGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTS------
        + SP  A  TP+PVEF TPRTADSTLD DHD DL ARYRRMDDLVGGGEPPGLA  +L+E+ ELHAIS DEPNTF + E+NPC LK    ++ S      
Subjt:  QPSPPPAGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTS------

Query:  -------------------------------ITENQTWS--------------------------------------------LEDI-----PPGHQAIG
                                       ITE +                                               ++D+     PPG     
Subjt:  -------------------------------ITENQTWS--------------------------------------------LEDI-----PPGHQAIG

Query:  LKWVFKL------------------KRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELR
        L+ V +L                   R EKGEVVKHKA LVAKGY+ KQGVDFEEVFA V RLE VR LL I  H SWEVHHMDVKS FLNGEL+
Subjt:  LKWVFKL------------------KRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELR

XP_023522344.1 uncharacterized protein LOC111786267, partial [Cucurbita pepo subsp. pepo]4.9e-9758.5Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMKGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGG
        MPGRFWGE VMTAVYLLNRSPTRSLDGKT YEAW   +  V    VF                    ++   QWNDVIEADHNPN+F V+YL+TE EE G
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMKGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGG

Query:  AQHQQPSPPPAGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSIT
        AQHQ+PSPPPAGA PEPVEF TPRTA+STLDADHDT LEARYRR+DDLVGGGEPPGLAA +LKE+AELHAISADEPNTF E EKNPCW            
Subjt:  AQHQQPSPPPAGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSIT

Query:  ENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL------
                                LKR +K EVVK+KARLV KGYVQK GVDFEEVFAPV RLESVRFLL+I AHYSWEVHHMDVKSAFLN EL      
Subjt:  ENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL------

Query:  RRP----------------------------------------SISDNHLASWTTTTPI
        R+P                                        S+SDNHLASWTTTT I
Subjt:  RRP----------------------------------------SISDNHLASWTTTTPI

TrEMBL top hitse value%identityAlignment
A0A3L6TJD2 Integrase catalytic domain-containing protein1.1e-7046.34Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVA-------YMKGGELTCLPT-----SSSTKALS-----------------
        +PG FWGE V TAV++LNRSPTRSLDGKT YEA +  +PAV   R FGC+A       Y+K  E    P       + +KA                   
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVA-------YMKGGELTCLPT-----SSSTKALS-----------------

Query:  ---KQWNDVIEA---DHNPN-KFMVKY-LITESEEGG---------------------------------AQHQQPSPPP----AGATPEPVEFTT-PRT
            QW+   EA   D   N +F V +    ES  GG                                   H  PSP P       +PEP+EF T P  
Subjt:  ---KQWNDVIEA---DHNPN-KFMVKY-LITESEEGG---------------------------------AQHQQPSPPP----AGATPEPVEFTT-PRT

Query:  ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKL
            LDADHD D+  R+RR+D+L+G G  PGLA  ++    EL   +A+EP +F E EK+ CW +AM+EEM SI EN+TWSL ++P GH+ IGLKWVFK+
Subjt:  ADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKL

Query:  KRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL
        KR E G +VKHKARLVAKGYVQ+ G+DF+EVFAPVARLESVR LLA+ A   WEVHHMDVKSAFLNG+L
Subjt:  KRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL

A0A7I8IFL9 Hypothetical protein8.2e-7446.51Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKALSKQ---------WNDVI-
        +PG FWGE V TAVY+LNR PT+S+DG T +E W+ KKPAVHH +VFGC+AY+            +G ++  +     +KA               DV+ 
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKALSKQ---------WNDVI-

Query:  -----EADHNPNKFMVKYLI-----------TESEEGGAQHQQPSPPPA-----GATP-----EPVEFTTPRTAD-STLDADHDTDLEARYRRMDDLVGG
             +A  + + F ++Y +            E  +  A   +P  PP+     GA P     E VEF +P +     LDA+HD D   R+R++D++VG 
Subjt:  -----EADHNPNKFMVKYLI-----------TESEEGGAQHQQPSPPPA-----GATP-----EPVEFTTPRTAD-STLDADHDTDLEARYRRMDDLVGG

Query:  GEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGV
          P GLA+  L    ELHA+S+DEP +FVE E +P W KAM+EEM SI EN+TWSL D+P G +AIGLKWV+K+KR E G V K+KARLV KGY Q+QG+
Subjt:  GEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGV

Query:  DFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELR
        D++EVFAPVARL++VR L+A+ AH  WEVHHMDVKSAFLNG+L+
Subjt:  DFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELR

A0A7I8IJM7 Hypothetical protein1.4e-7044.92Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKAL------------------
        +PG FWGE V TAVY+LNR PT+S+DG T +E W+ KKPAVHH +VFGC+AY+            +G ++  +     +KA                   
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKAL------------------

Query:  --SKQWN-----DVIEADHNPNKFMVKYLI-----------TESEEGGAQHQQPSPPPA-----GATP-----EPVEFTTPRTAD-STLDADHDTDLEAR
          + QW+     +  +A  + + F ++Y +            E  +  A   +P  PP+     GA P     E VEF +P +     LDA+HD D   R
Subjt:  --SKQWN-----DVIEADHNPNKFMVKYLI-----------TESEEGGAQHQQPSPPPA-----GATP-----EPVEFTTPRTAD-STLDADHDTDLEAR

Query:  YRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLV
        +R++D++VG   P GLA+  L    ELHA+S+DEP +FVE E +P W KAM+EEM SI EN+T SL D+P G +AIGLKWV+K+KR E   VVK+KARLV
Subjt:  YRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLV

Query:  AKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELR
         KGYVQ QG+D++EVFAPVARL+++R L+A+ AH  WEVHHMDVKSAFLNG L+
Subjt:  AKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELR

Q2QSF4 Retrotransposon protein, putative, unclassified2.8e-7447.94Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFM
        MPG FWGE V TAV+LLNRSPT+ L  KT Y AWY ++PAVH  R FGC+ ++            +   +  L     +KA  + ++ V   D  P  F 
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------------KGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFM

Query:  VKYLITESEE------GGAQHQQPSPPPAGATPE--------PVEFTTPRTADSTLDADHDTD-LEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISA
        V++ ++ ++E            Q  P P   TPE         VE  +P + DS LD D D +  E RYR +D+++G   P G+A  +L E  ELHA+SA
Subjt:  VKYLITESEE------GGAQHQQPSPPPAGATPE--------PVEFTTPRTADSTLDADHDTD-LEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISA

Query:  DEPNTFVEVEK--NPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLA
        +EP++  E E   +P W  AM++EM SI EN TWSL D+P GH+AIGLKWV+KLKR E+G VV HKARLVAKGYVQ+QGVDF+EVF PVARLESV  LLA
Subjt:  DEPNTFVEVEK--NPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLA

Query:  ITAHYSWEVHHMDVKSAFLNGELRRPSISDNHLASWTTTT
        + AH +WEVHHMDVKSAFLN EL+     +  L +W+  T
Subjt:  ITAHYSWEVHHMDVKSAFLNGELRRPSISDNHLASWTTTT

Q7XPB1 OSJNBb0026E15.10 protein4.9e-7948.26Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMK------------GGELTCLPTSSSTKA-------------------
        +PGRFWGE + TAV+LLNRSPT+SLD +T YEAWY + PAVH  R FGCV ++K               +  L     +KA                   
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMK------------GGELTCLPTSSSTKA-------------------

Query:  ---LSKQWNDVI-EADHNPNKFMVKYLITESEEGGAQHQQPSP---------------PPAGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVG
           ++  W  V  +       F V+ ++T +  G A    P+P               PP+  +PE VEF TP T DS LDAD D D+  RYR +D+L+G
Subjt:  ---LSKQWNDVI-EADHNPNKFMVKYLITESEEGGAQHQQPSP---------------PPAGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVG

Query:  GGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQG
           PPG A   L+++ ELH +SADEP +  E E +P W  AMQ+E+ +I +N TWSL D+P GH+AIGLKWV+KLKR E+G +V++KARLVAKGYVQ+QG
Subjt:  GGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQG

Query:  VDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL
        VDF+EVFA VARLESVR LLA+ AH  W+VHHMDVKSAFLNGEL
Subjt:  VDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.1e-3026.91Show/hide
Query:  FWGEVVMTAVYLLNRSPTRSL--DGKTSYEAWYNKKPAVHHFRVFGCVAYMKGGELTCLPTSSSTKAL--------SKQWND-----------VIEADHN
        FWGE V+TA YL+NR P+R+L    KT YE W+NKKP + H RVFG   Y+            S K++         K W+            V++  + 
Subjt:  FWGEVVMTAVYLLNRSPTRSL--DGKTSYEAWYNKKPAVHHFRVFGCVAYMKGGELTCLPTSSSTKAL--------SKQWND-----------VIEADHN

Query:  PNKFMVKY---LITESEEGGAQH--------QQPSPPPAGATPEPVEFTTPRTADSTLDADHDTDL-------------------------------EAR
         N   VK+    + +S+E   ++         Q   P      + ++F          +  +D+                                 E++
Subjt:  PNKFMVKY---LITESEEGGAQH--------QQPSPPPAGATPEPVEFTTPRTADSTLDADHDTDL-------------------------------EAR

Query:  YRRMDDLV----GGGEP----PGLAAHKLKEMA----------------------------------------ELHAISADEPNTFVEV---EKNPCWLK
         R+ DD +    G G P        A  LKE+                                           H I  D PN+F E+   +    W +
Subjt:  YRRMDDLV----GGGEP----PGLAAHKLKEMA----------------------------------------ELHAISADEPNTFVEV---EKNPCWLK

Query:  AMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFL
        A+  E+ +   N TW++   P     +  +WVF +K  E G  +++KARLVA+G+ QK  +D+EE FAPVAR+ S RF+L++   Y+ +VH MDVK+AFL
Subjt:  AMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFL

Query:  NGELR
        NG L+
Subjt:  NGELR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.7e-3330.24Show/hide
Query:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAY----------MKGGELTCLPTSSSTKALS-KQWNDV-------IEADH
        +P  FWGE V TA YL+NRSP+  L  +     W NK+ +  H +VFGC A+          +    + C+      +    + W+ V        +   
Subjt:  MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAY----------MKGGELTCLPTSSSTKALS-KQWNDV-------IEADH

Query:  NPNKFMVKYLITESEEGGAQHQQPSPPPAGATPEPVEFTTPRTA-----------------DSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEM
          ++      ++E  + G      + P     P   E TT   +                 +   + +H T  E +++ +       E P + + +    
Subjt:  NPNKFMVKYLITESEEGGAQHQQPSPPPAGATPEPVEFTTPRTA-----------------DSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEM

Query:  AELHAISAD-EPNTFVEV----EKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPV
         E   IS D EP +  EV    EKN   +KAMQEEM S+ +N T+ L ++P G + +  KWVFKLK+    ++V++KARLV KG+ QK+G+DF+E+F+PV
Subjt:  AELHAISAD-EPNTFVEV----EKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPV

Query:  ARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL
         ++ S+R +L++ A    EV  +DVK+AFL+G+L
Subjt:  ARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL

P92520 Uncharacterized mitochondrial protein AtMg008209.8e-1640.82Show/hide
Query:  EPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI
        EP + +   K+P W +AMQEE+ +++ N+TW L   P     +G KWVFK K    G + + KARLVAKG+ Q++G+ F E ++PV R  ++R +L +
Subjt:  EPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-2028.92Show/hide
Query:  GGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGGAQHQQPSPPP--------AGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMD
        G + T  PT + T+  S Q      + +NP       L  +S    AQ    SP P           TP  +    P      ++ ++   L        
Subjt:  GGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGGAQHQQPSPPP--------AGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMD

Query:  DLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAI-GLKWVFKLKRKEKGEVVKHKARLVAKGY
          +G     G+     K    +   +  EP T ++  K+  W  AM  E+ +   N TW L   PP H  I G +W+F  K    G + ++KARLVAKGY
Subjt:  DLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAI-GLKWVFKLKRKEKGEVVKHKARLVAKGY

Query:  VQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL
         Q+ G+D+ E F+PV +  S+R +L +    SW +  +DV +AFL G L
Subjt:  VQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-1938.84Show/hide
Query:  EPNTFVEVEKNPCWLKAMQEEMTSITENQTWSL-EDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIT
        EP T ++  K+  W +AM  E+ +   N TW L    PP    +G +W+F  K    G + ++KARLVAKGY Q+ G+D+ E F+PV +  S+R +L + 
Subjt:  EPNTFVEVEKNPCWLKAMQEEMTSITENQTWSL-EDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAIT

Query:  AHYSWEVHHMDVKSAFLNGEL
           SW +  +DV +AFL G L
Subjt:  AHYSWEVHHMDVKSAFLNGEL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.1e-2844.26Show/hide
Query:  ADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI
        A EP+T+ E ++   W  AM +E+ ++    TW +  +PP  + IG KWV+K+K    G + ++KARLVAKGY Q++G+DF E F+PV +L SV+ +LAI
Subjt:  ADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI

Query:  TAHYSWEVHHMDVKSAFLNGEL
        +A Y++ +H +D+ +AFLNG+L
Subjt:  TAHYSWEVHHMDVKSAFLNGEL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.9e-1740.82Show/hide
Query:  EPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI
        EP + +   K+P W +AMQEE+ +++ N+TW L   P     +G KWVFK K    G + + KARLVAKG+ Q++G+ F E ++PV R  ++R +L +
Subjt:  EPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGGGAGATTCTGGGGAGAGGTAGTAATGACGGCCGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTTGACGGGAAGACGTCATATGAGGCCTGGTACAACAA
AAAACCAGCGGTACATCATTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGGGGGAGAGCTCACGTGTCTCCCGACGTCGTCTTCGACGAAAGCACTGTCTAAGCAGT
GGAATGACGTGATCGAGGCAGACCATAATCCAAATAAATTCATGGTGAAGTACCTCATCACCGAGTCGGAAGAAGGAGGAGCCCAGCATCAGCAGCCGTCACCGCCGCCA
GCAGGTGCAACCCCTGAACCAGTAGAATTTACAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCCAGGTACCGAAGGATGGACGA
CCTAGTAGGAGGAGGTGAACCACCTGGACTGGCAGCGCACAAACTCAAAGAAATGGCCGAACTACATGCCATCAGTGCAGATGAACCGAACACCTTCGTCGAAGTAGAAA
AGAACCCGTGCTGGCTGAAGGCAATGCAGGAGGAGATGACATCCATCACCGAGAACCAGACATGGAGTCTGGAGGATATACCGCCGGGACACCAAGCCATAGGGCTCAAA
TGGGTCTTCAAACTGAAGCGCAAAGAAAAAGGAGAAGTTGTGAAGCACAAGGCCCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTTGAAGAGGTATT
TGCGCCAGTGGCAAGGTTAGAATCCGTTCGTTTCTTGCTGGCAATTACGGCACATTACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGT
TGAGGAGACCGTCTATATCTGACAATCACCTGGCTTCCTGGACAACGACAACCCCAATAAAGTACTGCGCCTGCACAAGGCACTCTACGGGCTTCGACAAGCTCCACGAG
CCTGGAACCCAAAGCTCGACAGTACCTTACTGTCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTGGGAGATTCTGGGGAGAGGTAGTAATGACGGCCGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTTGACGGGAAGACGTCATATGAGGCCTGGTACAACAA
AAAACCAGCGGTACATCATTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGGGGGAGAGCTCACGTGTCTCCCGACGTCGTCTTCGACGAAAGCACTGTCTAAGCAGT
GGAATGACGTGATCGAGGCAGACCATAATCCAAATAAATTCATGGTGAAGTACCTCATCACCGAGTCGGAAGAAGGAGGAGCCCAGCATCAGCAGCCGTCACCGCCGCCA
GCAGGTGCAACCCCTGAACCAGTAGAATTTACAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCCAGGTACCGAAGGATGGACGA
CCTAGTAGGAGGAGGTGAACCACCTGGACTGGCAGCGCACAAACTCAAAGAAATGGCCGAACTACATGCCATCAGTGCAGATGAACCGAACACCTTCGTCGAAGTAGAAA
AGAACCCGTGCTGGCTGAAGGCAATGCAGGAGGAGATGACATCCATCACCGAGAACCAGACATGGAGTCTGGAGGATATACCGCCGGGACACCAAGCCATAGGGCTCAAA
TGGGTCTTCAAACTGAAGCGCAAAGAAAAAGGAGAAGTTGTGAAGCACAAGGCCCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTTGAAGAGGTATT
TGCGCCAGTGGCAAGGTTAGAATCCGTTCGTTTCTTGCTGGCAATTACGGCACATTACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGT
TGAGGAGACCGTCTATATCTGACAATCACCTGGCTTCCTGGACAACGACAACCCCAATAAAGTACTGCGCCTGCACAAGGCACTCTACGGGCTTCGACAAGCTCCACGAG
CCTGGAACCCAAAGCTCGACAGTACCTTACTGTCACTGA
Protein sequenceShow/hide protein sequence
MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMKGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGGAQHQQPSPPP
AGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLK
WVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELRRPSISDNHLASWTTTTPIKYCACTRHSTGFDKLHE
PGTQSSTVPYCH