; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0023410 (gene) of Chayote v1 genome

Gene IDSed0023410
OrganismSechium edule (Chayote v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG02:35352543..35357209
RNA-Seq ExpressionSed0023410
SyntenySed0023410
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU32278.1 hypothetical protein TSUD_62940 [Trifolium subterraneum]5.0e-6243.09Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        +S A++PL +WD AF TA+HLINRL T  L+   P   LF + PDY+ L VFG  CFP +RPY   K DFRS  C FLGYS+ HKGYKCL   G++YVS+
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGAND---IAPITSVI
        +++F+E  F + +   +P  ++ S  T  IP+ + P  + ++++ T        + ++   ++SP  +  P+QP     HS  SN  +D   ++P    +
Subjt:  NILFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGAND---IAPITSVI

Query:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA
        P    N HPMQTR KSG+  PK          EP   K AL+   W  AM +E++ L +NKTWTLV  P N+ V+GCKWVF+ K++ DGTI +YKARLVA
Subjt:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA

Query:  KGFHQTLDVNF
        KGFHQ    +F
Subjt:  KGFHQTLDVNF

KAG8473223.1 hypothetical protein CXB51_035172 [Gossypium anomalum]9.1e-6444.38Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        ++ A +PL FW  AF +A++LINRL T VL G SP + L    P Y  L +FG +C+PYLRP+   KL  RS+PC FLGYSS+HKGYKC+D+ GKL+VSR
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFATKGHSPKTTSKSVVTSS--------IPILSQP----PQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGAN
        +++FDE  F FAT   SP  +S S +TSS        +P++  P       S+ S S    P ++    S +  SS  +S  P++     +  + S+   
Subjt:  NILFDELTFSFATKGHSPKTTSKSVVTSS--------IPILSQP----PQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGAN

Query:  DIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTI
           P+   IP P VN HPMQTR KSGIF+P+ + S      EP   + AL    W  A Q E+D L++N+TW LV  P N + VGCKWVFK+K+H+DG+I
Subjt:  DIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTI

Query:  ARYKARLVAKGFHQTLDVNF
        ARYK RLV KG+ Q   ++F
Subjt:  ARYKARLVAKGFHQTLDVNF

KAG8479334.1 hypothetical protein CXB51_029681 [Gossypium anomalum]1.3e-6242.37Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        ++ AS+P+++W DAF+TA++++NRL T  L GVSP ++LF   PDY QL VFG  C+P LRPY   KL +RS PCTFLGY++ H+GYKC+D  G++Y+SR
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSL---QSNGANDIAPITSVI
        ++ FDE T+ FA         SKSVV       S P   S Q +  +    + +   + +  SSP+ S+P S     DT  +    SN  + +A I    
Subjt:  NILFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSL---QSNGANDIAPITSVI

Query:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA
           ++N HPM TR K GI++PK  ++   D +EP     A+    W++A+ DE   L++N+TW LVS P+NQ +VGCKW+FK+K++SDG++AR K RLVA
Subjt:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA

Query:  KGFHQT--LDVNFFFRLLARL
        +GF Q   LD +  F L+ ++
Subjt:  KGFHQT--LDVNFFFRLLARL

KAG8502419.1 hypothetical protein CXB51_000456 [Gossypium anomalum]7.7e-6344.59Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        ++HASMPL++W+DAF+T I+LINRL +  L  + P +KLF   P Y  L VFG  CFP LRPY   KL FRS PCTFLGYSS+HKGY+CL  +GK+YVSR
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSS--SQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTH--SLQSNGANDIAPITSV
        ++ F E TF F T   +PK+T+   V++ + +LS   +S+  SQS+     PN +V+       ++   +S P+    S TH     S   + ++  T  
Subjt:  NILFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSS--SQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTH--SLQSNGANDIAPITSV

Query:  IPCPIVNAHPMQTRGKSGIFRPKALIS--KTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKAR
        +P   +N+HPM TRGK+ IF+PK  +S         P +   A++  HW+  + +E   L+ N TW+L + P N+K +GCKW+FKVKK +DGTI RYKAR
Subjt:  IPCPIVNAHPMQTRGKSGIFRPKALIS--KTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKAR

Query:  LVAKGFHQTLDVNF
        LVAKGF Q   ++F
Subjt:  LVAKGFHQTLDVNF

TYK18915.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.3e-6445.59Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        +S A++PL+FWD+AFST+++LIN L TPVL  +SPL+K+F R P++  L VFG KC+PYLRPY   KL  RS PCTFLGYS+ HKGYKCL  +G+L++SR
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFAT-KGHSPKTTSKSVVTSSIPILSQPPQS-------------SSQSLSTLQPPNVSVNDSSTLQ------NSSPLLSSP-PSQPVASDT
        ++LFDE +F +A+   HS    SK+V+  S P+ S  P S              S +   L P  V   ++ T +      NS  +  SP P +P     
Subjt:  NILFDELTFSFAT-KGHSPKTTSKSVVTSSIPILSQPPQS-------------SSQSLSTLQPPNVSVNDSSTLQ------NSSPLLSSP-PSQPVASDT

Query:  HSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFK
        H   S G N     TS+        HPM T+ K  IF+PKA +   Y + E  NAK A    HW+KAM++E+  L +N TW+L+ +  NQK+VGCKWVFK
Subjt:  HSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFK

Query:  VKKHSDGTIARYKARLVAKGFHQTLDVNF
        +K++S G+I+RYKARLVAKGFHQT ++++
Subjt:  VKKHSDGTIARYKARLVAKGFHQTLDVNF

TrEMBL top hitse value%identityAlignment
A0A2K3NHD4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)4.1e-6245.02Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        +S AS+PL++WD AF TA++LINRL +  L    P   LF + PDY  L VFG  CFP LRPY   KLD+RS  C FLGYS  HKGY+CL  NG+L++S+
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFA---TKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSVI
        +++F+E  F F    T  HS   +  S VT S P++  P  S S + S    P+ S+ +S     SSP++S  P+ P  +   SL    A  I P    +
Subjt:  NILFDELTFSFA---TKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSVI

Query:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA
             N HPM TR K G  +P+         IEP + K AL+   W+ AMQ EYD L+ N TWTLV  P ++  +GCKWVF+VK++SDGT+ +YKARLVA
Subjt:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA

Query:  KGFHQTLDVNF
        KGFHQ   ++F
Subjt:  KGFHQTLDVNF

A0A2Z6MJI3 Integrase catalytic domain-containing protein2.4e-6243.09Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        +S A++PL +WD AF TA+HLINRL T  L+   P   LF + PDY+ L VFG  CFP +RPY   K DFRS  C FLGYS+ HKGYKCL   G++YVS+
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGAND---IAPITSVI
        +++F+E  F + +   +P  ++ S  T  IP+ + P  + ++++ T        + ++   ++SP  +  P+QP     HS  SN  +D   ++P    +
Subjt:  NILFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGAND---IAPITSVI

Query:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA
        P    N HPMQTR KSG+  PK          EP   K AL+   W  AM +E++ L +NKTWTLV  P N+ V+GCKWVF+ K++ DGTI +YKARLVA
Subjt:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA

Query:  KGFHQTLDVNF
        KGFHQ    +F
Subjt:  KGFHQTLDVNF

A0A438H844 Retrovirus-related Pol polyprotein from transposon RE13.1e-6245.83Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        ++ AS+P  +WD+AF T+++LINRL TPVL   SPL+ LF + P YSQL VFG  C+P LRP+   KL FRS PCTFLGYS  HKGYKCL  NG + +SR
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFATKGHSPKTTSK-SVVTSSIPILSQPP---QSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSV
        +++FDE  F FA +    +TTS  S  ++S+P  +  P     SS S ST  P N S+  +++  N +       SQP                 P +S 
Subjt:  NILFDELTFSFATKGHSPKTTSK-SVVTSSIPILSQPP---QSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSV

Query:  IPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLV
         P P   +H M TR K+GIF+PKA +  T     P +   AL+ +HW++AM DEY  L++N TW LV  P ++K++G KWVFKVK++ DGTI +YKARLV
Subjt:  IPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLV

Query:  AKGFHQTLDVNF
        AKGFHQ +  +F
Subjt:  AKGFHQTLDVNF

A0A5D3D5W0 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-6445.59Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        +S A++PL+FWD+AFST+++LIN L TPVL  +SPL+K+F R P++  L VFG KC+PYLRPY   KL  RS PCTFLGYS+ HKGYKCL  +G+L++SR
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFAT-KGHSPKTTSKSVVTSSIPILSQPPQS-------------SSQSLSTLQPPNVSVNDSSTLQ------NSSPLLSSP-PSQPVASDT
        ++LFDE +F +A+   HS    SK+V+  S P+ S  P S              S +   L P  V   ++ T +      NS  +  SP P +P     
Subjt:  NILFDELTFSFAT-KGHSPKTTSKSVVTSSIPILSQPPQS-------------SSQSLSTLQPPNVSVNDSSTLQ------NSSPLLSSP-PSQPVASDT

Query:  HSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFK
        H   S G N     TS+        HPM T+ K  IF+PKA +   Y + E  NAK A    HW+KAM++E+  L +N TW+L+ +  NQK+VGCKWVFK
Subjt:  HSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFK

Query:  VKKHSDGTIARYKARLVAKGFHQTLDVNF
        +K++S G+I+RYKARLVAKGFHQT ++++
Subjt:  VKKHSDGTIARYKARLVAKGFHQTLDVNF

A0A803NRU8 Uncharacterized protein2.4e-6245.66Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR
        ++ ASMPL FWD+AF  A++L NRL TPVL+ +SP++ LF+  PDY  L +FG  CFP +RPY   KLDFRS PCTFLG S  HKGYKCLD +G+LY+SR
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSR

Query:  NILFDELTFSFAT-KGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVS--VNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSVI
        +++FDE  FS+A+    + ++ S+S + S+IP+   P   + +SL +L   +V+  V D++ + +S   L  P    V S   SLQ              
Subjt:  NILFDELTFSFAT-KGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVS--VNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSVI

Query:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA
        P P +N H M+TR KSGI++PKAL+       EP N K ALK   W  AM +E   L +N TWT V  P  +  +GCKWV+K K ++DG + R KARLVA
Subjt:  PCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA

Query:  KGFHQTLDVNF
        KGFHQ    +F
Subjt:  KGFHQTLDVNF

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.6e-0834.23Show/hide
Query:  ISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQTLDVNF--FFRLLARLLNQ
        +  ++DEI+  + K     + W +A+  E +    N TWT+  +P N+ +V  +WVF VK +  G   RYKARLVA+GF Q   +++   F  +AR+ + 
Subjt:  ISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQTLDVNF--FFRLLARLLNQ

Query:  LLFVFFLLLLL
          F F L L++
Subjt:  LLFVFFLLLLL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-2128.48Show/hide
Query:  ASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCF---PYLRPYKLDFRSKPCTFLGYSSMHKGYKCLDE-NGKLYVSRNI
        A +P +FW +A  TA +LINR  +  L    P +   ++   YS L VFG + F   P  +  KLD +S PC F+GY     GY+  D    K+  SR++
Subjt:  ASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCF---PYLRPYKLDFRSKPCTFLGYSSMHKGYKCLDE-NGKLYVSRNI

Query:  LFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSVIPCPIV
        +F E      T     +     ++ + + I S     +S   +T +           ++    L      + V    H  Q    +           P+ 
Subjt:  LFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSVIPCPIV

Query:  NAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVAL---KCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKG
         +   +   +        LIS   D+ EP + K  L   +     KAMQ+E + L +N T+ LV  P  ++ + CKWVFK+KK  D  + RYKARLV KG
Subjt:  NAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVAL---KCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKG

Query:  FHQTLDVNF
        F Q   ++F
Subjt:  FHQTLDVNF

P92520 Uncharacterized mitochondrial protein AtMg008201.4e-2254.29Show/hide
Query:  MQTRGKSGI--FRPK-ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQT
        M TR K+GI    PK +L   T  + EP +   ALK   W +AMQ+E D L +NKTW LV  P+NQ ++GCKWVFK K HSDGT+ R KARLVAKGFHQ 
Subjt:  MQTRGKSGI--FRPK-ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQT

Query:  LDVNF
          + F
Subjt:  LDVNF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.2e-4234.7Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLD-ENGKLYVS
        +SHAS+P  +W  AF+ A++LINRL TP+L   SP QKLF   P+Y +L VFG  C+P+LRPY   KLD +S+ C FLGYS     Y CL  +  +LY+S
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLD-ENGKLYVS

Query:  RNILFDELTFSF---------------------------------------------ATKGHSPKTTSK----------SVVTSSIPILSQP-------P
        R++ FDE  F F                                             AT   SP    +          S  +SS P   +P       P
Subjt:  RNILFDELTFSF---------------------------------------------ATKGHSPKTTSK----------SVVTSSIPILSQP-------P

Query:  QSSSQSLSTLQPPNVSVNDS-STLQNSSP-----LLSSP--PSQPVASDTHSLQSNGANDIAPITSVIPCP------------IVNAHPMQTRGKSGIFR
        Q ++Q   T    + S N S +   N SP      LS+P   S    S T S  S+  +   P   + P P             +N H M TR K+GI +
Subjt:  QSSSQSLSTLQPPNVSVNDS-STLQNSSP-----LLSSP--PSQPVASDTHSLQSNGANDIAPITSVIPCP------------IVNAHPMQTRGKSGIFR

Query:  PK---ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQ-KVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQ
        P    +L      E EP  A  ALK   WR AM  E +  + N TW LV  P +   +VGC+W+F  K +SDG++ RYKARLVAKG++Q
Subjt:  PK---ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQ-KVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.7e-4433.33Show/hide
Query:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLD-ENGKLYVS
        +SHAS+P  +W  AFS A++LINRL TP+L   SP QKLF + P+Y +L VFG  C+P+LRPY   KL+ +SK C F+GYS     Y CL    G+LY S
Subjt:  MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPY---KLDFRSKPCTFLGYSSMHKGYKCLD-ENGKLYVS

Query:  RNILFDELTFSFATKGHSPKT------------------------------------TSKSVVTSSIPIL------SQPPQSSSQSLSTLQP--------
        R++ FDE  F F+T      T                                    TS    +S  P+       S  P SS  S S+ +P        
Subjt:  RNILFDELTFSFATKGHSPKT------------------------------------TSKSVVTSSIPIL------SQPPQSSSQSLSTLQP--------

Query:  -PNVSVNDSSTLQNSSPLLSSP----------------PSQPVAS----------DTHSLQSNGANDIAPITSVIPCP---------IVNAHPMQTRGKS
         P    + +    ++SP+L++P                P  P++S             +  S+ +    P+  V+P P          VN H M TR K 
Subjt:  -PNVSVNDSSTLQNSSPLLSSP----------------PSQPVAS----------DTHSLQSNGANDIAPITSVIPCP---------IVNAHPMQTRGKS

Query:  GIFRPKALISKTYD---EIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLV-SKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQ
        GI +P    S         EP  A  A+K   WR+AM  E +  + N TW LV   P +  +VGC+W+F  K +SDG++ RYKARLVAKG++Q
Subjt:  GIFRPKALISKTYD---EIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLV-SKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQ

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.9e-1339.09Show/hide
Query:  EPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQTLDVNFF--FRLLARLLNQLLFVFFLL
        EP     A +   W  AM DE   +    TW + + P N+K +GCKWV+K+K +SDGTI RYKARLVAKG+ Q   ++F   F  + +L +  L +    
Subjt:  EPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQTLDVNFF--FRLLARLLNQLLFVFFLL

Query:  LLLHTTRPLD
        +   T   LD
Subjt:  LLLHTTRPLD

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)9.8e-2454.29Show/hide
Query:  MQTRGKSGI--FRPK-ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQT
        M TR K+GI    PK +L   T  + EP +   ALK   W +AMQ+E D L +NKTW LV  P+NQ ++GCKWVFK K HSDGT+ R KARLVAKGFHQ 
Subjt:  MQTRGKSGI--FRPK-ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQT

Query:  LDVNF
          + F
Subjt:  LDVNF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCATGCTTCTATGCCTCTTGCTTTTTGGGATGATGCTTTTTCTACTGCTATTCATTTGATTAACAGGCTATCTACTCCGGTTCTTCATGGTGTTAGTCCCTTGCA
GAAGCTTTTTGATCGGGTGCCAGATTACTCACAACTTTGTGTTTTTGGATTTAAGTGTTTTCCCTATCTTCGTCCATATAAGCTTGACTTTCGTTCTAAACCTTGCACTT
TTCTTGGGTATAGTTCCATGCATAAAGGGTATAAATGTCTTGATGAGAATGGAAAACTCTATGTATCTAGGAACATTCTATTTGATGAACTTACATTTTCTTTTGCTACT
AAAGGCCATTCACCCAAAACCACTTCCAAATCAGTTGTTACTTCTTCTATTCCTATTCTCTCTCAACCACCTCAGTCCTCTTCTCAGTCTCTCTCCACTTTACAACCTCC
CAATGTTTCAGTAAATGATTCTTCCACTTTACAAAATTCAAGCCCATTGTTGAGCTCTCCACCTAGCCAGCCTGTTGCCTCGGACACTCATTCTCTTCAAAGTAATGGTG
CAAATGACATAGCACCCATTACATCTGTCATACCTTGTCCTATTGTTAATGCCCATCCTATGCAGACAAGGGGGAAGAGTGGCATTTTTCGACCAAAGGCTCTTATTTCT
AAAACCTATGATGAAATTGAACCACCAAATGCCAAGGTAGCCCTCAAATGTGCCCATTGGAGAAAAGCAATGCAAGATGAGTATGATGATTTGATGCAAAATAAAACATG
GACCTTAGTCTCCAAACCTATAAATCAGAAAGTGGTTGGCTGCAAGTGGGTTTTCAAAGTCAAGAAACATTCGGATGGTACTATTGCTCGATACAAGGCACGTCTTGTTG
CTAAGGGGTTTCATCAAACTCTTGATGTAAATTTTTTTTTTAGACTTTTAGCCCGGTTGTTAAACCAATTACTATTCGTGTTCTTCTTACTCTTGCTCTTACATACAACT
AGACCATTAGACAAATAG
mRNA sequenceShow/hide mRNA sequence
GGAGTTTTTCATCTCTTTATATCTGCTTTTCGTGCTCACTACTCCATTGATATTCTGTAGAATGAGCTTCTCAGTGTTATTCTTCAATGAGTAAATACAAAGAGGTTTTC
TCCTGGAAATTCTTGTGTTCTTGGTTTATTGCAGCTGGTTTCATGGTATCCGAGCTCTAAAGTTTAACGACGAATTTTTTTTTTTTCAAATGGAAAGCTTCAAAATGGCC
TCCGATCTACAAAGCGTCTTGCCCGCTCCAACGACGATTGTTAATCCGGGAAACAAAGTTTCCACTGTTCTTCTGAGTAGTGAGAATTTCTTGTTGTGGAAATTTTAGGT
GGAGTTTGCTCTAGAGGGATATGGTCTATTCTCGGCACACATCGATCCTGACTCGATTTCTCCACCAGAGAAAATTCAAGTTCGAGACGACTTCGATAAGCCTAATCCCG
AGTATACAGCATGGAAGAAGCAAGATAGACTGATTTGTTCATGGTTGCTTGGGTCAATGTCTGAAGATATACTCCAACAAATGCTCCACTGTACTTCGGCAAAGGAAATA
TAGGCGTGCCTTCTACAAATCTTCAATTCCAGAAGCCTTGCTCAGGTCATGAAGCTAAAGTCAACTCTCTAAAATATGAAGAAAGGTAATTCCTCACTAAGTGATTATTT
CGCCAAGATTAAACGAATAGTTGACTCTTTAGCTGCAGTTGATAAATCAATTTCATATGAGGACCATATACTGTACATCTTAGCTGGTTTGGGATCTGAGTATGAATCCA
TGATTTCGGTCATCACGGCTAAGGTTAATACAGACTCTGTTCAAGATATCATGGCTCTCCTACTCACACACGAGACACGACTTGAGTCGAAAACTGTTAATGCTGATGGC
AGTGTTCCTTCGGCTAATGTGGTGCAACAACAGCCTACATACAGACCCTCCTCTGAATCTAAACCCCAGCGCTATCAAAATTTTAACTCGGGAAATGGCAGGGGACGGGG
AAAAAACAATGGACGAGGAGGTAGTCGCTCGGGCGGCAGAAACAAGTTTTTCTGCACCATTTGCAACAAACATGGTCATACATCGAGCAGATGTTACTACCGGAATGATG
CTCCCTCACATCATGCTCGCCCGATGTTTGCTAATCAGAGTATTGGACCAATGTTCCCATCTAATTTTCAGCAACCACCAATGTCATATCAAATGGCACCTAGTTATGGA
TATCAGTACTCACCACAGCCACATGGATACTATGGAATGCAGGCTTCTTCCACTTTCAATTCTGACAATAATTGGTATCCGAATTCGGGAGCAACAAATCATTTAACAAA
CAACTTTGGCAATCTATCAATGGGCTCGGAGTTTGGTGGTTCTAGTCAAGTTCATGTTGGTAATGGTGCAAGTCTGCCTATTTCTCACACTGGATCTAGCAACTGGTCGG
GCTCTGCTCCAAGGGACTCTACATGAAGGACTATATCGATTTCCTATGACTTTCTCCTTATTCAAAGTCTTTAGCTGTTAATTCTGTTCAAACTCATTTCAGTACTGTGC
ATCCTGCTTGTTTTAGTTCTGTTGTTCCTCATACGAAACTTTATTTGTGGCAACAACGGCTTGGTCATCCTGCTTTTCTTATTGTTCAAAACATTGTTAAAAGTAGTATG
CATGCTGCTTTGTCTAAAAATAATTCAAGTTCTTTTTGCAATGCATGTGCTCTTGGTAAAATACATGCCGCCCCTTATTCTAAATCATTAACTGTGTATACCCGTCCCTT
ACAACTTGTTGTTATTGATTTATGGGGCCCGGCTTATACTGTTTCTAGGAATGGTTTCAAGTATTACATGAGTTTTATTGATGTTTTCTCTAGATTCACCTGGATTTATT
TTCTGGAATCTAAATCTGATGCCTCTTCTATGCTTCATACTTTTAAAACACATGTTGAAAAACTTTGGGTGCACCCATTGTTCGTGTCCAAACTGATGGTGGTTCTGAGT
TTAAGCCTCTTATCCCTGTTTTCGAATCCAATGGTATCACTCATCGGTTAGCGTGTCCTTACACCTCAAAACAAAACGGTATAGTTGAACGCAAACACAGACATATAGTT
GAAACAGGCTTAACCCTTATGTCTCATGCTTCTATGCCTCTTGCTTTTTGGGATGATGCTTTTTCTACTGCTATTCATTTGATTAACAGGCTATCTACTCCGGTTCTTCA
TGGTGTTAGTCCCTTGCAGAAGCTTTTTGATCGGGTGCCAGATTACTCACAACTTTGTGTTTTTGGATTTAAGTGTTTTCCCTATCTTCGTCCATATAAGCTTGACTTTC
GTTCTAAACCTTGCACTTTTCTTGGGTATAGTTCCATGCATAAAGGGTATAAATGTCTTGATGAGAATGGAAAACTCTATGTATCTAGGAACATTCTATTTGATGAACTT
ACATTTTCTTTTGCTACTAAAGGCCATTCACCCAAAACCACTTCCAAATCAGTTGTTACTTCTTCTATTCCTATTCTCTCTCAACCACCTCAGTCCTCTTCTCAGTCTCT
CTCCACTTTACAACCTCCCAATGTTTCAGTAAATGATTCTTCCACTTTACAAAATTCAAGCCCATTGTTGAGCTCTCCACCTAGCCAGCCTGTTGCCTCGGACACTCATT
CTCTTCAAAGTAATGGTGCAAATGACATAGCACCCATTACATCTGTCATACCTTGTCCTATTGTTAATGCCCATCCTATGCAGACAAGGGGGAAGAGTGGCATTTTTCGA
CCAAAGGCTCTTATTTCTAAAACCTATGATGAAATTGAACCACCAAATGCCAAGGTAGCCCTCAAATGTGCCCATTGGAGAAAAGCAATGCAAGATGAGTATGATGATTT
GATGCAAAATAAAACATGGACCTTAGTCTCCAAACCTATAAATCAGAAAGTGGTTGGCTGCAAGTGGGTTTTCAAAGTCAAGAAACATTCGGATGGTACTATTGCTCGAT
ACAAGGCACGTCTTGTTGCTAAGGGGTTTCATCAAACTCTTGATGTAAATTTTTTTTTTAGACTTTTAGCCCGGTTGTTAAACCAATTACTATTCGTGTTCTTCTTACTC
TTGCTCTTACATACAACTAGACCATTAGACAAATAGATGTTAATAATTCTTTTTTACATGGTATTCTAACCGAAACTGTTTACATGGAACAACCGATCGGTTTTATCTCT
CCAAATGGAAGAAATGAAGTATGCAAGTTACATAAGGCTTTGTATGGCTTGAAACAAGCCCCAAGAGCCTGGTTTGAGAGATTAACTGCTTGCTTGAAGAGCTTGGGTTT
TGAACATTCTCATGCAGATACGTCTTTGCTGTTTAGACATACATTGAATAGTTGTTGTTATGTTCTCGTTTATGTTGATGATATCTTAGTGATGGGGAACTCTGCTTCTG
TGGTCTCTGACTTGATCTCTAGACTCAATGCTTCGTTTTCGCTTAAGGATCTAGGACCATTAAACTATTTCATTGGCATTGAAGTATCCTACCCACCTACAGGAGGAATT
TTCTTATCGCAGTCGAAATATGTTCTAGATTTACTTCGCAAAACCAATATGAGTGATGCTAATGCTATGAATACTCCCATGGTAAGTGGGAGCCTTCCGTCTGCTATTGG
GGGAGAAATGTTTTCTGATGTCACATTATACAAGAGTGTTGTAGGAGCATTGCAGTATGTTCTCCTTACTCGGCCTGAACTCTCTTTTAGTGTAAACAAGGCTTGTCAAT
TCATGCACTCTCCAAAGGCTATTCACTAGAAATTGGTCAAACGCATTTTACGATACTTACAAGGCACTCGCTCTTCTGGTCTATTACTGACAAAACCTACATCTTTAACC
TTACAGGGGTATGCAGATTCTAATTGGGCGTCTGATCCTGATGATAGGAAATCTACCTCGGGTCATTGTATTTATTTTGGTGGAAATTTAATTTCATGGGGATCTAAGAA
GCAAACTATTATTTCTCGTTCTAGTACTGAGGCAGAGTATAGATGTCTAGCTACTGCTGCTACTGAACTTATCTGGTTGAATTCTCTGTTTGCTGACTTGAGAATATCTT
ATGCTGGTCCTCCTATTCTGTGGTGTGATAATTTAGGTGATGTCCACTTAAGTATGAATCCTGTTTTACATTCTAAAACTAAGCATGTGGAGTTAGATATCTATTATGTG
CGTGACTTAGTTCATAACAGGAAACTTGTTGTTCGCCATCTTCCCATGACTATGCAGATTGCTGATATATTTATGAAGCCATTGTCTGCTCATACTTTCCTTCCTCTTCG
ATTCAAGCTCAATGTTCGTGATCCTCCAACCATAGGCTTGCGGGGGGTATTAGGAAAGACTCTCATTGACACATCAGCCCAAGCCCATGTAGTTTAAGTGGGCCGTTTAT
TGTGTTATTTCTTTGTAATGACTATTTAGCCCATGTGAGACGATACTTCTCTGCTTGTTGATGTATGTTTAATTCTAGGAGTTTTTCATCTCTTTGTATCTGCTTTTCGT
Protein sequenceShow/hide protein sequence
MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPYKLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFAT
KGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALIS
KTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQTLDVNFFFRLLARLLNQLLFVFFLLLLLHTT
RPLDK