; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0185391 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0185391
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr07:3477778..3478975
RNA-Seq ExpressionCmc07g0185391
SyntenyCmc07g0185391
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034956.1 F5J5.1 [Cucumis melo var. makuwa]2.8e-12671.43Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLLARNKWTPAATHVKLTRDT
        MDVKSAFLNGYLNE+VYVAQPKGFVDS+ P++VYKLNKALYGLKQAP A YE LT+YL  + Y+K ++ K        DQ  +++K TPAATHVK+T+D 
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLLARNKWTPAATHVKLTRDT

Query:  YGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLG
            +DHKLYRSI+ +LLYLTASRPDIAY VGICA++Q+DPR SHL  VK I+KYVHGT+DFG++YSYDTT  LV YCDADW GS  DR+STS GCFFLG
Subjt:  YGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLG

Query:  NNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNL
             W SKKQ CVSLST E EYI  GSGCTQLIWMKNMLHEYG+ QD M LYCDNMSAIDISK PVQHS+TKHIDIRHHFIREL+ENK+I L H+RS  
Subjt:  NNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNL

Query:  QLADIFTKPLDANSFKYLRAGL
        QLADIFTKPLD  +F++LRAGL
Subjt:  QLADIFTKPLDANSFKYLRAGL

KAA0066740.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.1e-12864.21Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLL------------------
        MDVKS FLNGYLNEEVYVAQPKGFVDSEH K+VYKLNKALYGLKQAPRAWY+ LTVYLRGKGYS+GEIDKTLFIHRKSDQLL                  
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLL------------------

Query:  ------------------------------------------------------ARNKWTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA
                                                              ARNK TPA THVKLT+D  GA+VDHKLYRSIVG+LLYLTASRPDIA
Subjt:  ------------------------------------------------------ARNKWTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA

Query:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS
        Y +GI ARYQ  PRI+HLEA+K ILKYVH T DFGMMYSYDTT TLV YCDADWAGS  DRK                             EAEYI AGS
Subjt:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS

Query:  GCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFKYLRAGL
        GCTQLIWMKN+LHEYG DQDTM LYC+NMSAIDISKN VQHSRTKHIDIRHHFIRE VE KVI+LDHIRSNLQLA+IFTKPLDA+SF+YL AGL
Subjt:  GCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFKYLRAGL

TYJ97126.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.0e-12972.75Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGE----IDKTLFIH---RKSDQLLARNKWTPAATH
        MDVKSAFLNGYLNEEVYV QPKGFVDSEHPK+VYKLNKALYGLKQA RAWY+ LTVYLRG    +      I +  ++    +K     ARNK TPAATH
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGE----IDKTLFIH---RKSDQLLARNKWTPAATH

Query:  VKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTS
        VKLT+DT GAKVDHKLYRSI G+LLYLTASRPDIAY +GI ARYQA+PRI+HLEAVK ILKYVHGTSDFGMMYSYDTT T+V YCDADWAGS  D K   
Subjt:  VKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTS

Query:  RGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRL
                                 TEAEY+ AGSGCTQLIWM+NML EYG DQDTM LYCDNMSAIDISKNPVQHSRTKHIDIRHHF+RELVE+KVI+ 
Subjt:  RGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRL

Query:  DHIRSNLQLADIFTKPLDANSFKYLRAGLRVCRT
        DHIRSNLQLADIFTKPLDA+SF+YL AGL VCRT
Subjt:  DHIRSNLQLADIFTKPLDANSFKYLRAGLRVCRT

TYK21311.1 F5J5.1 [Cucumis melo var. makuwa]1.6e-12671.3Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLLARNKWTPAATHVKLTRDT
        MDVKSAFLNGYLNE+VYVAQPKGFVDS+ P++VYKLNKALYGLKQAP A YE LT+YL  + Y+K ++ K        DQ  +++K TPAATHVK+T+D 
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLLARNKWTPAATHVKLTRDT

Query:  YGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLG
            +DHKLYRSI+ +LLYLTASRPDIAY VGICA++Q+DPR SHL  VK I+KYVHGT+DFG++YSYDTT  LV YCDADW GS  DR+STS GCFFLG
Subjt:  YGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLG

Query:  NNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNL
             W SKKQ CVSLST E EYI  GSGCTQLIWMKNMLHEYG+ QD M LYCDNMSAIDISK PVQHS+TKHIDIRHHFIREL+ENK+I L H+RS  
Subjt:  NNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNL

Query:  QLADIFTKPLDANSFKYLRAGLRV
        QLADIFTKPLD  +F++LRAGL V
Subjt:  QLADIFTKPLDANSFKYLRAGLRV

TYK23188.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.4e-14772.82Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLL------------------
        MDVKSAFLNGYLNEEVYVAQPK FVDSEHPK+VYKLNKALYGLKQAPR WYE LTVYLRGKGYS+GEIDKTLFIHRKSDQLL                  
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLL------------------

Query:  ------------------------------------------------------ARNKWTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA
                                                              ARNK TPAATHVKLTRD  GA+VDHKLYRSIV NLLYLTASRPDIA
Subjt:  ------------------------------------------------------ARNKWTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA

Query:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS
        Y VGICARYQADPRISHLEAVK ILKYVHGT+DFGMMYSYDTT TLV YCDADWAG   DRKSTS GCFFLGNNLI WLSKKQNCVSLSTTEAEYI+AGS
Subjt:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS

Query:  GCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFT
        GCTQLIWM+N+L EYG DQ T+ LY DNMSAIDISKNPVQHSR KHIDIRHHFIRELVE+KVIRLDHIRSNLQLADIFT
Subjt:  GCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFT

TrEMBL top hitse value%identityAlignment
A0A5A7SYI8 F5J5.11.4e-12671.43Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLLARNKWTPAATHVKLTRDT
        MDVKSAFLNGYLNE+VYVAQPKGFVDS+ P++VYKLNKALYGLKQAP A YE LT+YL  + Y+K ++ K        DQ  +++K TPAATHVK+T+D 
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLLARNKWTPAATHVKLTRDT

Query:  YGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLG
            +DHKLYRSI+ +LLYLTASRPDIAY VGICA++Q+DPR SHL  VK I+KYVHGT+DFG++YSYDTT  LV YCDADW GS  DR+STS GCFFLG
Subjt:  YGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLG

Query:  NNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNL
             W SKKQ CVSLST E EYI  GSGCTQLIWMKNMLHEYG+ QD M LYCDNMSAIDISK PVQHS+TKHIDIRHHFIREL+ENK+I L H+RS  
Subjt:  NNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNL

Query:  QLADIFTKPLDANSFKYLRAGL
        QLADIFTKPLD  +F++LRAGL
Subjt:  QLADIFTKPLDANSFKYLRAGL

A0A5D3BDQ5 Gag-pol polyprotein2.9e-12972.75Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGE----IDKTLFIH---RKSDQLLARNKWTPAATH
        MDVKSAFLNGYLNEEVYV QPKGFVDSEHPK+VYKLNKALYGLKQA RAWY+ LTVYLRG    +      I +  ++    +K     ARNK TPAATH
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGE----IDKTLFIH---RKSDQLLARNKWTPAATH

Query:  VKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTS
        VKLT+DT GAKVDHKLYRSI G+LLYLTASRPDIAY +GI ARYQA+PRI+HLEAVK ILKYVHGTSDFGMMYSYDTT T+V YCDADWAGS  D K   
Subjt:  VKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTS

Query:  RGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRL
                                 TEAEY+ AGSGCTQLIWM+NML EYG DQDTM LYCDNMSAIDISKNPVQHSRTKHIDIRHHF+RELVE+KVI+ 
Subjt:  RGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRL

Query:  DHIRSNLQLADIFTKPLDANSFKYLRAGLRVCRT
        DHIRSNLQLADIFTKPLDA+SF+YL AGL VCRT
Subjt:  DHIRSNLQLADIFTKPLDANSFKYLRAGLRVCRT

A0A5D3DCC3 F5J5.17.9e-12771.3Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLLARNKWTPAATHVKLTRDT
        MDVKSAFLNGYLNE+VYVAQPKGFVDS+ P++VYKLNKALYGLKQAP A YE LT+YL  + Y+K ++ K        DQ  +++K TPAATHVK+T+D 
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLLARNKWTPAATHVKLTRDT

Query:  YGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLG
            +DHKLYRSI+ +LLYLTASRPDIAY VGICA++Q+DPR SHL  VK I+KYVHGT+DFG++YSYDTT  LV YCDADW GS  DR+STS GCFFLG
Subjt:  YGAKVDHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLG

Query:  NNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNL
             W SKKQ CVSLST E EYI  GSGCTQLIWMKNMLHEYG+ QD M LYCDNMSAIDISK PVQHS+TKHIDIRHHFIREL+ENK+I L H+RS  
Subjt:  NNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNL

Query:  QLADIFTKPLDANSFKYLRAGLRV
        QLADIFTKPLD  +F++LRAGL V
Subjt:  QLADIFTKPLDANSFKYLRAGLRV

A0A5D3DI97 Gag-pol polyprotein3.1e-14772.82Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLL------------------
        MDVKSAFLNGYLNEEVYVAQPK FVDSEHPK+VYKLNKALYGLKQAPR WYE LTVYLRGKGYS+GEIDKTLFIHRKSDQLL                  
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLL------------------

Query:  ------------------------------------------------------ARNKWTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA
                                                              ARNK TPAATHVKLTRD  GA+VDHKLYRSIV NLLYLTASRPDIA
Subjt:  ------------------------------------------------------ARNKWTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA

Query:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS
        Y VGICARYQADPRISHLEAVK ILKYVHGT+DFGMMYSYDTT TLV YCDADWAG   DRKSTS GCFFLGNNLI WLSKKQNCVSLSTTEAEYI+AGS
Subjt:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS

Query:  GCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFT
        GCTQLIWM+N+L EYG DQ T+ LY DNMSAIDISKNPVQHSR KHIDIRHHFIRELVE+KVIRLDHIRSNLQLADIFT
Subjt:  GCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFT

A0A5D3DWS6 Gag-pol polyprotein2.5e-12864.21Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLL------------------
        MDVKS FLNGYLNEEVYVAQPKGFVDSEH K+VYKLNKALYGLKQAPRAWY+ LTVYLRGKGYS+GEIDKTLFIHRKSDQLL                  
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLL------------------

Query:  ------------------------------------------------------ARNKWTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA
                                                              ARNK TPA THVKLT+D  GA+VDHKLYRSIVG+LLYLTASRPDIA
Subjt:  ------------------------------------------------------ARNKWTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA

Query:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS
        Y +GI ARYQ  PRI+HLEA+K ILKYVH T DFGMMYSYDTT TLV YCDADWAGS  DRK                             EAEYI AGS
Subjt:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS

Query:  GCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFKYLRAGL
        GCTQLIWMKN+LHEYG DQDTM LYC+NMSAIDISKN VQHSRTKHIDIRHHFIRE VE KVI+LDHIRSNLQLA+IFTKPLDA+SF+YL AGL
Subjt:  GCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFKYLRAGL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-4832.49Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRK-------------SDQLLARNKW
        MDVK+AFLNG L EE+Y+  P+G   S +  NV KLNKA+YGLKQA R W+E+    L+   +    +D+ ++I  K              D ++A    
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRK-------------SDQLLARNKW

Query:  T--------------------------------------PAATHVKLTRDTYG------------AKVDHKLY----------RSIVGNLLY-LTASRPD
        T                                        + +VK     +             +K++++L           RS++G L+Y +  +RPD
Subjt:  T--------------------------------------PAATHVKLTRDTYG------------AKVDHKLY----------RSIVGNLLY-LTASRPD

Query:  IAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTL--TLVRYCDADWAGSGVDRKSTSRGCFFLGN-NLISWLSKKQNCVSLSTTEAEY
        +   V I +RY +       + +K +L+Y+ GT D  +++  +      ++ Y D+DWAGS +DRKST+   F + + NLI W +K+QN V+ S+TEAEY
Subjt:  IAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTL--TLVRYCDADWAGSGVDRKSTSRGCFFLGN-NLISWLSKKQNCVSLSTTEAEY

Query:  IIAGSGCTQLIWMKNMLHEYGLD-QDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFKYLR
        +       + +W+K +L    +  ++ + +Y DN   I I+ NP  H R KHIDI++HF RE V+N VI L++I +  QLADIFTKPL A  F  LR
Subjt:  IIAGSGCTQLIWMKNMLHEYGLD-QDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFKYLR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-5735.43Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSD---------------------
        +DVK+AFL+G L EE+Y+ QP+GF  +     V KLNK+LYGLKQAPR WY     +++ + Y K   D  ++  R S+                     
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSD---------------------

Query:  ----------------------QLL---------ARNKW-----------------------TPAATHVKLTRDTYGAKVDHK------LYRSIVGNLLY
                              Q+L         +R  W                       TP A H+KL++      V+ K       Y S VG+L+Y
Subjt:  ----------------------QLL---------ARNKW-----------------------TPAATHVKLTRDTYGAKVDHK------LYRSIVGNLLY

Query:  -LTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLST
         +  +RPDIA+ VG+ +R+  +P   H EAVK IL+Y+ GT+   + +     + L  Y DAD AG   +RKS++   F      ISW SK Q CV+LST
Subjt:  -LTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLST

Query:  TEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFK
        TEAEYI A     ++IW+K  L E GL Q   ++YCD+ SAID+SKN + H+RTKHID+R+H+IRE+V+++ +++  I +N   AD+ TK +  N F+
Subjt:  TEAEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFK

P92519 Uncharacterized mitochondrial protein AtMg008104.6e-2340.56Show/hide
Query:  VKLTRDTYGAKV-DHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKST
        +KL      AK  D   +RSIVG L YLT +RPDI+Y V I  +   +P ++  + +K +L+YV GT   G+    ++ L +  +CD+DWAG    R+ST
Subjt:  VKLTRDTYGAKV-DHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKST

Query:  SRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIW
        +  C FLG N+ISW +K+Q  VS S+TE EY        +L W
Subjt:  SRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.3e-5735.59Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFI--------------------------
        +DV +AFL G L ++VY++QP GF+D + P  V KL KALYGLKQAPRAWY  L  YL   G+     D +LF+                          
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFI--------------------------

Query:  -HRKSD--------------------------------------QLLARNKW-------TPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA
         H   D                                       LLAR          TP A   KL+  +     D   YR IVG+L YL  +RPDI+
Subjt:  -HRKSD--------------------------------------QLLARNKW-------TPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA

Query:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS
        Y V   +++   P   HL+A+K IL+Y+ GT + G+      TL+L  Y DADWAG   D  ST+    +LG++ ISW SKKQ  V  S+TEAEY    +
Subjt:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS

Query:  GCTQLIWMKNMLHEYGLD-QDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFKYLRAGLRVCR
          +++ W+ ++L E G+      ++YCDN+ A  +  NPV HSR KHI I +HFIR  V++  +R+ H+ ++ QLAD  TKPL   +F+   + + V R
Subjt:  GCTQLIWMKNMLHEYGLD-QDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFKYLRAGLRVCR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-5735.99Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFI--------------------------
        +DV +AFL G L +EVY++QP GFVD + P  V +L KA+YGLKQAPRAWY  L  YL   G+     D +LF+                          
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFI--------------------------

Query:  ---------------------------------------HRKSDQLLARNKW-------TPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA
                                                R +  LLAR          TP AT  KLT  +     D   YR IVG+L YL  +RPD++
Subjt:  ---------------------------------------HRKSDQLLARNKW-------TPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASRPDIA

Query:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS
        Y V   ++Y   P   H  A+K +L+Y+ GT D G+      TL+L  Y DADWAG   D  ST+    +LG++ ISW SKKQ  V  S+TEAEY    +
Subjt:  YVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGS

Query:  GCTQLIWMKNMLHEYGLD-QDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFK
          ++L W+ ++L E G+      ++YCDN+ A  +  NPV HSR KHI + +HFIR  V++  +R+ H+ ++ QLAD  TKPL   +F+
Subjt:  GCTQLIWMKNMLHEYGLD-QDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.6e-4130.75Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFV----DSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKS------------------
        +D+ +AFLNG L+EE+Y+  P G+     DS  P  V  L K++YGLKQA R W+   +V L G G+ +   D T F+   +                  
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFV----DSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKS------------------

Query:  -----DQLLARNK-------------------------------------------------WTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASR
             D+L ++ K                                                   P    V  +  + G  VD K YR ++G L+YL  +R
Subjt:  -----DQLLARNK-------------------------------------------------WTPAATHVKLTRDTYGAKVDHKLYRSIVGNLLYLTASR

Query:  PDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYI
         DI++ V   +++   PR++H +AV  IL Y+ GT   G+ YS    + L  + DA +      R+ST+  C FLG +LISW SKKQ  VS S+ EAEY 
Subjt:  PDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTEAEYI

Query:  IAGSGCTQLIWMKNMLHEYGLD-QDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRE
               +++W+     E  L      +L+CDN +AI I+ N V H RTKHI+   H +RE
Subjt:  IAGSGCTQLIWMKNMLHEYGLD-QDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein6.6e-0934.09Show/hide
Query:  LYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGC-----FFLG
        +YLT +RPD+ + V   +++ +  R + ++AV  +L YV GT   G+ YS  + L L  + D+DWA     R+S +  C     +FLG
Subjt:  LYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGC-----FFLG

ATMG00810.1 DNA/RNA polymerases superfamily protein3.3e-2440.56Show/hide
Query:  VKLTRDTYGAKV-DHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKST
        +KL      AK  D   +RSIVG L YLT +RPDI+Y V I  +   +P ++  + +K +L+YV GT   G+    ++ L +  +CD+DWAG    R+ST
Subjt:  VKLTRDTYGAKV-DHKLYRSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKST

Query:  SRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIW
        +  C FLG N+ISW +K+Q  VS S+TE EY        +L W
Subjt:  SRGCFFLGNNLISWLSKKQNCVSLSTTEAEYIIAGSGCTQLIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTAAAGAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCTGAGCATCCAAAGAATGTGTATAAGCTCAA
CAAAGCTTTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGAATTACTAACTGTTTACTTGAGAGGTAAAGGATATTCTAAAGGAGAAATTGACAAGACCTTATTTA
TACACAGGAAATCTGATCAACTTTTGGCTCGAAATAAGTGGACTCCAGCTGCAACGCATGTTAAACTTACAAGAGACACTTATGGTGCTAAAGTTGATCACAAACTCTAC
AGGAGTATAGTAGGCAACTTACTATATTTAACAGCAAGTCGACCTGACATAGCTTATGTTGTGGGAATATGTGCCCGTTATCAGGCTGATCCCCGCATCTCTCACCTAGA
AGCTGTTAAAATAATTCTTAAGTATGTTCATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCACCCTCACTCTTGTTAGATATTGTGATGCTGACTGGGCAG
GTTCGGGTGTTGATCGTAAAAGTACGTCTAGAGGATGTTTCTTTTTAGGAAACAATCTAATTTCTTGGTTAAGTAAGAAGCAGAACTGTGTCTCTTTATCTACAACTGAA
GCTGAATATATAATAGCAGGTAGTGGTTGTACACAGTTGATTTGGATGAAAAATATGTTGCATGAATATGGCCTTGATCAGGATACTATGATGTTGTATTGTGACAATAT
GAGCGCAATTGATATATCCAAGAATCCTGTTCAACATAGTCGAACAAAGCACATTGACATAAGACATCATTTTATTCGTGAACTTGTTGAAAATAAAGTAATTAGGCTTG
ATCATATTCGTTCCAACTTACAATTAGCCGATATTTTCACTAAACCTCTGGATGCAAACTCATTCAAATATTTACGTGCTGGTTTACGAGTGTGTCGCACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTAAAGAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCTGAGCATCCAAAGAATGTGTATAAGCTCAA
CAAAGCTTTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGAATTACTAACTGTTTACTTGAGAGGTAAAGGATATTCTAAAGGAGAAATTGACAAGACCTTATTTA
TACACAGGAAATCTGATCAACTTTTGGCTCGAAATAAGTGGACTCCAGCTGCAACGCATGTTAAACTTACAAGAGACACTTATGGTGCTAAAGTTGATCACAAACTCTAC
AGGAGTATAGTAGGCAACTTACTATATTTAACAGCAAGTCGACCTGACATAGCTTATGTTGTGGGAATATGTGCCCGTTATCAGGCTGATCCCCGCATCTCTCACCTAGA
AGCTGTTAAAATAATTCTTAAGTATGTTCATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCACCCTCACTCTTGTTAGATATTGTGATGCTGACTGGGCAG
GTTCGGGTGTTGATCGTAAAAGTACGTCTAGAGGATGTTTCTTTTTAGGAAACAATCTAATTTCTTGGTTAAGTAAGAAGCAGAACTGTGTCTCTTTATCTACAACTGAA
GCTGAATATATAATAGCAGGTAGTGGTTGTACACAGTTGATTTGGATGAAAAATATGTTGCATGAATATGGCCTTGATCAGGATACTATGATGTTGTATTGTGACAATAT
GAGCGCAATTGATATATCCAAGAATCCTGTTCAACATAGTCGAACAAAGCACATTGACATAAGACATCATTTTATTCGTGAACTTGTTGAAAATAAAGTAATTAGGCTTG
ATCATATTCGTTCCAACTTACAATTAGCCGATATTTTCACTAAACCTCTGGATGCAAACTCATTCAAATATTTACGTGCTGGTTTACGAGTGTGTCGCACTTAA
Protein sequenceShow/hide protein sequence
MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKNVYKLNKALYGLKQAPRAWYELLTVYLRGKGYSKGEIDKTLFIHRKSDQLLARNKWTPAATHVKLTRDTYGAKVDHKLY
RSIVGNLLYLTASRPDIAYVVGICARYQADPRISHLEAVKIILKYVHGTSDFGMMYSYDTTLTLVRYCDADWAGSGVDRKSTSRGCFFLGNNLISWLSKKQNCVSLSTTE
AEYIIAGSGCTQLIWMKNMLHEYGLDQDTMMLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIRSNLQLADIFTKPLDANSFKYLRAGLRVCRT