CuGenDBv2

Cmc03g0062671 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0062671
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr03:2946805..2947944
RNA-Seq Expression: Cmc03g0062671
Synteny: Cmc03g0062671
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR043502 - DNA/RNA polymerase superfamily
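The genome location field packs a sequence identifier and a coordinate pair into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the string can be split and the span computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("CMiso1.1chr03:2946805..2947944")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene is 1140 positions.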


Homology
GenBank top hits (e value, % identity, and alignment per hit)
KAA0041516.1 gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 6.3e-154 | % identity: 94.22
Query:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD
        MSDRDHRRKWDSKSDRGIFLGYSAN+RAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKP EGELESTAPTNETTYLPSHLGSSRSD
Subjt:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD

Query:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH
        MSTPSTS I TDTHESEAS+SASQHT ERTTGATDS KCDLIP THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTT+SAALSD+H
Subjt:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH

Query:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLF
        WILAMQEELLQFERNQVW+LVPKPPYANIIGTKWIFKNK DEEGRVIRNKARLVAQGYSQIEGLDFGETF  VARLEAIRLLL Y CF+RFKLF
Subjt:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLF

KAA0043008.1 gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 1.8e-169 | % identity: 83.38
Query:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD
        +SDRDHRRKWDSKSDRGIFLGYS NSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSL  KP EGEL+STA  NETTYLPSHLGS RSD
Subjt:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD

Query:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH
        MSTPSTSTI TD    EA I A QHTLE+T GATDS KC+LIPPTHIAKNHPSSFI GDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSA LSD+H
Subjt:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH

Query:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS
        WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEE                             VARLEAIRLLLSYTCFRRFKLFQMDVKS
Subjt:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS

Query:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV
        AFLNGYL EEVYVAQPKGFV+  HQDHVYKLRKALY LKQAPRAWYERLSTYLLQQ   +GSADQTMFIYRQG DFLIV
Subjt:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV

KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 1.3e-172 | % identity: 84.17
Query:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD
        +SDRDHRRKWDSKSDRGIFLGY ANSRAYRVYNQ SK VMESINVIIDDL                        EGELES A TNETTYLPSHLG SR D
Subjt:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD

Query:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH
        MSTPSTS I  +THESEA +SASQHT E+T GATDS KCDLIPPTH AKNHPSSFII DIHSGIITRKKERKDYAKMVANVCYTS LEPTTVSAALSD+H
Subjt:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH

Query:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS
        WIL +QEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSY CF RFKLFQMDVKS
Subjt:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS

Query:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV
        AFLNGYL EEVYVAQPKGFV+ VH+DHVYKLRKALY LKQAPRAWYERLSTYLLQQG ++GSADQTMFIYRQG +FLIV
Subjt:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV

KAA0059939.1 gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 4.7e-133 | % identity: 82.3
Query:  MESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKC
        MESINVIIDDLGKEPNRNLDDEDEVFWNSLS KP EGELE TA                         TI TDT ESE S+SASQHT ERT GA+DS KC
Subjt:  MESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKC

Query:  DLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNK
        D IPPTHIAKNHPSSFII DIHSGIITRKKERKDY KMVANVCYTSSLEPT VSA LSD+HWIL MQEELLQFERNQ+WELVPKPPYANIIGTKWIFKNK
Subjt:  DLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNK

Query:  TDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLK
        TDEEGRVIRNKARLVAQ Y QIEGLDFGETF PVARLE IRLLLSY  FRRFKLFQMDVKSAFLNGYL EEVYVAQPKGFV+ VHQDHVYKL+KALYGLK
Subjt:  TDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLK

Query:  QAPRA
        QAPRA
Subjt:  QAPRA

KAA0060049.1 F9C16.17 [Cucumis melo var. makuwa] | e value: 7.7e-136 | % identity: 68.34
Query:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD
        +SDR+H RKWDSKSDRGIFLGYS NSRAYRVYNQR+K VME INVII DLGKEPNRNLDDEDE FW+SLS K V+ E EST+ T ETTY P H  S+R D
Subjt:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD

Query:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH
        MSTPSTS    +T E EA++SASQHT ERT G+TDS K  L+P T+IAK+HPSSFII D+HSGIITRKKERKDYAKMV N+CYTSSLEPTTVS AL+++H
Subjt:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH

Query:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS
        WILAMQ+ELLQFERN+VWELVPKPP+ANIIGTKWIFKNKTDE+GRVIRNKARLVAQGYSQIE                                      
Subjt:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS

Query:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV
                         GFV+ VH DHVY+LRKALYGLKQAPRAWYERLSTYLLQQG ++GS DQTMF+YRQG DFLIV
Subjt:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV

TrEMBL top hits (e value, % identity, and alignment per hit)
A0A5A7TJM5 Gag-pol polyprotein | e value: 8.8e-170 | % identity: 83.38
Query:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD
        +SDRDHRRKWDSKSDRGIFLGYS NSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSL  KP EGEL+STA  NETTYLPSHLGS RSD
Subjt:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD

Query:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH
        MSTPSTSTI TD    EA I A QHTLE+T GATDS KC+LIPPTHIAKNHPSSFI GDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSA LSD+H
Subjt:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH

Query:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS
        WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEE                             VARLEAIRLLLSYTCFRRFKLFQMDVKS
Subjt:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS

Query:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV
        AFLNGYL EEVYVAQPKGFV+  HQDHVYKLRKALY LKQAPRAWYERLSTYLLQQ   +GSADQTMFIYRQG DFLIV
Subjt:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV

A0A5A7V046 Gag-pol polyprotein | e value: 6.5e-173 | % identity: 84.17
Query:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD
        +SDRDHRRKWDSKSDRGIFLGY ANSRAYRVYNQ SK VMESINVIIDDL                        EGELES A TNETTYLPSHLG SR D
Subjt:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD

Query:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH
        MSTPSTS I  +THESEA +SASQHT E+T GATDS KCDLIPPTH AKNHPSSFII DIHSGIITRKKERKDYAKMVANVCYTS LEPTTVSAALSD+H
Subjt:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH

Query:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS
        WIL +QEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSY CF RFKLFQMDVKS
Subjt:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS

Query:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV
        AFLNGYL EEVYVAQPKGFV+ VH+DHVYKLRKALY LKQAPRAWYERLSTYLLQQG ++GSADQTMFIYRQG +FLIV
Subjt:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV

A0A5A7V293 Gag-pol polyprotein | e value: 2.3e-133 | % identity: 82.3
Query:  MESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKC
        MESINVIIDDLGKEPNRNLDDEDEVFWNSLS KP EGELE TA                         TI TDT ESE S+SASQHT ERT GA+DS KC
Subjt:  MESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKC

Query:  DLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNK
        D IPPTHIAKNHPSSFII DIHSGIITRKKERKDY KMVANVCYTSSLEPT VSA LSD+HWIL MQEELLQFERNQ+WELVPKPPYANIIGTKWIFKNK
Subjt:  DLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNK

Query:  TDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLK
        TDEEGRVIRNKARLVAQ Y QIEGLDFGETF PVARLE IRLLLSY  FRRFKLFQMDVKSAFLNGYL EEVYVAQPKGFV+ VHQDHVYKL+KALYGLK
Subjt:  TDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLK

Query:  QAPRA
        QAPRA
Subjt:  QAPRA

A0A5A7V2P3 F9C16.17 | e value: 3.7e-136 | % identity: 68.34
Query:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD
        +SDR+H RKWDSKSDRGIFLGYS NSRAYRVYNQR+K VME INVII DLGKEPNRNLDDEDE FW+SLS K V+ E EST+ T ETTY P H  S+R D
Subjt:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD

Query:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH
        MSTPSTS    +T E EA++SASQHT ERT G+TDS K  L+P T+IAK+HPSSFII D+HSGIITRKKERKDYAKMV N+CYTSSLEPTTVS AL+++H
Subjt:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH

Query:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS
        WILAMQ+ELLQFERN+VWELVPKPP+ANIIGTKWIFKNKTDE+GRVIRNKARLVAQGYSQIE                                      
Subjt:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS

Query:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV
                         GFV+ VH DHVY+LRKALYGLKQAPRAWYERLSTYLLQQG ++GS DQTMF+YRQG DFLIV
Subjt:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV

A0A5D3DM65 Gag-pol polyprotein | e value: 3.0e-154 | % identity: 94.22
Query:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD
        MSDRDHRRKWDSKSDRGIFLGYSAN+RAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKP EGELESTAPTNETTYLPSHLGSSRSD
Subjt:  MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSD

Query:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH
        MSTPSTS I TDTHESEAS+SASQHT ERTTGATDS KCDLIP THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTT+SAALSD+H
Subjt:  MSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKH

Query:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLF
        WILAMQEELLQFERNQVW+LVPKPPYANIIGTKWIFKNK DEEGRVIRNKARLVAQGYSQIEGLDFGETF  VARLEAIRLLL Y CF+RFKLF
Subjt:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLF

SwissProt top hits (e value, % identity, and alignment per hit)
P04146 Copia protein | e value: 8.2e-32 | % identity: 42.2
Query:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS
        W  A+  EL   + N  W +  +P   NI+ ++W+F  K +E G  IR KARLVA+G++Q   +D+ ETFAPVAR+ + R +LS       K+ QMDVK+
Subjt:  WILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS

Query:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG
        AFLNG L EE+Y+  P+G     + D+V KL KA+YGLKQA R W+E     L +      S D+ ++I  +G
Subjt:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 4.8e-32 | % identity: 29.38
Query:  RDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMST
        ++ R K D KS   IF+GY      YR+++   K V+ S +V+  +      R   D  E   N + P  V               +PS   +S +  S 
Subjt:  RDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMST

Query:  PSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALS--DKHW
         ST+   ++  E    +      L+      +        PT   + H    +       + +R+    +Y      V  +   EP ++   LS  +K+ 
Subjt:  PSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALS--DKHW

Query:  IL-AMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS
        ++ AMQEE+   ++N  ++LV  P     +  KW+FK K D + +++R KARLV +G+ Q +G+DF E F+PV ++ +IR +LS       ++ Q+DVK+
Subjt:  IL-AMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKS

Query:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYR
        AFL+G L EE+Y+ QP+GF     +  V KL K+LYGLKQAPR WY +  +++  Q   K  +D  ++  R
Subjt:  AFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYR

P92520 Uncharacterized mitochondrial protein AtMg00820 | e value: 5.5e-20 | % identity: 44.72
Query:  IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI
        ++TR K    K   K    +  T   EP +V  AL D  W  AMQEEL    RN+ W LVP P   NI+G KW+FK K   +G + R KARLVA+G+ Q 
Subjt:  IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI

Query:  EGLDFGETFAPVARLEAIRLLLS
        EG+ F ET++PV R   IR +L+
Subjt:  EGLDFGETFAPVARLEAIRLLLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 2.1e-43 | % identity: 36.82
Query:  ESTAPTNETTYLPSHLGSSRSDMSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMV
        ++T+  N T   PS L  S   +STP+ S+  + +  + AS S+       T+    S+     PP     N+ +   +     G   +    K   K  
Subjt:  ESTAPTNETTYLPSHLGSSRSDMSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMV

Query:  ANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPP-YANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE
          V   +  EP T   AL D+ W  AM  E+     N  W+LVP PP +  I+G +WIF  K + +G + R KARLVA+GY+Q  GLD+ ETF+PV +  
Subjt:  ANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPP-YANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE

Query:  AIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG
        +IR++L     R + + Q+DV +AFL G L ++VY++QP GF+++   ++V KLRKALYGLKQAPRAWY  L  YLL  G     +D ++F+ ++G
Subjt:  AIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 6.7e-42 | % identity: 37.26
Query:  NSLSPKPVEGELESTAPT----NETTYLPSHLGSSRSDMSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHS
        NS S  P+       +P+    N+ + LP      +S +S+P   T  T   E  +  S+S         +T  L   L  P  I  N  +     + HS
Subjt:  NSLSPKPVEGELESTAPT----NETTYLPSHLGSSRSDMSTPSTSTIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHS

Query:  GIITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELV-PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYS
         + TR K+  RK   K        ++ EP T   A+ D  W  AM  E+     N  W+LV P PP   I+G +WIF  K + +G + R KARLVA+GY+
Subjt:  GIITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELV-PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYS

Query:  QIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCK
        Q  GLD+ ETF+PV +  +IR++L     R + + Q+DV +AFL G L +EVY++QP GFV++   D+V +LRKA+YGLKQAPRAWY  L TYLL  G  
Subjt:  QIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCK

Query:  KGSADQTMFIYRQG
           +D ++F+ ++G
Subjt:  KGSADQTMFIYRQG

Arabidopsis top hits (e value, % identity, and alignment per hit)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 6.7e-37 | % identity: 39.41
Query:  VCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIR
        VC   + EP+T + A     W  AM +E+   E    WE+   PP    IG KW++K K + +G + R KARLVA+GY+Q EG+DF ETF+PV +L +++
Subjt:  VCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIR

Query:  LLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNR----VHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDF
        L+L+ +    F L Q+D+ +AFLNG L EE+Y+  P G+  R    +  + V  L+K++YGLKQA R W+ + S  L+  G  +  +D T F+      F
Subjt:  LLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNR----VHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDF

Query:  LIV
        L V
Subjt:  LIV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 3.9e-21 | % identity: 44.72
Query:  IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI
        ++TR K    K   K    +  T   EP +V  AL D  W  AMQEEL    RN+ W LVP P   NI+G KW+FK K   +G + R KARLVA+G+ Q 
Subjt:  IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI

Query:  EGLDFGETFAPVARLEAIRLLLS
        EG+ F ET++PV R   IR +L+
Subjt:  EGLDFGETFAPVARLEAIRLLLS


Sequences
CDS sequence
ATGAGTGATAGGGATCATCGCAGAAAGTGGGACTCAAAGTCTGATCGTGGAATATTTCTGGGATATTCTGCTAACAGCCGAGCCTACAGGGTCTACAACCAACGTTCCAA
AACAGTAATGGAATCCATTAACGTCATTATTGATGACCTTGGTAAGGAACCCAATAGAAATCTTGATGATGAAGATGAGGTTTTTTGGAATTCCCTTTCTCCTAAACCTG
TTGAAGGAGAGTTAGAATCGACGGCCCCCACTAATGAAACAACATACTTACCCTCTCATCTCGGTTCAAGCAGAAGTGACATGTCAACACCTTCTACATCAACCATTCAG
ACTGACACACATGAAAGTGAAGCATCAATATCTGCAAGTCAGCACACTCTAGAGCGAACTACGGGTGCAACTGATTCTTTAAAGTGTGACCTCATACCTCCTACGCATAT
AGCCAAAAACCACCCCTCCAGCTTCATTATTGGAGATATTCACAGTGGAATCATAACTCGGAAGAAGGAGAGGAAAGATTATGCGAAAATGGTTGCCAATGTATGCTACA
CATCTTCACTAGAACCGACCACGGTCTCTGCAGCACTCTCCGACAAACACTGGATCTTGGCTATGCAGGAAGAGCTACTCCAGTTTGAAAGAAACCAAGTATGGGAATTA
GTGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAACGGATGAAGAAGGTAGAGTTATTCGTAATAAAGCTAGACTGGTTGCTCAAGG
ATATTCTCAAATAGAAGGGCTGGATTTTGGAGAAACATTTGCCCCAGTTGCCAGATTAGAAGCCATCCGACTACTGCTAAGCTACACATGTTTTCGGAGATTCAAACTAT
TCCAAATGGATGTAAAGAGTGCGTTCCTAAATGGGTACTTATATGAGGAAGTATATGTGGCCCAGCCAAAAGGATTTGTTAATCGAGTACATCAGGATCATGTTTACAAA
CTTCGAAAGGCACTCTATGGACTTAAACAAGCTCCTAGAGCTTGGTATGAGAGACTCTCCACTTACCTGTTACAACAAGGATGTAAAAAGGGCAGTGCGGATCAAACTAT
GTTTATATATCGTCAAGGCATTGACTTTCTGATCGTTTAG
mRNA sequence
ATGAGTGATAGGGATCATCGCAGAAAGTGGGACTCAAAGTCTGATCGTGGAATATTTCTGGGATATTCTGCTAACAGCCGAGCCTACAGGGTCTACAACCAACGTTCCAA
AACAGTAATGGAATCCATTAACGTCATTATTGATGACCTTGGTAAGGAACCCAATAGAAATCTTGATGATGAAGATGAGGTTTTTTGGAATTCCCTTTCTCCTAAACCTG
TTGAAGGAGAGTTAGAATCGACGGCCCCCACTAATGAAACAACATACTTACCCTCTCATCTCGGTTCAAGCAGAAGTGACATGTCAACACCTTCTACATCAACCATTCAG
ACTGACACACATGAAAGTGAAGCATCAATATCTGCAAGTCAGCACACTCTAGAGCGAACTACGGGTGCAACTGATTCTTTAAAGTGTGACCTCATACCTCCTACGCATAT
AGCCAAAAACCACCCCTCCAGCTTCATTATTGGAGATATTCACAGTGGAATCATAACTCGGAAGAAGGAGAGGAAAGATTATGCGAAAATGGTTGCCAATGTATGCTACA
CATCTTCACTAGAACCGACCACGGTCTCTGCAGCACTCTCCGACAAACACTGGATCTTGGCTATGCAGGAAGAGCTACTCCAGTTTGAAAGAAACCAAGTATGGGAATTA
GTGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAACGGATGAAGAAGGTAGAGTTATTCGTAATAAAGCTAGACTGGTTGCTCAAGG
ATATTCTCAAATAGAAGGGCTGGATTTTGGAGAAACATTTGCCCCAGTTGCCAGATTAGAAGCCATCCGACTACTGCTAAGCTACACATGTTTTCGGAGATTCAAACTAT
TCCAAATGGATGTAAAGAGTGCGTTCCTAAATGGGTACTTATATGAGGAAGTATATGTGGCCCAGCCAAAAGGATTTGTTAATCGAGTACATCAGGATCATGTTTACAAA
CTTCGAAAGGCACTCTATGGACTTAAACAAGCTCCTAGAGCTTGGTATGAGAGACTCTCCACTTACCTGTTACAACAAGGATGTAAAAAGGGCAGTGCGGATCAAACTAT
GTTTATATATCGTCAAGGCATTGACTTTCTGATCGTTTAG
Protein sequence
MSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSTIQ
TDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWEL
VPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYK
LRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIV
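The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch, assuming the standard nuclear genetic code applies (the page does not say which table was used) and using a hypothetical helper named `translate`. It avoids external libraries by building the codon table from the conventional T, C, A, G base ordering.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, dropping trailing stop codons.

    Whitespace and line breaks are removed first, so a sequence pasted
    as wrapped lines can be passed in directly. Codons that are not in
    the table (for example, ones containing N) become 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )
    return protein.rstrip("*")


# First and last codons of the CDS listed above.
print(translate("ATGAGTGATAGGGATCATCGCAGAAAG"))
print(translate("GATTTTCTGATCGTTTAG"))
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence (after joining its lines) is one way to confirm that the two records agree.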