CuGenDBv2

Pay0006513 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0006513
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag-pol polyprotein
Genome location: chr07:22374031..22379861
RNA-Seq Expression: Pay0006513
Synteny: Pay0006513
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
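The genome location field uses a `seqid:start..end` form. The following is a minimal sketch of how such a string could be split into its parts. The helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end = parse_location("chr07:22374031..22379861")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 5831 positions on chr07.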


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0063011.1 F5J5.1 [Cucumis melo var. makuwa] (e-value: 5.9e-262; identity: 83.89%)
Query:  TFLTEFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR--------------------------------------------
        T+  +FEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR                                            
Subjt:  TFLTEFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR--------------------------------------------

Query:  -----------------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKA-----LNIACANPSEILSSDTLVCISSGE
                               DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAK         +     L ++ ANPSEILSSDTLVCISSGE
Subjt:  -----------------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKA-----LNIACANPSEILSSDTLVCISSGE

Query:  NLRHTPSVDVSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQ
        NLRHTPSVDVSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQ
Subjt:  NLRHTPSVDVSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQ

Query:  EELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGF
        EELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGF
Subjt:  EELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGF

Query:  LSEEVYVAQPKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYG
        LSEEVYVAQPKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYG
Subjt:  LSEEVYVAQPKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYG

Query:  LWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGS
        LWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAG+
Subjt:  LWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGS

TYK16336.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa] (e-value: 9.7e-257; identity: 82.74%)
Query:  EFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR------------------------------------------------
        +FEE+RAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTC+VISQKNNVILR                                                
Subjt:  EFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR------------------------------------------------

Query:  --------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKA-----LNIACANPSEILSSDTLVCISSGENLRHTPSVD
                      D+GIFHEFTAPITPQQNGI E KNWTLQEMARAMMHAK         +     L ++ ANPSEILSSDTLVCISSGENLRHTPSVD
Subjt:  --------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKA-----LNIACANPSEILSSDTLVCISSGENLRHTPSVD

Query:  VSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFERN
        VSNVSK TN GASSP+GPPSVPKTSILAPSSHVSKNH ISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE SVEEALTDEKWILAIQEELLQFERN
Subjt:  VSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFERN

Query:  VFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQ
        VFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQI+GIDFGETFAPIAR ETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQ
Subjt:  VFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQ

Query:  PKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSS
         KGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYIL TVDYGLWYTYDTSS
Subjt:  PKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSS

Query:  ALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL
        ALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSK+HNSVSLSIAEAEYIVAGSSCTQLL
Subjt:  ALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL

XP_012846969.1 PREDICTED: uncharacterized protein LOC105966941 [Erythranthe guttata] (e-value: 1.9e-119; identity: 39.63%)
Query:  DGYF-YGCLRHITDNATFLTEFEEHRAG---------------------------HVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR-
        D YF  GC RH+T    FL  +++   G                           +V+ V+GL++N I ISQLCDQ    +F    C+V+ +  N +++ 
Subjt:  DGYF-YGCLRHITDNATFLTEFEEHRAG---------------------------HVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR-

Query:  ----------------------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIAC-------ANPSEILSSDTL
                                     K I HEF+AP TPQQNG+ ERKN TLQEMAR MM+AK +  +F+A+A+N AC         P  +    T 
Subjt:  ----------------------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIAC-------ANPSEILSSDTL

Query:  VCISSGENLRHTPSVD-VSNVSKATNI----GASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE-LSVE
           +S       P+VD  S +S+ +++    G  +   P  +   +   PS  V KNH +  VIG V  G+ TR K + +Y +M   VCFTS IE  +V+
Subjt:  VCISSGENLRHTPSVD-VSNVSKATNI----GASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE-LSVE

Query:  EALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKL
        EAL DE WI A+ EEL QF RN  W LVPRP N NIIGTKW+FKNK DE G I RNKARLVAQGY+QIEGIDF ETFAP+AR E++RLLL  +    IKL
Subjt:  EALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKL

Query:  FQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV---------------SIFERHYMVLSRHLGLV---------------------------------
        FQMDVKSAFLNG L EEVYV QPKGF DP +  HV               + +ER    LS H G                                   
Subjt:  FQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV---------------SIFERHYMVLSRHLGLV---------------------------------

Query:  ------FVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL------------------------------------------------------------
              FV+QM S+FEMSM+GELT+FLG Q+K+ S GIF+                                                            
Subjt:  ------FVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL------------------------------------------------------------

Query:  -----------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIV
                   +Q++P+  HL A KRI++Y+  T+D+G+WY+ DT++ L GF DADWAG +DDRKST+ GCF+L NNL +W+SKK NS+SLS AE+EYI 
Subjt:  -----------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIV

Query:  AGSSCTQLL
        AGS C QLL
Subjt:  AGSSCTQLL

XP_012850949.1 PREDICTED: uncharacterized protein LOC105970659 [Erythranthe guttata] (e-value: 6.8e-109; identity: 37.04%)
Query:  HVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR-----------------------------DKGIFHEFTAPITPQQNGITERKNWTL
        +V+ V+GL++N I ISQLCDQ    +F    C+V+ +  N +++                              K I HEF+AP TPQQNG+ ERKN TL
Subjt:  HVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR-----------------------------DKGIFHEFTAPITPQQNGITERKNWTL

Query:  QEMARAMMHAKSVPLQFYAKALNIAC-------------ANPSEILSSD------------TLVCISSGENL----------------RHTPSVDVSNV-
        QEMAR MM+AK +  +F+A+A+N AC               P EI                T   ++  E L                R++ +  + N+ 
Subjt:  QEMARAMMHAKSVPLQFYAKALNIAC-------------ANPSEILSSD------------TLVCISSGENL----------------RHTPSVDVSNV-

Query:  -----------------------SKATNIGASSPSGPPSVPKTSILA------------------------------PSSHVSKNHSISLVIGDVHNGVT
                                ++T    +S S  P+VP     +                              PS  V KNH +  VIG V  G+ 
Subjt:  -----------------------SKATNIGASSPSGPPSVPKTSILA------------------------------PSSHVSKNHSISLVIGDVHNGVT

Query:  TRIKERKDYAKMIANVCFTSQIE-LSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGID
        TR K + +Y +M   VCFTS IE  +V+EAL DE W+ A+ EEL QF RN  W LVPRP N NIIGTKW+FKNK DE G I RNKARLVAQGY+QIEGID
Subjt:  TRIKERKDYAKMIANVCFTSQIE-LSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGID

Query:  FGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV---------------SIFERHYMVLSRHLGLV-----
        F ETFAP+AR E++RLLL  +    IKLFQMDVKSAFLNG L EEVYV QPKGF DP H  HV               + +ER    LS H G       
Subjt:  FGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV---------------SIFERHYMVLSRHLGLV-----

Query:  ----------------------------------FVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL--------------------------------
                                          FV+QM S+FEMSM+GELT+FLG Q+K+ S GIF+                                
Subjt:  ----------------------------------FVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL--------------------------------

Query:  ---------------------------------------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCF
                                               +Q++P+  HL A KRI++Y+  T D+G+WY+ DT++ L GF DADWA  +DDRKST+ GCF
Subjt:  ---------------------------------------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCF

Query:  FLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL
        +L NNL +W+SKK NS+SLS AE+EYI AGS C QLL
Subjt:  FLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL

XP_016901150.1 PREDICTED: uncharacterized protein LOC107991184 [Cucumis melo] (e-value: 8.4e-309; identity: 96.9%)
Query:  MVRESHRESYALLARYEEALKISNLGTHYDLEIDPDGYFYGCLRHITDNATFLTEFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQK
        MVRESHRESYALLARY EALKI NLGTHYDLEIDPDGYFYGCLRHITDNATFLTEFEE+RAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTC+VISQK
Subjt:  MVRESHRESYALLARYEEALKISNLGTHYDLEIDPDGYFYGCLRHITDNATFLTEFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQK

Query:  NNVILRDKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIACANPSEILSSDTLVCISSGENLRHTPSVDVSNVSKATNIGAS
        NNVILRD+GIFHEFTAPITPQQNGI E KNWTLQEMARAMMHAKSVPLQFYAKALNIACANPSEILSSDTLVCISSGENLRHTPSVDVSNVSK TN GAS
Subjt:  NNVILRDKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIACANPSEILSSDTLVCISSGENLRHTPSVDVSNVSKATNIGAS

Query:  SPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNAN
        SP+GPPSVPKTSILAPSSHVSKNH ISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE SVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNAN
Subjt:  SPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNAN

Query:  IIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV
        IIGTKWIFKNKIDEQGVITRNKARLVAQGYTQI+GIDFGETFAPIAR ETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQ KGFIDPAHLDHV
Subjt:  IIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV

Query:  SIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGC
        SIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYIL TVDYGLWYTYDTSSALVGFCDADWAGC
Subjt:  SIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGC

Query:  SDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL
        SDDRKSTSSGCFFLRNNLTAWFSK+HNSVSLSIAEAEYIVAGSSCTQLL
Subjt:  SDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL

TrEMBL top hits (e-value, % identity, alignment)
A0A1S4DZJ2 uncharacterized protein LOC107991184 (e-value: 4.1e-309; identity: 96.9%)
Query:  MVRESHRESYALLARYEEALKISNLGTHYDLEIDPDGYFYGCLRHITDNATFLTEFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQK
        MVRESHRESYALLARY EALKI NLGTHYDLEIDPDGYFYGCLRHITDNATFLTEFEE+RAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTC+VISQK
Subjt:  MVRESHRESYALLARYEEALKISNLGTHYDLEIDPDGYFYGCLRHITDNATFLTEFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQK

Query:  NNVILRDKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIACANPSEILSSDTLVCISSGENLRHTPSVDVSNVSKATNIGAS
        NNVILRD+GIFHEFTAPITPQQNGI E KNWTLQEMARAMMHAKSVPLQFYAKALNIACANPSEILSSDTLVCISSGENLRHTPSVDVSNVSK TN GAS
Subjt:  NNVILRDKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIACANPSEILSSDTLVCISSGENLRHTPSVDVSNVSKATNIGAS

Query:  SPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNAN
        SP+GPPSVPKTSILAPSSHVSKNH ISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE SVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNAN
Subjt:  SPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNAN

Query:  IIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV
        IIGTKWIFKNKIDEQGVITRNKARLVAQGYTQI+GIDFGETFAPIAR ETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQ KGFIDPAHLDHV
Subjt:  IIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV

Query:  SIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGC
        SIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYIL TVDYGLWYTYDTSSALVGFCDADWAGC
Subjt:  SIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGC

Query:  SDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL
        SDDRKSTSSGCFFLRNNLTAWFSK+HNSVSLSIAEAEYIVAGSSCTQLL
Subjt:  SDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL

A0A5A7VBE1 F5J5.1 (e-value: 2.9e-262; identity: 83.89%)
Query:  TFLTEFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR--------------------------------------------
        T+  +FEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR                                            
Subjt:  TFLTEFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR--------------------------------------------

Query:  -----------------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKA-----LNIACANPSEILSSDTLVCISSGE
                               DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAK         +     L ++ ANPSEILSSDTLVCISSGE
Subjt:  -----------------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKA-----LNIACANPSEILSSDTLVCISSGE

Query:  NLRHTPSVDVSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQ
        NLRHTPSVDVSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQ
Subjt:  NLRHTPSVDVSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQ

Query:  EELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGF
        EELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGF
Subjt:  EELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGF

Query:  LSEEVYVAQPKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYG
        LSEEVYVAQPKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYG
Subjt:  LSEEVYVAQPKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYG

Query:  LWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGS
        LWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAG+
Subjt:  LWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGS

A0A5D3D0Z2 Retrotransposon protein, putative, Ty1-copia subclass (e-value: 4.7e-257; identity: 82.74%)
Query:  EFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR------------------------------------------------
        +FEE+RAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTC+VISQKNNVILR                                                
Subjt:  EFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVILR------------------------------------------------

Query:  --------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKA-----LNIACANPSEILSSDTLVCISSGENLRHTPSVD
                      D+GIFHEFTAPITPQQNGI E KNWTLQEMARAMMHAK         +     L ++ ANPSEILSSDTLVCISSGENLRHTPSVD
Subjt:  --------------DKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKA-----LNIACANPSEILSSDTLVCISSGENLRHTPSVD

Query:  VSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFERN
        VSNVSK TN GASSP+GPPSVPKTSILAPSSHVSKNH ISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE SVEEALTDEKWILAIQEELLQFERN
Subjt:  VSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFERN

Query:  VFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQ
        VFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQI+GIDFGETFAPIAR ETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQ
Subjt:  VFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQ

Query:  PKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSS
         KGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYIL TVDYGLWYTYDTSS
Subjt:  PKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSS

Query:  ALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL
        ALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSK+HNSVSLSIAEAEYIVAGSSCTQLL
Subjt:  ALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL

Q84VI2 Gag-pol polyprotein (e-value: 1.3e-108; identity: 40.49%)
Query:  KGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIAC---------------------------------ANPSEILS-------
        +GI HEF+A ITPQQNGI ERKN TLQE AR M+HAK +P   +A+A+N AC                                  +P  IL+       
Subjt:  KGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIAC---------------------------------ANPSEILS-------

Query:  ----SDTLVCISSGENLRH-----------TPSVDV---------------------SNVSKATNIGASSPSGPPSVPKTSILAP----SSHVSKNHSIS
            SD  + +    N R              S++V                      NV+ A   G ++ +   +  +++I  P    S+ + K H   
Subjt:  ----SDTLVCISSGENLRH-----------TPSVDV---------------------SNVSKATNIGASSPSGPPSVPKTSILAP----SSHVSKNHSIS

Query:  LVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE-LSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLV
        L+IGD + GVTTR +E     ++++N CF S+IE  +V+EALTDE WI A+QEEL QF+RN  WELVPRP   N+IGTKWIFKNK +E+GVITRNKARLV
Subjt:  LVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE-LSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLV

Query:  AQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV---------------SIFER------
        AQGYTQIEG+DF ETFAP+AR E+IRLLL  + +   KL+QMDVKSAFLNG+L+EEVYV QPKGF DP H DHV               + +ER      
Subjt:  AQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV---------------SIFER------

Query:  --------------------HYMV------------LSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL----------------------
                            + M+            +S  +   FV+QM+S+FEMS++GELT+FLG Q+K+    IFL                      
Subjt:  --------------------HYMV------------LSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL----------------------

Query:  -------------------------------------------------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSD
                                                         +QA+P+ISHL+  KRILKY+  T DYG+ Y + +SS LVG+CDADWAG +D
Subjt:  -------------------------------------------------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSD

Query:  DRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL
        DRKSTS GCF+L NNL +WFSKK N VSLS AEAEYI AGSSC+QL+
Subjt:  DRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL

Q84VI4 Gag-pol polyprotein (e-value: 2.4e-107; identity: 40.19%)
Query:  KGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIAC---------------------------------ANPSEILS-------
        +GI HEF+A ITPQQNGI ERKN TLQE AR M+HAK +P   +A+A+N AC                                  +P  IL+       
Subjt:  KGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIAC---------------------------------ANPSEILS-------

Query:  ----SDTLVCISSGENLRH-----------TPSVDV---------------------SNVSKATNIGASSPSGPPSVPKTSILAP----SSHVSKNHSIS
            SD  + +    N R              S++V                      NV+ A   G ++ +   +  +++I  P    S+ + K H   
Subjt:  ----SDTLVCISSGENLRH-----------TPSVDV---------------------SNVSKATNIGASSPSGPPSVPKTSILAP----SSHVSKNHSIS

Query:  LVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE-LSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLV
        L+IGD + GVTTR +E     ++++N CF S+IE  +V+EALTDE WI A+QEEL QF+RN  WELVPRP   N+IGTKWIFKNK +E+GVITRNKARLV
Subjt:  LVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIE-LSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLV

Query:  AQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV---------------SIFER------
        AQGYTQIEG+DF ETFAP+AR E+IRLLL  + +   KL+QMDVKSAFLNG+L+EEVYV QPKGF DP H DHV               + +ER      
Subjt:  AQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHV---------------SIFER------

Query:  --------------------HYMV------------LSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL----------------------
                            + M+            +S  +   FV+QM+S+FEMS++GELT+FLG Q+K+    IFL                      
Subjt:  --------------------HYMV------------LSRHLGLVFVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL----------------------

Query:  -------------------------------------------------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSD
                                                         +QA+P+ISHL   KRILKY+  T DYG+ Y + ++  LVG+CDADWAG +D
Subjt:  -------------------------------------------------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSD

Query:  DRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL
        DRKSTS GCF+L NNL +WFSKK N VSLS AEAEYI AGSSC+QL+
Subjt:  DRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value: 1.8e-27; identity: 26.9%)
Query:  WILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKS
        W  AI  EL   + N  W +  RP N NI+ ++W+F  K +E G   R KARLVA+G+TQ   ID+ ETFAP+AR  + R +L+     ++K+ QMDVK+
Subjt:  WILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKS

Query:  AFLNGFLSEEVYVAQPKG-------------------------------------FIDPA------HLDHVSIFERHYMVLSRHLGLV----------FV
        AFLNG L EE+Y+  P+G                                     F++ +       LD  +I E  Y++L     ++          F 
Subjt:  AFLNGFLSEEVYVAQPKG-------------------------------------FIDPA------HLDHVSIFERHYMVLSRHLGLV----------FV

Query:  EQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQ-------------------ASPRISHLH--------------------------------------
          +  KF M+ + E+  F+G +I+     I+L Q                   ++P  S ++                                      
Subjt:  EQMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQ-------------------ASPRISHLH--------------------------------------

Query:  --------------AAKRILKYILRTVDYGLWYTYDTS--SALVGFCDADWAGCSDDRKSTSSGCFFLRN-NLTAWFSKKHNSVSLSIAEAEYI
                        KR+L+Y+  T+D  L +  + +  + ++G+ D+DWAG   DRKST+   F + + NL  W +K+ NSV+ S  EAEY+
Subjt:  --------------AAKRILKYILRTVDYGLWYTYDTS--SALVGFCDADWAGCSDDRKSTSSGCFFLRN-NLTAWFSKKHNSVSLSIAEAEYI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 8.0e-20; identity: 41.38%)
Query:  AIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFL
        A+QEE+   ++N  ++LV  P     +  KW+FK K D    + R KARLV +G+ Q +GIDF E F+P+ +  +IR +L+ +    +++ Q+DVK+AFL
Subjt:  AIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFL

Query:  NGFLSEEVYVAQPKGF
        +G L EE+Y+ QP+GF
Subjt:  NGFLSEEVYVAQPKGF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.9e-05; identity: 34.69%)
Query:  FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL
        F  +P   H  A K IL+Y+  T    L +   +   L G+ DAD AG  D+RKS++   F       +W SK    V+LS  EAEYI A  +  +++
Subjt:  FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQLL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 3.3e-05; identity: 45.1%)
Query:  GIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIAC
        GI HE T P TPQ NG+ ER N T+ E  R+M+    +P  F+ +A+  AC
Subjt:  GIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIAC

P92520 Uncharacterized mitochondrial protein AtMg00820 (e-value: 1.8e-19; identity: 53.19%)
Query:  SVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLN
        SV  AL D  W  A+QEEL    RN  W LVP P N NI+G KW+FK K+   G + R KARLVA+G+ Q EGI F ET++P+ R  TIR +LN
Subjt:  SVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 7.0e-32; identity: 25%)
Query:  SNVSKATNIGASSPSGPPSVPKTSILAP---SSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFE
        S+ S +    ASS S  P+ P   I  P   +  V+ N+   L    +       I +      +  ++   S+   ++ +AL DE+W  A+  E+    
Subjt:  SNVSKATNIGASSPSGPPSVPKTSILAP---SSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFE

Query:  RNVFWELV-PRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVY
         N  W+LV P P++  I+G +WIF  K +  G + R KARLVA+GY Q  G+D+ ETF+P+ +  +IR++L  +  R   + Q+DV +AFL G L+++VY
Subjt:  RNVFWELV-PRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVY

Query:  VAQPKGFIDPAHLDHV--------------------------------SIFERHYMVLSRHLGLVF---------------------VEQMKSKFEMSMM
        ++QP GFID    ++V                                S+ +    VL R   +V+                     ++ +  +F +   
Subjt:  VAQPKGFIDPAHLDHV--------------------------------SIFERHYMVLSRHLGLVF---------------------VEQMKSKFEMSMM

Query:  GELTFFLGFQIKKCSSGIFLFQ------------------------ASPRIS-----------------------------------------------H
         EL +FLG + K+  +G+ L Q                         SP++S                                               H
Subjt:  GELTFFLGFQIKKCSSGIFLFQ------------------------ASPRIS-----------------------------------------------H

Query:  LHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQL
        L A KRIL+Y+  T ++G++     + +L  + DADWAG  DD  ST+    +L ++  +W SKK   V  S  EAEY    ++ +++
Subjt:  LHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 5.7e-34; identity: 24.8%)
Query:  TPSVDVSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKN-HSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEEL
        TPS  +S  +  ++   S+P  PP +P   I+  ++    N HS++    D   G+  +  ++  YA  +A     +    +  +A+ D++W  A+  E+
Subjt:  TPSVDVSNVSKATNIGASSPSGPPSVPKTSILAPSSHVSKN-HSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEEL

Query:  LQFERNVFWELV-PRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLS
             N  W+LV P P +  I+G +WIF  K +  G + R KARLVA+GY Q  G+D+ ETF+P+ +  +IR++L  +  R   + Q+DV +AFL G L+
Subjt:  LQFERNVFWELV-PRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLS

Query:  EEVYVAQPKGFIDPAHLDHV--------------------------------SIFERHYMVLSRHLGLVF---------------------VEQMKSKFE
        +EVY++QP GF+D    D+V                                SI +    VL R   +++                     ++ +  +F 
Subjt:  EEVYVAQPKGFIDPAHLDHV--------------------------------SIFERHYMVLSRHLGLVF---------------------VEQMKSKFE

Query:  MSMMGELTFFLGFQIKKCSSGIFLFQ------------------------ASPRIS--------------------------------------------
        +    +L +FLG + K+   G+ L Q                         SP+++                                            
Subjt:  MSMMGELTFFLGFQIKKCSSGIFLFQ------------------------ASPRIS--------------------------------------------

Query:  ---HLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQL
           H +A KR+L+Y+  T D+G++     + +L  + DADWAG +DD  ST+    +L ++  +W SKK   V  S  EAEY    ++ ++L
Subjt:  ---HLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQL

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value: 4.8e-36; identity: 28.26%)
Query:  VCFTSQIELSV-EEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIR
        VC     E S   EA     W  A+ +E+   E    WE+   P N   IG KW++K K +  G I R KARLVA+GYTQ EGIDF ETF+P+ +  +++
Subjt:  VCFTSQIELSV-EEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIR

Query:  LLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGF-------IDP-----------------------------------AHLDH-----------
        L+L  S + +  L Q+D+ +AFLNG L EE+Y+  P G+       + P                                   +H DH           
Subjt:  LLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGF-------IDP-----------------------------------AHLDH-----------

Query:  --VSIFERHYMVLSRHLGLV--FVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL--------------------------------------------
          V ++    ++ S +   V     Q+KS F++  +G L +FLG +I + ++GI +                                            
Subjt:  --VSIFERHYMVLSRHLGLV--FVEQMKSKFEMSMMGELTFFLGFQIKKCSSGIFL--------------------------------------------

Query:  ---------------------------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSK
                                   F  +PR++H  A  +IL YI  TV  GL+Y+      L  F DA +  C D R+ST+  C FL  +L +W SK
Subjt:  ---------------------------FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSK

Query:  KHNSVSLSIAEAEY
        K   VS S AEAEY
Subjt:  KHNSVSLSIAEAEY

ATMG00240.1 Gag-Pol-related retrotransposon family protein (e-value: 8.8e-06; identity: 35%)
Query:  FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGC
        F ++ R + + A  ++L Y+  TV  GL+Y+  +   L  F D+DWA C D R+S +  C
Subjt:  FQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGC

ATMG00810.1 DNA/RNA polymerases superfamily protein (e-value: 3.7e-12; identity: 34.41%)
Query:  PRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQL
        P ++     KR+L+Y+  T+ +GL+   ++   +  FCD+DWAGC+  R+ST+  C FL  N+ +W +K+  +VS S  E EY     +  +L
Subjt:  PRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKKHNSVSLSIAEAEYIVAGSSCTQL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e-value: 1.3e-20; identity: 53.19%)
Query:  SVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLN
        SV  AL D  W  A+QEEL    RN  W LVP P N NI+G KW+FK K+   G + R KARLVA+G+ Q EGI F ET++P+ R  TIR +LN
Subjt:  SVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQGVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLN
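
In the alignments above, the unlabelled middle line follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows how a percent identity could be derived from such a line. The function name `percent_identity` is hypothetical, and dividing by the number of aligned columns is an assumption about how the reported values were computed.

```python
def percent_identity(midline: str, aligned_columns: int) -> float:
    """Percent of aligned columns whose midline character is a letter."""
    if aligned_columns <= 0:
        raise ValueError("aligned_columns must be positive")
    # Letters in the midline mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_columns, 2)


# Midline of the third P10978 (TNT 1-94) hit, aligned to a 51-residue query segment.
midline = "GI HE T P TPQ NG+ ER N T+ E  R+M+    +P  F+ +A+  AC"
print(percent_identity(midline, 51))
```

The aligned-column count is passed explicitly because trailing spaces in a midline are easily lost when the text is copied.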


Sequences
CDS sequence
ATGGTAAGGGAGTCCCACAGAGAATCTTATGCTTTGTTGGCTCGGTATGAAGAAGCATTGAAGATTTCCAATTTGGGGACTCATTATGATTTGGAGATCGATCCT
GATGGTTATTTCTATGGATGCTTGCGACATATAACTGATAATGCAACTTTTCTAACAGAATTTGAGGAGCATCGTGCAGGACATGTTATGTTCGTTGATGGTCTA
TCAGCAAATCCTATTAGAATCAGCCAGTTGTGTGATCAAGGATTTTCTAAGAAGTTTGGATGGGACACATGTAAAGTAATCAGTCAAAAGAATAATGTGATTCTC
AGAGACAAAGGGATTTTTCATGAGTTTACTGCACCTATCACTCCTCAACAAAATGGGATTACTGAGCGTAAAAACTGGACGTTGCAGGAGATGGCCAGAGCTATG
ATGCATGCTAAGTCAGTTCCTCTTCAGTTCTATGCAAAAGCACTCAATATTGCATGTGCCAACCCTTCAGAGATTCTCTCAAGTGACACCTTAGTATGTATCTCA
TCTGGTGAGAACCTCAGACACACACCATCCGTAGATGTCTCAAATGTATCTAAAGCCACTAACATAGGTGCTTCATCACCATCAGGTCCACCTAGTGTTCCCAAG
ACATCAATTTTAGCACCGTCTTCGCATGTATCCAAGAATCATTCCATCAGTTTAGTGATTGGCGATGTACATAATGGTGTAACCACAAGAATAAAAGAACGAAAA
GACTATGCTAAGATGATTGCGAATGTGTGCTTCACCTCTCAAATTGAGCTGAGTGTGGAAGAAGCATTAACTGATGAGAAGTGGATCCTAGCTATACAAGAGGAA
TTGTTACAGTTTGAAAGAAATGTTTTCTGGGAACTGGTTCCTCGACCTACAAATGCTAACATTATTGGGACAAAATGGATTTTTAAGAACAAGATCGATGAACAA
GGAGTCATAACTCGAAACAAGGCTCGCTTAGTGGCACAAGGGTACACACAGATCGAAGGAATCGATTTTGGTGAGACGTTTGCTCCCATAGCCAGATTTGAAACC
ATTCGTCTACTTCTTAACTTTTCCTATCTCCGTCACATCAAGCTCTTTCAAATGGATGTCAAAAGCGCATTTCTAAATGGTTTTCTCTCTGAAGAAGTGTATGTG
GCCCAACCAAAAGGGTTCATTGATCCTGCACATCTTGATCACGTTTCAATCTTCGAAAGGCATTATATGGTCTTAAGCAGGCACCTCGGGCTTGTGTTTGTGGAA
CAGATGAAGTCGAAATTCGAGATGAGCATGATGGGTGAGCTTACCTTCTTCCTAGGATTCCAAATTAAAAAATGCTCGTCTGGAATCTTCCTTTTTCAAGCTTCT
CCCCGTATTTCTCATCTTCATGCTGCAAAACGTATTCTGAAATACATATTGCGCACTGTCGACTATGGGTTGTGGTACACCTATGATACTTCATCAGCACTGGTG
GGGTTTTGTGATGCGGATTGGGCAGGATGTTCTGATGATAGGAAGAGTACCTCTAGTGGCTGTTTCTTCTTGAGAAATAATCTCACAGCATGGTTTAGCAAAAAG
CATAATAGTGTCTCGTTGTCCATTGCCGAAGCTGAATATATTGTCGCTGGCAGTAGTTGCACACAGCTCCTCTAG
mRNA sequence
ATGGTAAGGGAGTCCCACAGAGAATCTTATGCTTTGTTGGCTCGGTATGAAGAAGCATTGAAGATTTCCAATTTGGGGACTCATTATGATTTGGAGATCGATCCT
GATGGTTATTTCTATGGATGCTTGCGACATATAACTGATAATGCAACTTTTCTAACAGAATTTGAGGAGCATCGTGCAGGACATGTTATGTTCGTTGATGGTCTA
TCAGCAAATCCTATTAGAATCAGCCAGTTGTGTGATCAAGGATTTTCTAAGAAGTTTGGATGGGACACATGTAAAGTAATCAGTCAAAAGAATAATGTGATTCTC
AGAGACAAAGGGATTTTTCATGAGTTTACTGCACCTATCACTCCTCAACAAAATGGGATTACTGAGCGTAAAAACTGGACGTTGCAGGAGATGGCCAGAGCTATG
ATGCATGCTAAGTCAGTTCCTCTTCAGTTCTATGCAAAAGCACTCAATATTGCATGTGCCAACCCTTCAGAGATTCTCTCAAGTGACACCTTAGTATGTATCTCA
TCTGGTGAGAACCTCAGACACACACCATCCGTAGATGTCTCAAATGTATCTAAAGCCACTAACATAGGTGCTTCATCACCATCAGGTCCACCTAGTGTTCCCAAG
ACATCAATTTTAGCACCGTCTTCGCATGTATCCAAGAATCATTCCATCAGTTTAGTGATTGGCGATGTACATAATGGTGTAACCACAAGAATAAAAGAACGAAAA
GACTATGCTAAGATGATTGCGAATGTGTGCTTCACCTCTCAAATTGAGCTGAGTGTGGAAGAAGCATTAACTGATGAGAAGTGGATCCTAGCTATACAAGAGGAA
TTGTTACAGTTTGAAAGAAATGTTTTCTGGGAACTGGTTCCTCGACCTACAAATGCTAACATTATTGGGACAAAATGGATTTTTAAGAACAAGATCGATGAACAA
GGAGTCATAACTCGAAACAAGGCTCGCTTAGTGGCACAAGGGTACACACAGATCGAAGGAATCGATTTTGGTGAGACGTTTGCTCCCATAGCCAGATTTGAAACC
ATTCGTCTACTTCTTAACTTTTCCTATCTCCGTCACATCAAGCTCTTTCAAATGGATGTCAAAAGCGCATTTCTAAATGGTTTTCTCTCTGAAGAAGTGTATGTG
GCCCAACCAAAAGGGTTCATTGATCCTGCACATCTTGATCACGTTTCAATCTTCGAAAGGCATTATATGGTCTTAAGCAGGCACCTCGGGCTTGTGTTTGTGGAA
CAGATGAAGTCGAAATTCGAGATGAGCATGATGGGTGAGCTTACCTTCTTCCTAGGATTCCAAATTAAAAAATGCTCGTCTGGAATCTTCCTTTTTCAAGCTTCT
CCCCGTATTTCTCATCTTCATGCTGCAAAACGTATTCTGAAATACATATTGCGCACTGTCGACTATGGGTTGTGGTACACCTATGATACTTCATCAGCACTGGTG
GGGTTTTGTGATGCGGATTGGGCAGGATGTTCTGATGATAGGAAGAGTACCTCTAGTGGCTGTTTCTTCTTGAGAAATAATCTCACAGCATGGTTTAGCAAAAAG
CATAATAGTGTCTCGTTGTCCATTGCCGAAGCTGAATATATTGTCGCTGGCAGTAGTTGCACACAGCTCCTCTAG
Protein sequence
MVRESHRESYALLARYEEALKISNLGTHYDLEIDPDGYFYGCLRHITDNATFLTEFEEHRAGHVMFVDGLSANPIRISQLCDQGFSKKFGWDTCKVISQKNNVIL
RDKGIFHEFTAPITPQQNGITERKNWTLQEMARAMMHAKSVPLQFYAKALNIACANPSEILSSDTLVCISSGENLRHTPSVDVSNVSKATNIGASSPSGPPSVPK
TSILAPSSHVSKNHSISLVIGDVHNGVTTRIKERKDYAKMIANVCFTSQIELSVEEALTDEKWILAIQEELLQFERNVFWELVPRPTNANIIGTKWIFKNKIDEQ
GVITRNKARLVAQGYTQIEGIDFGETFAPIARFETIRLLLNFSYLRHIKLFQMDVKSAFLNGFLSEEVYVAQPKGFIDPAHLDHVSIFERHYMVLSRHLGLVFVE
QMKSKFEMSMMGELTFFLGFQIKKCSSGIFLFQASPRISHLHAAKRILKYILRTVDYGLWYTYDTSSALVGFCDADWAGCSDDRKSTSSGCFFLRNNLTAWFSKK
HNSVSLSIAEAEYIVAGSSCTQLL
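
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch using the standard genetic code (NCBI translation table 1). The `translate` helper is hypothetical and written in plain Python rather than taken from any particular library, and the use of the standard code for this nuclear gene is an assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, dropping any terminal stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First eight and last four codons of the CDS shown above.
print(translate("ATGGTAAGGGAGTCCCACAGAGAA"))  # MVRESHRE
print(translate("CAGCTCCTCTAG"))              # QLL
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence, joined into one string, gives a quick consistency check.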