; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G012170 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G012170
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrotransposon protein, putative, Ty1-copia subclass
Genome locationCmo_Chr09:9417204..9418871
RNA-Seq ExpressionCmoCh09G012170
SyntenyCmoCh09G012170
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025314 - Domain of unknown function DUF4219
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAF23632.1 Os08g0389500 [Oryza sativa Japonica Group]6.6e-16052.99Show/hide
Query:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD
        LTR+NY+EWA+LM+VN++A G+W+AVEP   E +EYR+DRLA AAILRSVP EML +L  KR+A++AW+ IK+ RVGV        QQLR+E + + +K+
Subjt:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD

Query:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKL
        GE+ +DFS+RITGLAN++ TLG  IS+A++V+KML VVP+HLEQ+A+++ETLLD+N ++VEEVTGRLR VEQR+      V+ +GRLLLT+EEW+A+LK 
Subjt:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKL

Query:  RDNTGESNGSSSSRKGG-KKPWMSHGRTRGKDGNQKKESTNG------RPIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPN
           +GE    S S  GG K+   S    RGK G+++            +  +C  CGK+ H ++ C S+LR     ++AH+AQ EE+E  + MV     N
Subjt:  RDNTGESNGSSSSRKGG-KKPWMSHGRTRGKDGNQKKESTNG------RPIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPN

Query:  FDSKSDDDVEPKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVS
          S S     P +   L      PA E I L+E ++F Q+G  +   E  +WILDTGATNHMTG RSAFS+L++GIRGTVKFGDGSVV IEGRGT+LF  
Subjt:  FDSKSDDDVEPKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVS

Query:  KGGEHRQLTDVYFIPRLKANLVSLGQLD-----------------NQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKL
        K GEH+ L  VY IPRL  N+VSLGQLD                  QRRLL +  R+ NRLY+++L I +PV L+A+  +++WRWHAR+GHLNF ALEKL
Subjt:  KGGEHRQLTDVYFIPRLKANLVSLGQLD-----------------NQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKL

Query:  QKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
         +  +V GLP I  V+++CD CL+GKQRR PFPS+  YRA E LELVHGDICGP+ PATP G  LFLLLVDD SR+MWL LL +K
Subjt:  QKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

CAE03285.2 OSJNBb0046P18.1 [Oryza sativa Japonica Group]8.0e-15051.66Show/hide
Query:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD
        LTR+NY +W L+MRVN+QAQGLW AVEP+  + ++YR+DR A  AILR+VP EMLA+L+ K T Q AWE IK+RR+GV        QQLR+E  +I FKD
Subjt:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD

Query:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKN----ITSAVDKEGRLLLTEEEWLA
        GE+VDDFSMRI  LAN++ TLG  I++AE+V+K+LQVVP+HL+Q+AISIETLLDVN L++EEVTG LR+VEQRK+      +S VD  G LL TEEEWLA
Subjt:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKN----ITSAVDKEGRLLLTEEEWLA

Query:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACV
        + +    L+D    S+G +S R+G  +     G +  K+   K  +  GR P  C NCGKR H +++C    R+K KA++A+VAQ E+DEPAL +     
Subjt:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACV

Query:  PNFDSKSDDDVEPKKELQLGVAKMAPA-GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTIL
           D      V P   +    A   P+    + + + +VF Q+    E H    WILDTGATN+MTG+RSAF++LD+ + GTVKFGDGSVV IEG GT+L
Subjt:  PNFDSKSDDDVEPKKELQLGVAKMAPA-GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTIL

Query:  FVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKG
        F  +  EHR +T +Y+IP L  N+VSLGQLD+                   L+ID+PV L+A++ E +WRWHARYGHLNFPAL KL ++E+V GLP ++ 
Subjt:  FVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKG

Query:  VNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
        V ++CDGCL+GKQRR  FP+++ Y  +E L LVHGD+C PI+PATP G   FLLLVDD SR+MWLTL+++K
Subjt:  VNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

CAE03692.2 OSJNBb0026E15.10 [Oryza sativa Japonica Group]1.7e-16053.19Show/hide
Query:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD
        LTR+NY +WAL+MRVN+QAQ LW AVEP+  + ++YR+D+ A AAILR+VP EMLA+L+ K TAQ AWE IK+RR+GV        QQLR+E  +I FKD
Subjt:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD

Query:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLA
         E+VDDFSMRI GLAN++ TLG  I+EAE+V+K+LQVVP+HL+Q+AISIETLLDVN L++EEVTGRLR+VEQRK+  T+A    VD  GRLL TEEEWLA
Subjt:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLA

Query:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRG-------KDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPAL
        + +    L+D    S+G+   R          GR RG       K+   K  +  GR P  C NCGKR H +++C    R+K KA++A+VAQ E++EPAL
Subjt:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRG-------KDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPAL

Query:  FMVTACVPNFDSKSDDDVEPKKELQLGVAKMAPA--GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVE
         +        D        P   +    A  AP+  GE + + E +VFAQ+    E H+   WILDTGATNHMTG+RSAF++LD+ + GTV+FGDGSVV 
Subjt:  FMVTACVPNFDSKSDDDVEPKKELQLGVAKMAPA--GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVE

Query:  IEGRGTILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDN-----------------QRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARY
        IEGR T+LF  + GEHR +  VY+IPRL AN+VSLGQLD                  +  LL + RR+ + LY ++L+ID+PV L+A++ E +WRWHARY
Subjt:  IEGRGTILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDN-----------------QRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARY

Query:  GHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
        GHLNFPAL KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLT++++K
Subjt:  GHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

CAH66352.1 OSIGBa0135C09.3 [Oryza sativa]4.6e-15352.8Show/hide
Query:  LQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKDGESVDDFSMRITGLAN
        +QAQGLW AVEP+  + ++YR+DR A AAILR+VP EMLA+L+ K TAQ AWE IK+RR+GV        QQLR+E  +I FKDGE+VD FSMRI GLAN
Subjt:  LQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKDGESVDDFSMRITGLAN

Query:  SITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLARLKLR---DNTGESNG
        ++  LG  I+EAE+V+K+LQVVP+HL+Q+AISIETLLDVN L++EEVTG LR+VEQRK++ T+A    VD  GRLL TEEEWLA+ +      +   S+G
Subjt:  SITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLARLKLR---DNTGESNG

Query:  SSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPNFDSKSDDDVEPKKEL
         +  R+G  +     G    K+   K  +  GR P  C NCGKR H +++C    R+K KA++A+VAQ E++EPAL +        D        P   +
Subjt:  SSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPNFDSKSDDDVEPKKEL

Query:  QLGVAKMAPA--GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGEHRQLTDVYF
            A  AP+  GE + + E +VFAQ+    E H+   WILDTGATNHMTG+RSAF+KLD+ + GTV+FGDGSVV IEGRGT+LF  + GEHR +  VY+
Subjt:  QLGVAKMAPA--GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGEHRQLTDVYF

Query:  IPRLKANLVSLGQLDN-----------------QRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIK
        IPRL AN+VSLGQLD                  +  LL + RR+ + LY ++L ID+PV L+A++ + +WRWHARYGHLNFP+L KL ++E+V GLP ++
Subjt:  IPRLKANLVSLGQLDN-----------------QRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIK

Query:  GVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
         V ++CDGCL+GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLTL+++K
Subjt:  GVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

CAH68021.1 H0807C06-H0308C08.8 [Oryza sativa]1.0e-14951.74Show/hide
Query:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD
        LTR+NY +W L+MRVN+QAQGLW AVEP+  + ++YR+DR A  AILR+VP EMLA+L+ K T Q AWE IK+RR+GV        QQLR+E  +I FKD
Subjt:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD

Query:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKN----ITSAVDKEGRLLLTEEEWLA
        GE+VDDFSMRI  LAN++ TLG  I+EAE+V+K+LQVVP+HL+Q+AISIETLLDVN L++EEVTGRLR+VEQ K+      +S VD  G LL TEEEWLA
Subjt:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKN----ITSAVDKEGRLLLTEEEWLA

Query:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRT---RGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVT
        + +    L+D    S+G +S  +G       HG       K+   K  +  GR P  C NCGKR H +++C    R+K KA++AHVAQ E++EPAL +  
Subjt:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRT---RGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVT

Query:  ACVPNFDSKSDDDVEPKKELQLGVAKMAPAGE-PIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRG
              D      V P   +    A   P+ +  + + + +VF Q+    E H    WILDTGATN+MTG+RSAF++LD+ + GTVKFGDGSVV IEG G
Subjt:  ACVPNFDSKSDDDVEPKKELQLGVAKMAPAGE-PIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRG

Query:  TILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPE
        T+LF  +  EHR +T +Y+IP L AN+VSLGQLD+                  +L+ID+PV L+AK+ E +WRWHA YGHLNFPAL KL ++E+V GLP 
Subjt:  TILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPE

Query:  IKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
        ++ V ++CDGCL+GKQRR  FP+++ Y A+E L LVHGD+C PI+PA P G   FLLLVDD SR+MWLTL+++K
Subjt:  IKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

TrEMBL top hitse value%identityAlignment
A0B9X7 OSIGBa0135C09.3 protein2.2e-15352.8Show/hide
Query:  LQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKDGESVDDFSMRITGLAN
        +QAQGLW AVEP+  + ++YR+DR A AAILR+VP EMLA+L+ K TAQ AWE IK+RR+GV        QQLR+E  +I FKDGE+VD FSMRI GLAN
Subjt:  LQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKDGESVDDFSMRITGLAN

Query:  SITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLARLKLR---DNTGESNG
        ++  LG  I+EAE+V+K+LQVVP+HL+Q+AISIETLLDVN L++EEVTG LR+VEQRK++ T+A    VD  GRLL TEEEWLA+ +      +   S+G
Subjt:  SITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLARLKLR---DNTGESNG

Query:  SSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPNFDSKSDDDVEPKKEL
         +  R+G  +     G    K+   K  +  GR P  C NCGKR H +++C    R+K KA++A+VAQ E++EPAL +        D        P   +
Subjt:  SSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPNFDSKSDDDVEPKKEL

Query:  QLGVAKMAPA--GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGEHRQLTDVYF
            A  AP+  GE + + E +VFAQ+    E H+   WILDTGATNHMTG+RSAF+KLD+ + GTV+FGDGSVV IEGRGT+LF  + GEHR +  VY+
Subjt:  QLGVAKMAPA--GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGEHRQLTDVYF

Query:  IPRLKANLVSLGQLDN-----------------QRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIK
        IPRL AN+VSLGQLD                  +  LL + RR+ + LY ++L ID+PV L+A++ + +WRWHARYGHLNFP+L KL ++E+V GLP ++
Subjt:  IPRLKANLVSLGQLDN-----------------QRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIK

Query:  GVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
         V ++CDGCL+GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLTL+++K
Subjt:  GVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

Q01HC5 H0807C06-H0308C08.8 protein5.1e-15051.74Show/hide
Query:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD
        LTR+NY +W L+MRVN+QAQGLW AVEP+  + ++YR+DR A  AILR+VP EMLA+L+ K T Q AWE IK+RR+GV        QQLR+E  +I FKD
Subjt:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD

Query:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKN----ITSAVDKEGRLLLTEEEWLA
        GE+VDDFSMRI  LAN++ TLG  I+EAE+V+K+LQVVP+HL+Q+AISIETLLDVN L++EEVTGRLR+VEQ K+      +S VD  G LL TEEEWLA
Subjt:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKN----ITSAVDKEGRLLLTEEEWLA

Query:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRT---RGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVT
        + +    L+D    S+G +S  +G       HG       K+   K  +  GR P  C NCGKR H +++C    R+K KA++AHVAQ E++EPAL +  
Subjt:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRT---RGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVT

Query:  ACVPNFDSKSDDDVEPKKELQLGVAKMAPAGE-PIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRG
              D      V P   +    A   P+ +  + + + +VF Q+    E H    WILDTGATN+MTG+RSAF++LD+ + GTVKFGDGSVV IEG G
Subjt:  ACVPNFDSKSDDDVEPKKELQLGVAKMAPAGE-PIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRG

Query:  TILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPE
        T+LF  +  EHR +T +Y+IP L AN+VSLGQLD+                  +L+ID+PV L+AK+ E +WRWHA YGHLNFPAL KL ++E+V GLP 
Subjt:  TILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPE

Query:  IKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
        ++ V ++CDGCL+GKQRR  FP+++ Y A+E L LVHGD+C PI+PA P G   FLLLVDD SR+MWLTL+++K
Subjt:  IKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

Q0J5Y3 Os08g0389500 protein3.2e-16052.99Show/hide
Query:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD
        LTR+NY+EWA+LM+VN++A G+W+AVEP   E +EYR+DRLA AAILRSVP EML +L  KR+A++AW+ IK+ RVGV        QQLR+E + + +K+
Subjt:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD

Query:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKL
        GE+ +DFS+RITGLAN++ TLG  IS+A++V+KML VVP+HLEQ+A+++ETLLD+N ++VEEVTGRLR VEQR+      V+ +GRLLLT+EEW+A+LK 
Subjt:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKL

Query:  RDNTGESNGSSSSRKGG-KKPWMSHGRTRGKDGNQKKESTNG------RPIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPN
           +GE    S S  GG K+   S    RGK G+++            +  +C  CGK+ H ++ C S+LR     ++AH+AQ EE+E  + MV     N
Subjt:  RDNTGESNGSSSSRKGG-KKPWMSHGRTRGKDGNQKKESTNG------RPIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPN

Query:  FDSKSDDDVEPKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVS
          S S     P +   L      PA E I L+E ++F Q+G  +   E  +WILDTGATNHMTG RSAFS+L++GIRGTVKFGDGSVV IEGRGT+LF  
Subjt:  FDSKSDDDVEPKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVS

Query:  KGGEHRQLTDVYFIPRLKANLVSLGQLD-----------------NQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKL
        K GEH+ L  VY IPRL  N+VSLGQLD                  QRRLL +  R+ NRLY+++L I +PV L+A+  +++WRWHAR+GHLNF ALEKL
Subjt:  KGGEHRQLTDVYFIPRLKANLVSLGQLD-----------------NQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKL

Query:  QKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
         +  +V GLP I  V+++CD CL+GKQRR PFPS+  YRA E LELVHGDICGP+ PATP G  LFLLLVDD SR+MWL LL +K
Subjt:  QKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

Q7X7K3 OSJNBb0046P18.1 protein3.9e-15051.66Show/hide
Query:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD
        LTR+NY +W L+MRVN+QAQGLW AVEP+  + ++YR+DR A  AILR+VP EMLA+L+ K T Q AWE IK+RR+GV        QQLR+E  +I FKD
Subjt:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD

Query:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKN----ITSAVDKEGRLLLTEEEWLA
        GE+VDDFSMRI  LAN++ TLG  I++AE+V+K+LQVVP+HL+Q+AISIETLLDVN L++EEVTG LR+VEQRK+      +S VD  G LL TEEEWLA
Subjt:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKN----ITSAVDKEGRLLLTEEEWLA

Query:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACV
        + +    L+D    S+G +S R+G  +     G +  K+   K  +  GR P  C NCGKR H +++C    R+K KA++A+VAQ E+DEPAL +     
Subjt:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACV

Query:  PNFDSKSDDDVEPKKELQLGVAKMAPA-GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTIL
           D      V P   +    A   P+    + + + +VF Q+    E H    WILDTGATN+MTG+RSAF++LD+ + GTVKFGDGSVV IEG GT+L
Subjt:  PNFDSKSDDDVEPKKELQLGVAKMAPA-GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTIL

Query:  FVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKG
        F  +  EHR +T +Y+IP L  N+VSLGQLD+                   L+ID+PV L+A++ E +WRWHARYGHLNFPAL KL ++E+V GLP ++ 
Subjt:  FVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKG

Query:  VNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
        V ++CDGCL+GKQRR  FP+++ Y  +E L LVHGD+C PI+PATP G   FLLLVDD SR+MWLTL+++K
Subjt:  VNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

Q7XPB1 OSJNBb0026E15.10 protein8.4e-16153.19Show/hide
Query:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD
        LTR+NY +WAL+MRVN+QAQ LW AVEP+  + ++YR+D+ A AAILR+VP EMLA+L+ K TAQ AWE IK+RR+GV        QQLR+E  +I FKD
Subjt:  LTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGV--------QQLRKELSEIRFKD

Query:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLA
         E+VDDFSMRI GLAN++ TLG  I+EAE+V+K+LQVVP+HL+Q+AISIETLLDVN L++EEVTGRLR+VEQRK+  T+A    VD  GRLL TEEEWLA
Subjt:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLA

Query:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRG-------KDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPAL
        + +    L+D    S+G+   R          GR RG       K+   K  +  GR P  C NCGKR H +++C    R+K KA++A+VAQ E++EPAL
Subjt:  RLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRG-------KDGNQKKESTNGR-PIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPAL

Query:  FMVTACVPNFDSKSDDDVEPKKELQLGVAKMAPA--GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVE
         +        D        P   +    A  AP+  GE + + E +VFAQ+    E H+   WILDTGATNHMTG+RSAF++LD+ + GTV+FGDGSVV 
Subjt:  FMVTACVPNFDSKSDDDVEPKKELQLGVAKMAPA--GEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVE

Query:  IEGRGTILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDN-----------------QRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARY
        IEGR T+LF  + GEHR +  VY+IPRL AN+VSLGQLD                  +  LL + RR+ + LY ++L+ID+PV L+A++ E +WRWHARY
Subjt:  IEGRGTILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDN-----------------QRRLLTQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARY

Query:  GHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
        GHLNFPAL KL ++E+V GLP ++ V ++CDGCL+GKQRR  FP+++ YRADE LELVHGD+CGPI+PATP G   FLLLVDD SR+MWLT++++K
Subjt:  GHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.4e-1720.75Show/hide
Query:  YNEWALLMRVNLQAQGLWHAVE---PKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKS---RRVGVQQ--LRKELSEIRFKDGESVD
        Y  W   +R  L  Q +   V+   P E +    + +R A + I+  +    L   ++  TA+   E + +   R+    Q  LRK L  ++     S+ 
Subjt:  YNEWALLMRVNLQAQGLWHAVE---PKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKS---RRVGVQQ--LRKELSEIRFKDGESVD

Query:  DFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTG
                L + +   G  I E + +  +L  +P   + +  +IETL +  NLT+  V  RL + E + KN  +   K+    +            +NT 
Subjt:  DFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTG

Query:  ESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGRPIQCSNCGKREHLSRNC--WSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPNFDSKSDDDVE
        ++N   +     KK +          GN K +      ++C +CG+  H+ ++C  + ++ N +  E     Q+       FMV                
Subjt:  ESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGRPIQCSNCGKREHLSRNC--WSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPNFDSKSDDDVE

Query:  PKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGG------E
                           ++    V    G          ++LD+GA++H+    S ++        +V+      + +  +G  ++ +K G      +
Subjt:  PKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGG------E

Query:  HR-QLTDVYFIPRLKANLVSLGQLDNQRRLL-------------TQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVH
        H   L DV F      NL+S+ +L      +                 + +  L  + +   Q  S++AK +     WH R+GH++   L ++++K +  
Subjt:  HR-QLTDVYFIPRLKANLVSLGQLDNQRRLL-------------TQARRTTNRLYILELEIDQPVSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVH

Query:  GLPEIKGVN---KLCDGCLIGKQRRTPFPS-RTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKS
            +  +    ++C+ CL GKQ R PF   +       PL +VH D+CGPI P T   K+ F++ VD  + +    L++ KS
Subjt:  GLPEIKGVN---KLCDGCLIGKQRRTPFPS-RTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAKS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-3326.85Show/hide
Query:  SNYNEWALLMRVNLQAQGLWHA--VEPKEGETIEYRE----DRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGVQ-----QLRKELSEIRFKD
        + ++ W   MR  L  QGL     V+ K+ +T++  +    D  A +AI   +  +++ ++  + TA+  W  ++S  +         L+K+L  +   +
Subjt:  SNYNEWALLMRVNLQAQGLWHA--VEPKEGETIEYRE----DRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGVQ-----QLRKELSEIRFKD

Query:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISI---ETLLDVNNLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLAR
        G +         GL   +  LG  I E +    +L  +P   + +A +I   +T +++ ++T    +  L N + RKK      + +G+ L+TE    + 
Subjt:  GESVDDFSMRITGLANSITTLGGGISEAEIVKKMLQVVPDHLEQVAISI---ETLLDVNNLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLAR

Query:  LKLRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGRPIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPNFDSK
         +  +N G S                    RGK  N+ K     R   C NC +  H  R+C     N  K +     Q  +D  A     A V N D+ 
Subjt:  LKLRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTNGRPIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPNFDSK

Query:  SDDDVEPKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGE
                  + L +            EEE      G   E      W++DT A++H T  R  F +  +G  GTVK G+ S  +I G G I   +  G 
Subjt:  SDDDVEPKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGE

Query:  HRQLTDVYFIPRLKANLVSLGQLD---------NQRRLLTQARRTTNR------LYILELEIDQPVSLSAKTEEVSW-RWHARYGHLNFPALEKLQKKEL
           L DV  +P L+ NL+S   LD         NQ+  LT+      +      LY    EI Q   L+A  +E+S   WH R GH++   L+ L KK L
Subjt:  HRQLTDVYFIPRLKANLVSLGQLD---------NQRRLLTQARRTTNR------LYILELEIDQPVSLSAKTEEVSW-RWHARYGHLNFPALEKLQKKEL

Query:  VHGLPEIKGVN-KLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK
        +      KG   K CD CL GKQ R  F + ++ R    L+LV+ D+CGP++  + GG   F+  +DD SR +W+ +L+ K
Subjt:  VHGLPEIKGVN-KLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTLLQAK

P25384 Transposon Ty2-C Gag-Pol polyprotein5.5e-0836.27Show/hide
Query:  HARYGHLNFPALEKLQKKELVHGLPE--IKGVNK---LCDGCLIGK--QRRTPFPSRTAYRAD-EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFM
        H   GH NF +++K  KK  V  L E  I+  N     C  CLIGK  + R    SR  Y+   EP + +H DI GP+        S F+   D+K+RF 
Subjt:  HARYGHLNFPALEKLQKKELVHGLPE--IKGVNK---LCDGCLIGK--QRRTPFPSRTAYRAD-EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFM

Query:  WL
        W+
Subjt:  WL

Q03494 Transposon Ty2-DR2 Gag-Pol polyprotein3.2e-0825.89Show/hide
Query:  SRNCWSKLRNKEKAEKAH-VAQSEEDEPALFMVTACVPNFDSKSDDDVEPKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNH
        +RN      +K +A KAH +A S +       V +   N  + S   +    EL LG           Q +E +    I   DE  +H   ++D+GA+  
Subjt:  SRNCWSKLRNKEKAEKAH-VAQSEEDEPALFMVTACVPNFDSKSDDDVEPKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRDEQHEHQQWILDTGATNH

Query:  MTGARSAFSKLDSGIRGTVKFGDGSV--VEIEGRGTILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNR--------------LY
         T  RSA     +     +   D     + I   G + F  + G    +  ++  P +  +L+SL +L NQ       R T  R               Y
Subjt:  MTGARSAFSKLDSGIRGTVKFGDGSV--VEIEGRGTILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNR--------------LY

Query:  ILELEIDQPVSLSA-------KTEEVSW----RWHARYGHLNFPALEKLQKKELVHGLPE--IKGVNK---LCDGCLIGK--QRRTPFPSRTAYRAD-EP
         L  +   P  +S        K++ V+       H   GH NF +++K  KK  V  L E  I+  N     C  CLIGK  + R    SR  Y+   EP
Subjt:  ILELEIDQPVSLSA-------KTEEVSW----RWHARYGHLNFPALEKLQKKELVHGLPE--IKGVNK---LCDGCLIGK--QRRTPFPSRTAYRAD-EP

Query:  LELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWL
         + +H DI GP+        S F+   D+K+RF W+
Subjt:  LELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWL

Q12472 Transposon Ty2-DR1 Gag-Pol polyprotein5.5e-0836.27Show/hide
Query:  HARYGHLNFPALEKLQKKELVHGLPE--IKGVNK---LCDGCLIGK--QRRTPFPSRTAYRAD-EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFM
        H   GH NF +++K  KK  V  L E  I+  N     C  CLIGK  + R    SR  Y+   EP + +H DI GP+        S F+   D+K+RF 
Subjt:  HARYGHLNFPALEKLQKKELVHGLPE--IKGVNK---LCDGCLIGK--QRRTPFPSRTAYRAD-EPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFM

Query:  WL
        W+
Subjt:  WL

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein1.8e-0632.5Show/hide
Query:  WILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQR
        WI+   A  +MT     F+ LD   + TV   DG+V+ +EG+G +    K G+ + + +V F+P L  N++S G++ ++R
Subjt:  WILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGACAAGGTCCAACTACAACGAGTGGGCCTTGTTGATGCGCGTTAACCTACAGGCGCAAGGGTTATGGCACGCTGTCGAGCCAAAGGAAGGAGAAACAATCGAGTA
TCGGGAGGATCGGCTGGCGTTCGCCGCCATATTACGATCTGTGCCCCCAGAGATGCTGGCTTCACTTTCCACCAAACGCACTGCACAATCGGCCTGGGAAGGAATCAAAT
CCCGTCGGGTCGGTGTACAGCAACTGCGGAAAGAGCTCTCGGAGATTCGCTTTAAGGACGGCGAATCCGTCGATGATTTTTCAATGCGGATCACGGGGCTCGCCAACAGC
ATCACCACCCTCGGTGGTGGCATCAGCGAGGCAGAAATCGTAAAGAAGATGTTGCAGGTCGTTCCCGATCACCTCGAGCAAGTCGCGATTTCAATCGAGACATTGCTTGA
TGTGAACAACCTAACAGTGGAAGAGGTGACCGGAAGACTACGTAACGTCGAGCAGCGGAAGAAGAATATCACTTCAGCTGTTGACAAGGAGGGCCGCCTACTCCTCACGG
AGGAGGAATGGCTTGCACGCCTAAAGCTCCGCGACAACACCGGCGAGAGCAACGGGTCCTCCTCCAGCCGCAAAGGCGGTAAGAAGCCATGGATGTCCCACGGGCGCACG
CGCGGGAAAGATGGGAATCAAAAGAAGGAGTCGACCAATGGCAGGCCGATCCAATGCTCGAACTGTGGGAAGAGAGAACACCTGAGCAGAAATTGCTGGAGCAAGCTGAG
GAATAAGGAGAAGGCCGAAAAGGCCCATGTGGCTCAATCCGAGGAGGACGAGCCAGCTCTCTTCATGGTAACTGCATGCGTCCCCAATTTCGATTCCAAATCCGACGACG
ATGTTGAACCCAAGAAAGAGCTCCAGTTGGGCGTAGCAAAGATGGCACCAGCTGGGGAGCCAATTCAACTGGAGGAGGAGCGAGTGTTCGCTCAGATCGGCAAAAGGGAC
GAGCAACACGAGCACCAGCAATGGATCCTTGACACAGGGGCAACAAACCATATGACCGGGGCTAGATCTGCGTTCTCCAAACTCGACTCGGGGATCCGTGGGACGGTGAA
ATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATCCTGTTCGTCAGTAAGGGAGGCGAGCATCGCCAGCTGACCGACGTCTACTTCATCCCGAGGCTCA
AGGCTAACCTTGTGAGCCTGGGTCAACTCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACCACAAACCGTCTTTACATCCTGGAGTTAGAGATAGACCAACCC
GTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGG
CTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGACGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGG
AGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGCGGTAAGAGTCTCTTCCTCCTGTTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTG
CTGCAAGCGAAAAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGACAAGGTCCAACTACAACGAGTGGGCCTTGTTGATGCGCGTTAACCTACAGGCGCAAGGGTTATGGCACGCTGTCGAGCCAAAGGAAGGAGAAACAATCGAGTA
TCGGGAGGATCGGCTGGCGTTCGCCGCCATATTACGATCTGTGCCCCCAGAGATGCTGGCTTCACTTTCCACCAAACGCACTGCACAATCGGCCTGGGAAGGAATCAAAT
CCCGTCGGGTCGGTGTACAGCAACTGCGGAAAGAGCTCTCGGAGATTCGCTTTAAGGACGGCGAATCCGTCGATGATTTTTCAATGCGGATCACGGGGCTCGCCAACAGC
ATCACCACCCTCGGTGGTGGCATCAGCGAGGCAGAAATCGTAAAGAAGATGTTGCAGGTCGTTCCCGATCACCTCGAGCAAGTCGCGATTTCAATCGAGACATTGCTTGA
TGTGAACAACCTAACAGTGGAAGAGGTGACCGGAAGACTACGTAACGTCGAGCAGCGGAAGAAGAATATCACTTCAGCTGTTGACAAGGAGGGCCGCCTACTCCTCACGG
AGGAGGAATGGCTTGCACGCCTAAAGCTCCGCGACAACACCGGCGAGAGCAACGGGTCCTCCTCCAGCCGCAAAGGCGGTAAGAAGCCATGGATGTCCCACGGGCGCACG
CGCGGGAAAGATGGGAATCAAAAGAAGGAGTCGACCAATGGCAGGCCGATCCAATGCTCGAACTGTGGGAAGAGAGAACACCTGAGCAGAAATTGCTGGAGCAAGCTGAG
GAATAAGGAGAAGGCCGAAAAGGCCCATGTGGCTCAATCCGAGGAGGACGAGCCAGCTCTCTTCATGGTAACTGCATGCGTCCCCAATTTCGATTCCAAATCCGACGACG
ATGTTGAACCCAAGAAAGAGCTCCAGTTGGGCGTAGCAAAGATGGCACCAGCTGGGGAGCCAATTCAACTGGAGGAGGAGCGAGTGTTCGCTCAGATCGGCAAAAGGGAC
GAGCAACACGAGCACCAGCAATGGATCCTTGACACAGGGGCAACAAACCATATGACCGGGGCTAGATCTGCGTTCTCCAAACTCGACTCGGGGATCCGTGGGACGGTGAA
ATTCGGCGACGGCTCCGTCGTCGAGATCGAAGGGCGCGGCACCATCCTGTTCGTCAGTAAGGGAGGCGAGCATCGCCAGCTGACCGACGTCTACTTCATCCCGAGGCTCA
AGGCTAACCTTGTGAGCCTGGGTCAACTCGATAATCAACGACGGCTGCTCACGCAGGCAAGGCGCACCACAAACCGTCTTTACATCCTGGAGTTAGAGATAGACCAACCC
GTTAGCCTCTCGGCCAAGACCGAAGAGGTATCTTGGAGGTGGCACGCAAGGTACGGACACTTAAACTTTCCTGCCCTAGAAAAGCTACAGAAGAAGGAGTTGGTGCACGG
CTTGCCAGAAATCAAAGGCGTGAACAAGCTGTGCGACGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCCCGAACAGCCTACCGAGCCGATGAGCCATTGG
AGCTTGTACACGGCGATATTTGCGGGCCCATCAAGCCGGCGACCCCAGGCGGTAAGAGTCTCTTCCTCCTGTTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCCTG
CTGCAAGCGAAAAGTTAG
Protein sequenceShow/hide protein sequence
MLTRSNYNEWALLMRVNLQAQGLWHAVEPKEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGVQQLRKELSEIRFKDGESVDDFSMRITGLANS
ITTLGGGISEAEIVKKMLQVVPDHLEQVAISIETLLDVNNLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTGESNGSSSSRKGGKKPWMSHGRT
RGKDGNQKKESTNGRPIQCSNCGKREHLSRNCWSKLRNKEKAEKAHVAQSEEDEPALFMVTACVPNFDSKSDDDVEPKKELQLGVAKMAPAGEPIQLEEERVFAQIGKRD
EQHEHQQWILDTGATNHMTGARSAFSKLDSGIRGTVKFGDGSVVEIEGRGTILFVSKGGEHRQLTDVYFIPRLKANLVSLGQLDNQRRLLTQARRTTNRLYILELEIDQP
VSLSAKTEEVSWRWHARYGHLNFPALEKLQKKELVHGLPEIKGVNKLCDGCLIGKQRRTPFPSRTAYRADEPLELVHGDICGPIKPATPGGKSLFLLLVDDKSRFMWLTL
LQAKS