CuGenDBv2

PI0000967 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0000967
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: chr12:14692191..14700507
RNA-Seq Expression: PI0000967
Synteny: PI0000967
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function); GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type


Homology
GenBank top hits (e value, % identity, alignment)
KAA0036824.1 uncharacterized protein E6C27_scaffold20G001240 [Cucumis melo var. makuwa] (e value: 1.4e-58; identity: 35.32%)
Query:  GITTRRKEKLDYAKMIGNSKSQYSCPESTPTNPSTPPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKNKLISTKSGRRKVPLHVNPV
        G TT   E+      I +  + ++  +  P N   PP+   T+ ST     SP V Q      PS          KT+   + I+TK+GR+K+ +++  V
Subjt:  GITTRRKEKLDYAKMIGNSKSQYSCPESTPTNPSTPPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKNKLISTKSGRRKVPLHVNPV

Query:  SIDGLSFHHEAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASL
         IDG+SFHHE +V +WK+VVQRRIADE  +   H S ++IM LI +  L  T+ N+GPFY +L+RE  VN+P  F +PSS  +  IH+RG  F +S A +
Subjt:  SIDGLSFHHEAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASL

Query:  NAFLDIVLPSDPPTNPTPDE-LATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPK
        N FL   +  D   +  P E LA  L+GG +S WP  G  +AA S       Y+F+ ++       S      SH    S  +      + YQ       
Subjt:  NAFLDIVLPSDPPTNPTPDE-LATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPK

Query:  AQAHWTGKPVDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTS
                 VD G+F+ NQL  H+ S+ +K+PI LPRFF + LL  N  +L   +A GP P  +  SY LFQG+HV DI H +    S  +    D   S
Subjt:  AQAHWTGKPVDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTS

Query:  DGGLMLPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSS
          G  +   LA +I+++L++ESR+L+  I+ L+ERR  +D+++ +++ L PS+
Subjt:  DGGLMLPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSS

KAA0063048.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 4.5e-57; identity: 35.76%)
Query:  SQYSCPESTPTNPST--PPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKN---------KLISTKSGRRKVPLHVNPVSIDGLSFHH
        S+Y    S   +P T  P V ++  D+    +    VP+    +VP +            AK+          +ISTK+GR+K+P +V  V IDG+SFH 
Subjt:  SQYSCPESTPTNPST--PPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKN---------KLISTKSGRRKVPLHVNPVSIDGLSFHH

Query:  EAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLP
        E   HKW YVV+RRIADE+ + D ++SY AI+ LI  V+L+ TV  +GPFY +L+RE+ VN+PS F DPS+ ++ K+H+RG+ F +SP  LN +L + LP
Subjt:  EAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLP

Query:  SDPPTN-PTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKP
        +D   + PTP+ LA ELTGG +  WP  G L           AY+ I             K    H++  S  I S   +    + G         TG  
Subjt:  SDPPTN-PTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKP

Query:  VDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDI----EHSLRHLSSSNLPHPDDLRTSDGGLM
        V+   F+ N L  H+D++AI +PIC PR    FLL Q  T L   + +G  P  + L  HLFQG+++PDI    +++    S +   HP    T    L 
Subjt:  VDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDI----EHSLRHLSSSNLPHPDDLRTSDGGLM

Query:  LPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEEAPN
        L   L N +L AL +ES SL+  I DLT+RR ++D ++  +R     S   P+
Subjt:  LPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEEAPN

KAA0067563.1 uncharacterized protein E6C27_scaffold485G00260 [Cucumis melo var. makuwa] (e value: 8.7e-61; identity: 37.59%)
Query:  PVPQLNPASVPSSNPPPPTTQPKTKAKNKLISTKSGRRKVPLHVNPVSI-DGLSFHHEAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLST
        P  Q    SVP+    P  ++ K +   + I+TK+GR+K+PL++  V I DG+SFH E +V +WK+VVQRRIAD+  + D +HS ++IM LI +V L  T
Subjt:  PVPQLNPASVPSSNPPPPTTQPKTKAKNKLISTKSGRRKVPLHVNPVSI-DGLSFHHEAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLST

Query:  VPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFL-DIVLPSDPPTNPTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAA
        + ++G FY +L+RE  VN+P+ F DPSS D+  +H+RG  FT+S   +N FL + V  +  P++P+ + LA+ L GG +S+WP  G  + A S       
Subjt:  VPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFL-DIVLPSDPPTNPTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAA

Query:  YMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKPVDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILH
        Y  + ++   +   S      SH  V+S  ++     Y+             +    VD G F+ NQL  H+ S+ +KLPI LPRFF   LL  N  +L 
Subjt:  YMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKPVDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILH

Query:  PNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTSDGGLMLPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPS
         ++A  P P  L LSY LFQG HVPDI+H++       +    D   +  G  +   LA +IL++L  ESRSL+T I  ++ERR  IDS++ +++   PS
Subjt:  PNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTSDGGLMLPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPS

Query:  SEEA-PN
        S    PN
Subjt:  SEEA-PN

TYK16303.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 9.0e-58; identity: 35.76%)
Query:  SQYSCPESTPTNPST--PPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKN---------KLISTKSGRRKVPLHVNPVSIDGLSFHH
        S+Y    S   +P T  P V ++  D+    +    VP+    +VP +            AK+          +ISTK+GR+K+P +V  V IDG+SFH 
Subjt:  SQYSCPESTPTNPST--PPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKN---------KLISTKSGRRKVPLHVNPVSIDGLSFHH

Query:  EAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLP
        E   HKW YVV+RRIADE+ + D ++SY AI+ LI  V+L+ TV  +GPFY +L+RE+ VN+PS F DPS+ ++ K+H+RG+ F +SP  LN +L + LP
Subjt:  EAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLP

Query:  SDPPTN-PTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKP
        +D   + PTP+ LA ELTGG +  WP  G L           AY+ I+  + L +      +  +H    S  +  H VY                TG  
Subjt:  SDPPTN-PTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKP

Query:  VDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDI----EHSLRHLSSSNLPHPDDLRTSDGGLM
        V+   F+ N L  H+D++AI +PIC PR    FLL Q  T L   + +G  P  + L  HLFQG+++PDI    +++    S +   HP    T    L 
Subjt:  VDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDI----EHSLRHLSSSNLPHPDDLRTSDGGLM

Query:  LPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEEAPN
        L   L N +L AL +ES SL+  I DLT+RR ++D+++  +R     S   P+
Subjt:  LPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEEAPN

XP_008463658.1 PREDICTED: uncharacterized protein LOC103501750 [Cucumis melo] (e value: 3.5e-54; identity: 32.95%)
Query:  NPSTPPV-TQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQP-KTKAKNKLISTKSGRRKVPLHVNPVSIDGLSFHHEAHVHKWKYVVQRRIADESE
        NP  P V ++  SD   N          N  + P+S   P  +QP K K++    +  +GR+K+P  ++ V IDG+SFHHE +V +WK++VQRR+AD   
Subjt:  NPSTPPV-TQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQP-KTKAKNKLISTKSGRRKVPLHVNPVSIDGLSFHHEAHVHKWKYVVQRRIADESE

Query:  VFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLPSD-PPTNPTPDELATELTGG
        V   H S ++IM LI    L  T+ N+GPFY +L+++  VN+P  F DPSS D+  +H+RG  F +S A +N FL   +  D   ++P+ + LA  L+GG
Subjt:  VFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLPSD-PPTNPTPDELATELTGG

Query:  IVSNWPTTGYLSAARSTY--------------TSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKPVDFGSF
         +S+W   G L  A S                +S A+ +     T+L + C+D K                                       VD G+F
Subjt:  IVSNWPTTGYLSAARSTY--------------TSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKPVDFGSF

Query:  VVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTSDGGLMLPPPLANQIL
        + N+L  H+  + +K+PI LPRFF + LL  N  ++  ++ALGP+P  L LSY LFQG+HVPDI+  +     S +    D   S  G  +   LA++I+
Subjt:  VVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTSDGGLMLPPPLANQIL

Query:  HALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEE
        ++L+ ESR+LS  I+ L+E +  +D ++ +++   PS+ +
Subjt:  HALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CJS2 uncharacterized protein LOC103501750 (e value: 1.7e-54; identity: 32.95%)
Query:  NPSTPPV-TQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQP-KTKAKNKLISTKSGRRKVPLHVNPVSIDGLSFHHEAHVHKWKYVVQRRIADESE
        NP  P V ++  SD   N          N  + P+S   P  +QP K K++    +  +GR+K+P  ++ V IDG+SFHHE +V +WK++VQRR+AD   
Subjt:  NPSTPPV-TQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQP-KTKAKNKLISTKSGRRKVPLHVNPVSIDGLSFHHEAHVHKWKYVVQRRIADESE

Query:  VFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLPSD-PPTNPTPDELATELTGG
        V   H S ++IM LI    L  T+ N+GPFY +L+++  VN+P  F DPSS D+  +H+RG  F +S A +N FL   +  D   ++P+ + LA  L+GG
Subjt:  VFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLPSD-PPTNPTPDELATELTGG

Query:  IVSNWPTTGYLSAARSTY--------------TSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKPVDFGSF
         +S+W   G L  A S                +S A+ +     T+L + C+D K                                       VD G+F
Subjt:  IVSNWPTTGYLSAARSTY--------------TSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKPVDFGSF

Query:  VVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTSDGGLMLPPPLANQIL
        + N+L  H+  + +K+PI LPRFF + LL  N  ++  ++ALGP+P  L LSY LFQG+HVPDI+  +     S +    D   S  G  +   LA++I+
Subjt:  VVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTSDGGLMLPPPLANQIL

Query:  HALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEE
        ++L+ ESR+LS  I+ L+E +  +D ++ +++   PS+ +
Subjt:  HALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEE

A0A5A7SZY3 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 6.7e-59; identity: 35.32%)
Query:  GITTRRKEKLDYAKMIGNSKSQYSCPESTPTNPSTPPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKNKLISTKSGRRKVPLHVNPV
        G TT   E+      I +  + ++  +  P N   PP+   T+ ST     SP V Q      PS          KT+   + I+TK+GR+K+ +++  V
Subjt:  GITTRRKEKLDYAKMIGNSKSQYSCPESTPTNPSTPPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKNKLISTKSGRRKVPLHVNPV

Query:  SIDGLSFHHEAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASL
         IDG+SFHHE +V +WK+VVQRRIADE  +   H S ++IM LI +  L  T+ N+GPFY +L+RE  VN+P  F +PSS  +  IH+RG  F +S A +
Subjt:  SIDGLSFHHEAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASL

Query:  NAFLDIVLPSDPPTNPTPDE-LATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPK
        N FL   +  D   +  P E LA  L+GG +S WP  G  +AA S       Y+F+ ++       S      SH    S  +      + YQ       
Subjt:  NAFLDIVLPSDPPTNPTPDE-LATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPK

Query:  AQAHWTGKPVDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTS
                 VD G+F+ NQL  H+ S+ +K+PI LPRFF + LL  N  +L   +A GP P  +  SY LFQG+HV DI H +    S  +    D   S
Subjt:  AQAHWTGKPVDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTS

Query:  DGGLMLPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSS
          G  +   LA +I+++L++ESR+L+  I+ L+ERR  +D+++ +++ L PS+
Subjt:  DGGLMLPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSS

A0A5A7V603 Gag-pol polyprotein (e value: 2.2e-57; identity: 35.76%)
Query:  SQYSCPESTPTNPST--PPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKN---------KLISTKSGRRKVPLHVNPVSIDGLSFHH
        S+Y    S   +P T  P V ++  D+    +    VP+    +VP +            AK+          +ISTK+GR+K+P +V  V IDG+SFH 
Subjt:  SQYSCPESTPTNPST--PPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKN---------KLISTKSGRRKVPLHVNPVSIDGLSFHH

Query:  EAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLP
        E   HKW YVV+RRIADE+ + D ++SY AI+ LI  V+L+ TV  +GPFY +L+RE+ VN+PS F DPS+ ++ K+H+RG+ F +SP  LN +L + LP
Subjt:  EAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLP

Query:  SDPPTN-PTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKP
        +D   + PTP+ LA ELTGG +  WP  G L           AY+ I             K    H++  S  I S   +    + G         TG  
Subjt:  SDPPTN-PTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKP

Query:  VDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDI----EHSLRHLSSSNLPHPDDLRTSDGGLM
        V+   F+ N L  H+D++AI +PIC PR    FLL Q  T L   + +G  P  + L  HLFQG+++PDI    +++    S +   HP    T    L 
Subjt:  VDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDI----EHSLRHLSSSNLPHPDDLRTSDGGLM

Query:  LPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEEAPN
        L   L N +L AL +ES SL+  I DLT+RR ++D ++  +R     S   P+
Subjt:  LPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEEAPN

A0A5A7VHK0 Uncharacterized protein (e value: 4.2e-61; identity: 37.59%)
Query:  PVPQLNPASVPSSNPPPPTTQPKTKAKNKLISTKSGRRKVPLHVNPVSI-DGLSFHHEAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLST
        P  Q    SVP+    P  ++ K +   + I+TK+GR+K+PL++  V I DG+SFH E +V +WK+VVQRRIAD+  + D +HS ++IM LI +V L  T
Subjt:  PVPQLNPASVPSSNPPPPTTQPKTKAKNKLISTKSGRRKVPLHVNPVSI-DGLSFHHEAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLST

Query:  VPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFL-DIVLPSDPPTNPTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAA
        + ++G FY +L+RE  VN+P+ F DPSS D+  +H+RG  FT+S   +N FL + V  +  P++P+ + LA+ L GG +S+WP  G  + A S       
Subjt:  VPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFL-DIVLPSDPPTNPTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAA

Query:  YMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKPVDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILH
        Y  + ++   +   S      SH  V+S  ++     Y+             +    VD G F+ NQL  H+ S+ +KLPI LPRFF   LL  N  +L 
Subjt:  YMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKPVDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILH

Query:  PNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTSDGGLMLPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPS
         ++A  P P  L LSY LFQG HVPDI+H++       +    D   +  G  +   LA +IL++L  ESRSL+T I  ++ERR  IDS++ +++   PS
Subjt:  PNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTSDGGLMLPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPS

Query:  SEEA-PN
        S    PN
Subjt:  SEEA-PN

A0A5D3CWQ1 Gag-pol polyprotein (e value: 4.4e-58; identity: 35.76%)
Query:  SQYSCPESTPTNPST--PPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKN---------KLISTKSGRRKVPLHVNPVSIDGLSFHH
        S+Y    S   +P T  P V ++  D+    +    VP+    +VP +            AK+          +ISTK+GR+K+P +V  V IDG+SFH 
Subjt:  SQYSCPESTPTNPST--PPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKN---------KLISTKSGRRKVPLHVNPVSIDGLSFHH

Query:  EAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLP
        E   HKW YVV+RRIADE+ + D ++SY AI+ LI  V+L+ TV  +GPFY +L+RE+ VN+PS F DPS+ ++ K+H+RG+ F +SP  LN +L + LP
Subjt:  EAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLP

Query:  SDPPTN-PTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKP
        +D   + PTP+ LA ELTGG +  WP  G L           AY+ I+  + L +      +  +H    S  +  H VY                TG  
Subjt:  SDPPTN-PTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKP

Query:  VDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDI----EHSLRHLSSSNLPHPDDLRTSDGGLM
        V+   F+ N L  H+D++AI +PIC PR    FLL Q  T L   + +G  P  + L  HLFQG+++PDI    +++    S +   HP    T    L 
Subjt:  VDFGSFVVNQLKHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDI----EHSLRHLSSSNLPHPDDLRTSDGGLM

Query:  LPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEEAPN
        L   L N +L AL +ES SL+  I DLT+RR ++D+++  +R     S   P+
Subjt:  LPPPLANQILHALSSESRSLSTIIHDLTERRKLIDSIVSYVRLLIPSSEEAPN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTTGTGATACATGTTGCGAGCTTGAGTATTCAGCTGAATGAAGTGTGCGAGCTGATGAATGAGTATGCGGCGAGAAGGCTTATGAACTATGGATCTGCATCTAATCG
TGACAATCAAAATAAGTTCAGAAATTCGTTTTCAAAAAGAAAAGATGAACGAAATATTGAGTTCAAAAAGGACGAAGACTTCAAGTGCAGAGAATGCGGTGGTAAAGGCC
ATTACCAATCTGAGTGTGCTACCTATCTGAGAAGGCAACGCAAAGGATACTCAGCTACACTGTCTGATGAATCAGAGACAAGTCATGACTCTGAGGATGAGTATGAGGAA
GTTATGGAGAAGTGGCAGGAGGATCTGAAAGTTATTGAACGTCAAAAAGATAGAATTCTGGAACTGGTGGAAGACAACCATAGATTGTTGCAAACTATATCAGACATTAA
AAAGGAGCTGAAAAATGCAAAAAATGAAAATGATAGGATGTCCAACTCAGTTCGCATGCTAAATTCAGGAACTAAAGATTTGAATCACATATTGAGTTCAGGTAAATCCG
TGTCCAACAAACAAGGGATCAACAGTGAACAACAATCTCATACACCAGAAAGGGATGATGAGCTTTCTGTCTCTGAACGTGCACCTTCAACAAGGGTTCAAAAGAACCAT
CCCACTAGTGATATTATAGGCCCTTTAGAAACAGGTATTACTACCAGAAGAAAGGAAAAACTAGATTATGCCAAAATGATTGGCAATTCGAAATCCCAGTACTCCTGTCC
TGAGTCAACTCCAACTAATCCTAGCACTCCTCCGGTCACTCAAAATACCTCTGATTCAACTCATAATATTAATGGTTCTCCTCCTGTGCCTCAACTCAATCCTGCTTCTG
TTCCTTCTTCAAATCCTCCTCCTCCTACTACTCAACCTAAAACTAAGGCCAAGAACAAACTTATTTCTACAAAGTCTGGTAGGAGAAAGGTGCCCCTGCATGTTAATCCT
GTCTCCATTGATGGACTTTCCTTCCATCATGAGGCTCATGTCCACAAGTGGAAATATGTGGTTCAACGCCGAATTGCTGATGAGTCTGAAGTTTTCGATGATCATCATTC
CTATCTTGCTATTATGACCCTCATATCTCAAGTCAAACTCCTGTCTACTGTTCCAAACCTTGGTCCGTTCTATCGCAAATTAGTTAGAGAAATGTTTGTTAATATTCCTT
CCTCTTTTGAAGATCCTAGTAGCCCTGATTTTCATAAAATTCATGTGAGAGGTATGGCCTTCACTGTCTCCCCTGCTTCACTAAATGCTTTTCTTGATATTGTTCTTCCC
TCTGATCCTCCAACAAATCCTACTCCTGATGAGCTTGCAACTGAGTTAACGGGAGGTATTGTCTCTAATTGGCCAACTACCGGTTACCTTTCAGCAGCTAGAAGTACATA
CACTTCCCTTGCCGCATACATGTTCATTCGCCAGCTCACATACTTAGCTCAAGCCTGCTCAGATCGCAAGATGCATCGCAGTCATCAGCTCGTCAATTCTCAGCTCATCA
GCTCGCATGAAGTCTATTATAACTACCAAGCCCAAGGCCCAAGGCCCAAGGCCCAAGCCCATTGGACTGGTAAACCTGTTGATTTTGGTTCTTTTGTTGTGAATCAACTC
AAGCATCACATAGATTCTTATGCCATCAAACTCCCTATCTGTCTGCCTCGCTTTTTCTGCAATTTCCTTCTAACCCAGAACCGTACAATTCTTCATCCTAATGAGGCTCT
TGGTCCTTCTCCCTCCAAACTTCAATTAAGCTATCATCTGTTTCAAGGAGCTCATGTACCAGACATTGAGCATTCTCTCCGCCATCTTTCCTCTTCTAATCTTCCTCATC
CTGATGATTTAAGAACTTCTGACGGTGGCCTAATGCTTCCTCCTCCTCTAGCTAATCAAATTTTGCATGCCTTATCCTCTGAATCAAGAAGTCTCAGTACCATCATTCAT
GATCTTACTGAAAGACGAAAGCTTATTGATTCCATCGTCTCTTATGTTCGGTTGCTGATCCCTTCATCAGAGGAAGCTCCTAATGTCCATCCTGATCATTAA
mRNA sequence
ATGGTTGTGATACATGTTGCGAGCTTGAGTATTCAGCTGAATGAAGTGTGCGAGCTGATGAATGAGTATGCGGCGAGAAGGCTTATGAACTATGGATCTGCATCTAATCG
TGACAATCAAAATAAGTTCAGAAATTCGTTTTCAAAAAGAAAAGATGAACGAAATATTGAGTTCAAAAAGGACGAAGACTTCAAGTGCAGAGAATGCGGTGGTAAAGGCC
ATTACCAATCTGAGTGTGCTACCTATCTGAGAAGGCAACGCAAAGGATACTCAGCTACACTGTCTGATGAATCAGAGACAAGTCATGACTCTGAGGATGAGTATGAGGAA
GTTATGGAGAAGTGGCAGGAGGATCTGAAAGTTATTGAACGTCAAAAAGATAGAATTCTGGAACTGGTGGAAGACAACCATAGATTGTTGCAAACTATATCAGACATTAA
AAAGGAGCTGAAAAATGCAAAAAATGAAAATGATAGGATGTCCAACTCAGTTCGCATGCTAAATTCAGGAACTAAAGATTTGAATCACATATTGAGTTCAGGTAAATCCG
TGTCCAACAAACAAGGGATCAACAGTGAACAACAATCTCATACACCAGAAAGGGATGATGAGCTTTCTGTCTCTGAACGTGCACCTTCAACAAGGGTTCAAAAGAACCAT
CCCACTAGTGATATTATAGGCCCTTTAGAAACAGGTATTACTACCAGAAGAAAGGAAAAACTAGATTATGCCAAAATGATTGGCAATTCGAAATCCCAGTACTCCTGTCC
TGAGTCAACTCCAACTAATCCTAGCACTCCTCCGGTCACTCAAAATACCTCTGATTCAACTCATAATATTAATGGTTCTCCTCCTGTGCCTCAACTCAATCCTGCTTCTG
TTCCTTCTTCAAATCCTCCTCCTCCTACTACTCAACCTAAAACTAAGGCCAAGAACAAACTTATTTCTACAAAGTCTGGTAGGAGAAAGGTGCCCCTGCATGTTAATCCT
GTCTCCATTGATGGACTTTCCTTCCATCATGAGGCTCATGTCCACAAGTGGAAATATGTGGTTCAACGCCGAATTGCTGATGAGTCTGAAGTTTTCGATGATCATCATTC
CTATCTTGCTATTATGACCCTCATATCTCAAGTCAAACTCCTGTCTACTGTTCCAAACCTTGGTCCGTTCTATCGCAAATTAGTTAGAGAAATGTTTGTTAATATTCCTT
CCTCTTTTGAAGATCCTAGTAGCCCTGATTTTCATAAAATTCATGTGAGAGGTATGGCCTTCACTGTCTCCCCTGCTTCACTAAATGCTTTTCTTGATATTGTTCTTCCC
TCTGATCCTCCAACAAATCCTACTCCTGATGAGCTTGCAACTGAGTTAACGGGAGGTATTGTCTCTAATTGGCCAACTACCGGTTACCTTTCAGCAGCTAGAAGTACATA
CACTTCCCTTGCCGCATACATGTTCATTCGCCAGCTCACATACTTAGCTCAAGCCTGCTCAGATCGCAAGATGCATCGCAGTCATCAGCTCGTCAATTCTCAGCTCATCA
GCTCGCATGAAGTCTATTATAACTACCAAGCCCAAGGCCCAAGGCCCAAGGCCCAAGCCCATTGGACTGGTAAACCTGTTGATTTTGGTTCTTTTGTTGTGAATCAACTC
AAGCATCACATAGATTCTTATGCCATCAAACTCCCTATCTGTCTGCCTCGCTTTTTCTGCAATTTCCTTCTAACCCAGAACCGTACAATTCTTCATCCTAATGAGGCTCT
TGGTCCTTCTCCCTCCAAACTTCAATTAAGCTATCATCTGTTTCAAGGAGCTCATGTACCAGACATTGAGCATTCTCTCCGCCATCTTTCCTCTTCTAATCTTCCTCATC
CTGATGATTTAAGAACTTCTGACGGTGGCCTAATGCTTCCTCCTCCTCTAGCTAATCAAATTTTGCATGCCTTATCCTCTGAATCAAGAAGTCTCAGTACCATCATTCAT
GATCTTACTGAAAGACGAAAGCTTATTGATTCCATCGTCTCTTATGTTCGGTTGCTGATCCCTTCATCAGAGGAAGCTCCTAATGTCCATCCTGATCATTAA
Protein sequence
MVVIHVASLSIQLNEVCELMNEYAARRLMNYGSASNRDNQNKFRNSFSKRKDERNIEFKKDEDFKCRECGGKGHYQSECATYLRRQRKGYSATLSDESETSHDSEDEYEE
VMEKWQEDLKVIERQKDRILELVEDNHRLLQTISDIKKELKNAKNENDRMSNSVRMLNSGTKDLNHILSSGKSVSNKQGINSEQQSHTPERDDELSVSERAPSTRVQKNH
PTSDIIGPLETGITTRRKEKLDYAKMIGNSKSQYSCPESTPTNPSTPPVTQNTSDSTHNINGSPPVPQLNPASVPSSNPPPPTTQPKTKAKNKLISTKSGRRKVPLHVNP
VSIDGLSFHHEAHVHKWKYVVQRRIADESEVFDDHHSYLAIMTLISQVKLLSTVPNLGPFYRKLVREMFVNIPSSFEDPSSPDFHKIHVRGMAFTVSPASLNAFLDIVLP
SDPPTNPTPDELATELTGGIVSNWPTTGYLSAARSTYTSLAAYMFIRQLTYLAQACSDRKMHRSHQLVNSQLISSHEVYYNYQAQGPRPKAQAHWTGKPVDFGSFVVNQL
KHHIDSYAIKLPICLPRFFCNFLLTQNRTILHPNEALGPSPSKLQLSYHLFQGAHVPDIEHSLRHLSSSNLPHPDDLRTSDGGLMLPPPLANQILHALSSESRSLSTIIH
DLTERRKLIDSIVSYVRLLIPSSEEAPNVHPDH