; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0065501 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0065501
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRibonuclease H
Genome locationCMiso1.1chr03:6925596..6926957
RNA-Seq ExpressionCmc03g0065501
SyntenyCmc03g0065501
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]1.0e-24392.05Show/hide
Query:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE
        MESDKESDDE+   ISSELLR+VEEED+VLGPHQ LVE INLGSQEESKEVKIGTS+TSE+RKK+INLL EYSDIFAWSYQDMPGL+TDIVVHRVPLKPE
Subjt:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE

Query:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
        CNPVRQKLRKMKPD+LIK+KEEVQKQIEAGFL VS+  EWVANIVPVPKKDGKVRMC+DYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
Subjt:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN

Query:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG
        QIKMAEE+REKTTF+TLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSK +EDHTT LQKLFDRLRKYQLKLNPSKCTFGATSG
Subjt:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG

Query:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL
        KLLGFIVSEEGIKVDP KV AIM+MPSP+TEKE+R FLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDC EAFNKIKQYLQSPPVLIPPAP RPL
Subjt:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL

Query:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV
        ILYLTVLE S GGVLGQHDLSGKKEHAIYYLSKKFTDYES+YSMLERTCCALV
Subjt:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV

XP_031738857.1 LOW QUALITY PROTEIN: uncharacterized protein LOC116402780 [Cucumis sativus]1.0e-24392.05Show/hide
Query:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE
        MESDKESDDE+   ISSELLR+VEEED+VLGPHQ LVE INLGSQEESKEVKIGTS+TSE+RKK+INLL EYSDIFAWSYQDMPGL+TDIVVHRVPLKPE
Subjt:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE

Query:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
        CNPVRQKLRKMKPD+LIK+KEEVQKQIEAGFL VS+  EWVANIVPVPKKDGKVRMC+DYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
Subjt:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN

Query:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG
        QIKMAEE+REKTTF+TLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSK +EDHTT LQKLFDRLRKYQLKLNPSKCTFGATSG
Subjt:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG

Query:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL
        KLLGFIVSEEGIKVDP KV AIM+MPSP+TEKE+R FLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDC EAFNKIKQYLQSPPVLIPPAP RPL
Subjt:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL

Query:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV
        ILYLTVLE S GGVLGQHDLSGKKEHAIYYLSKKFTDYES+YSMLERTCCALV
Subjt:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV

XP_031739260.1 LOW QUALITY PROTEIN: uncharacterized protein LOC116402917 [Cucumis sativus]7.8e-24492.27Show/hide
Query:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE
        MESDKESDDE+   ISSELLR+VEEED+VLGPHQ LVE INLGSQEESKEVKIGTS+TSE+RKK+INLL EYSDIFAWSYQDMPGL+TDIVVHRVPLKPE
Subjt:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE

Query:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
        CNPVRQKLRKMKPD+LIKIKEEVQKQIEAGFLTVS+  EWVANIVPVPKKDGKVRMC+DYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
Subjt:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN

Query:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG
        QIKMAEE+REKTTF+TLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSK +EDHTT LQKLFDRLRKYQLKLNPSKCTFGATSG
Subjt:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG

Query:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL
        KLLGFIVSEEGIKVDP KV AIM+MPSP+TEKE+R FLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDC EAFNKIKQYLQSPPVLIPPAP R L
Subjt:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL

Query:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV
        ILYLTVLE S GGVLGQHDLSGKKEHAIYYLSKKFTDYES+YSMLERTCCALV
Subjt:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV

XP_031742360.1 uncharacterized protein LOC116404324 [Cucumis sativus]1.0e-24392.05Show/hide
Query:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE
        MESDKESDDE+   ISSELLR+VEEED+VLGPHQ LVE INLGSQEESKEVKIGTS+TSE+RKK+INLL EYSDIFAWSYQDMPGL+TDIVVHRVPLKPE
Subjt:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE

Query:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
        CNPVRQKLRKMKPD+LIK+KEEVQKQIEAGFL VS+  EWVANIVPVPKKDGKVRMC+DYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
Subjt:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN

Query:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG
        QIKMAEE+REKTTF+TLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSK +EDHTT LQKLFDRLRKYQLKLNPSKCTFGATSG
Subjt:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG

Query:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL
        KLLGFIVSEEGIKVDP KV AIM+MPSP+TEKE+R FLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDC EAFNKIKQYLQSPPVLIPPAP RPL
Subjt:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL

Query:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV
        ILYLTVLE S GGVLGQHDLSGKKEHAIYYLSKKFTDYES+YSMLERTCCALV
Subjt:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV

XP_031742474.1 LOW QUALITY PROTEIN: uncharacterized protein LOC116404360 [Cucumis sativus]1.0e-24392.05Show/hide
Query:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE
        MESDKESDDE+   ISSELLR+VEEED+VLGPHQ LVE INLGSQEESKEVKIGTS+TSE+RKK+INLL EYSDIFAWSYQDMPGL+TDIVVHRVPLKPE
Subjt:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE

Query:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
        CNPVRQKLRKMKPD+LIK+KEEVQKQIEAGFL VS+  EWVANIVPVPKKDGKVRMC+DYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
Subjt:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN

Query:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG
        QIKMAEE+REKTTF+TLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSK +EDHTT LQKLFDRLRKYQLKLNPSKCTFGATSG
Subjt:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG

Query:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL
        KLLGFIVSEEGIKVDP KV AIM+MPSP+TEKE+R FLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDC EAFNKIKQYLQSPPVLIPPAP RPL
Subjt:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL

Query:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV
        ILYLTVLE S GGVLGQHDLSGKKEHAIYYLSKKFTDYES+YSMLERTCCALV
Subjt:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV

TrEMBL top hitse value%identityAlignment
A0A5A7TRL7 Ribonuclease H9.4e-20381.9Show/hide
Query:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE
        MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQ LVEAINLGSQEESKE                               DMP LSTDIVVHRVPLKPE
Subjt:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE

Query:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
        CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVS+  EWVAN+VPVPKKDGKVRMC+DYRDLNRASPKDNF LPHIDMLVDNTA YSTFSFMDGFSGYN
Subjt:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN

Query:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG
        QIKMA+E+REKTTF+TLWGTFCYK                              VYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATS 
Subjt:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG

Query:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL
        KLLGFIVSEEGIKVDP KV AIMEM SPKTEKEIRGFLGRLNYISRFISHLTPTC+PIFKLLRKNNPGKWNEDC EAFNKIKQYLQSPPVLIPPAP RPL
Subjt:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL

Query:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV
        ILYLTVLESS GGVLGQHDLSGKKEHAIYYLSKKFT+YESRYSMLERTCCALV
Subjt:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV

A0A5A7VK41 Reverse transcriptase domain-containing protein2.1e-21886.31Show/hide
Query:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE
        M+ DKESDDEDDVGISSELLRMVEEEDEVLGPHQ LVEAINLGSQEESKE                               DM GLSTDIVVHRVPLKPE
Subjt:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE

Query:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
        CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTV +  EWVANIVPVPKKDGKVRMC+DYRDLNRASPKDN PLPHIDMLV+NTAGYSTFSFMDGFSGYN
Subjt:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN

Query:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG
        QIKMAEE+REK TF+TLWGTF YKVMPFGLKN GATYQRAMVTLFHDMMHKEIEVYVDDMIAKSK DEDHT TLQKLFDRLRKYQLK NPSKCTFGATSG
Subjt:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG

Query:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL
        KLLGFIVSEEGIKVDP KV AIMEM SPK EKEIRGFLGRLNYISRFISHLTPTCE IFKLL KNNPGKWNEDC EAFNKIKQYLQSPPVLIPPA  RPL
Subjt:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL

Query:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV
        ILYLTVLESS  GVLGQHDLSGKKEHAIYYLSKKFTDYESRYS LERTCCALV
Subjt:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCALV

A0A6J1CNY7 Ribonuclease H2.5e-20376.33Show/hide
Query:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE
        +ESDKE DDE     S+ELLRM+EEE+++LGPH+ L E +NLGSQ E+KE+KIGT ++SESRKKLI LLHEY+D+FAWSYQDMPGL TDIVVH++ + P+
Subjt:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE

Query:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
          PVRQKLRKM+PD+LIKIK+EV+KQI+AGFLT+S   EWVANIVPVPKK+G+VRMC+DYRDLNRASPKDNFPLPHID+LVDNTAG+STFSFMDGFSGYN
Subjt:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN

Query:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG
        QIKMA E+REKTTF+TLWGTF YKVM FGLKNAGATYQRAMVTLFHD+MHKEIEVYVDDMIAKSK  E HTT L+KLFDRLRK++LKLNP+KC FGAT+G
Subjt:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG

Query:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL
        KLLGF+VS+EGIKV+  KV AI+EMP P+T+K++RGFLGRLNYI+RFISHLT TCEPIFKLLRKNN G W+E+C  A +KIKQYL  PP+L+PP P RP 
Subjt:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL

Query:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCAL
        ILYL V E+S G VLGQHD SG+KE AIYYLSKKFTD E+RYS +E+TCCAL
Subjt:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCAL

A0A6J1D099 Ribonuclease H2.0e-20577.43Show/hide
Query:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE
        +ESD+E DDE     S+ELLRM+EEE+++LGPH+ L E +NLGSQ E+KE+KIGT ++SESRKKLI LLHEY+D+FAWSY DM GL TDIVVH++P+ PE
Subjt:  MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPE

Query:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN
          P+RQKLRKM+PD+LIKIK EV+KQI+AGFLTVS   EWVANIVPVPKK+G+VRMC+DYRDLNRASPKDNFPLPHID+LVDNTAG+STFSFMDGFSGYN
Subjt:  CNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYN

Query:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG
        QIKMA E+REKTTF+TLWGTFCYKVM FGLKNAGATYQRAMVTLFHD+MHKEIEVYVDDMIAKSK  E+HTT L+KLFDRLRK++LKLN +KC FGAT+G
Subjt:  QIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSG

Query:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL
        KLLGF+VS+EGIKVDP KV AI+EMP P+T+KE+RGFLGRLNYI+RFISHLT TCEPIFKLLRKNN G  +EDC  AF+K+KQYLQ PPVL+PP P R L
Subjt:  KLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPL

Query:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCAL
        ILYLTV E+S G VLGQHD SG+KE AIYYLSKKFTD E+RYS +E+TCCAL
Subjt:  ILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCAL

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222313.8e-20477.85Show/hide
Query:  ISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPECNPVRQKLRKMKPD
        +SSELLRM+EEE++ LGPH    E +NLGSQ E KEVKIGT+++ ESRKKLI LLHE++D+FAWSYQDMP L TDIVVH++P+ PEC PVRQKLRKM+PD
Subjt:  ISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPECNPVRQKLRKMKPD

Query:  VLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTF
        +LIK+K+EV+KQI+AGFLTVS   EWVANIVPVPKK+G+VRMC+DYRDLNRASPKDNFPLPHID+LVDNTAG+STFSFMDGFSGYN+IKMA E+REKTTF
Subjt:  VLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTF

Query:  VTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKV
        +TLWGTFCYKVM FGLKNAGATYQRAMVTLFHD+MHKEIEVYVDDMIAKS+  E+HT  L+KLF+RLRK+QLKLNP+KCTFG TSGKLLGF+V +EGIKV
Subjt:  VTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKV

Query:  DPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGV
        DP KV AI+EM  PKT KE+RGFLGRLNYI+RFISHLT TCEPIFKLLRKNN G W EDC  AF+KIKQY QSPP+L+PP   RPLILYLT+ E+S G V
Subjt:  DPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGV

Query:  LGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCAL
        LGQHD SG+KE AIYYLSKKFTD E+ YS +E+TCCAL
Subjt:  LGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCAL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.7e-4831.44Show/hide
Query:  LTSESRKKLINLLHEYSDIFAWSYQDMPGLS-TDIVVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDG---
        L +E +++L  LL +Y DI    Y +   L+ T+   H +  K    P+  K          +++ ++Q  +  G +  S N  + + I  VPKK     
Subjt:  LTSESRKKLINLLHEYSDIFAWSYQDMPGLS-TDIVVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDG---

Query:  --KVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMH
          K R+ IDYR LN  +  D  P+P++D ++      + F+ +D   G++QI+M  E+  KT F T  G + Y  MPFGLKNA AT+QR M  +   +++
Subjt:  --KVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMH

Query:  KEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISH
        K   VY+DD+I  S   ++H  +L  +F++L K  LKL   KC F       LG +++ +GIK +P K+ AI + P P   KEI+ FLG   Y  +FI +
Subjt:  KEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISH

Query:  LTPTCEPIFKLLRKN-NPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTC
             +P+ K L+KN      N +   AF K+K  +   P+L  P   +   L     + + G VL Q        H + Y+S+   ++E  YS +E+  
Subjt:  LTPTCEPIFKLLRKN-NPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTC

Query:  CALV
         A+V
Subjt:  CALV

P10394 Retrovirus-related Pol polyprotein from transposon 4125.0e-4430.23Show/hide
Query:  RKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDG------KVR
        + +L N+  EY DIFA   +  P    ++   ++ LK +  PV  K  +     + +I+ +VQK I+   +  S + ++ + ++ VPKK        K R
Subjt:  RKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDG------KVR

Query:  MCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEV
        + IDYR +N+    D FPLP ID ++D       FS +D  SG++QI++ E +R+ T+F T  G++ +  +PFGLK A  ++QR M   F  +   +  +
Subjt:  MCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEV

Query:  YVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTC
        Y+DD+I     ++     L ++F + R+Y LKL+P KC+F       LG   +++GI  D  K   I   P P      R F+   NY  RFI +     
Subjt:  YVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTC

Query:  EPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCAL
          I +L +KN P +W ++C +AF  +K  L +P +L  P   +   +     + + G VL Q+      +  + Y S+ FT  ES  S  E+   A+
Subjt:  EPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRYSMLERTCCAL

P20825 Retrovirus-related Pol polyprotein from transposon 2978.9e-4930.35Show/hide
Query:  EAINLGSQEESKEVKIG----TSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLT
        E+I    QE  K++         L  E   KL  LL+++ ++    Y++   L+    +  V      +P+  K   +     I+++ +VQ+ +  G + 
Subjt:  EAINLGSQEESKEVKIG----TSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLT

Query:  VS----ENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFG
         S     +  WV    P      K R+ IDYR LN  +  D +P+P++D ++        F+ +D   G++QI+M EE+  KT F T  G + Y  MPFG
Subjt:  VS----ENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFG

Query:  LKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPK
        L+NA AT+QR M  +   +++K   VY+DD+I  S    +H  ++Q +F +L    LKL   KC F       LG IV+ +GIK +P KV AI+  P P 
Subjt:  LKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPK

Query:  TEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNE-DCGEAFNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAI
         +KEIR FLG   Y  +FI +     +P+   L+K       + +  EAF K+K  +   P+L  P   +  +L       + G VL Q+       H I
Subjt:  TEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNE-DCGEAFNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAI

Query:  YYLSKKFTDYESRYSMLERTCCALV
         ++S+   D+E  YS +E+   A+V
Subjt:  YYLSKKFTDYESRYSMLERTCCALV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.4e-4933.42Show/hide
Query:  DMPGLSTDI----VVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHI
        D+P    DI    V H + +KP     R +   +      +I + VQK ++  F+  S++    + +V VPKKDG  R+C+DYR LN+A+  D FPLP I
Subjt:  DMPGLSTDI----VVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHI

Query:  DMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKL
        D L+        F+ +D  SGY+QI M  ++R KT FVT  G + Y VMPFGL NA +T+ R M   F D+  + + VY+DD++  S+  E+H   L  +
Subjt:  DMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKL

Query:  FDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEA
         +RL+   L +   KC F +   + LG+ +  + I    +K +AI + P+PKT K+ + FLG +NY  RFI + +   +PI   +   +  +W E   +A
Subjt:  FDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEA

Query:  FNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRY
          K+K  L + PVL+P        L     +   G VL + D   K    + Y SK     +  Y
Subjt:  FNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRY

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.0e-5033.42Show/hide
Query:  DMPGLSTDI----VVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHI
        D+P    DI    V H + +KP     R +   +      +I + VQK ++  F+  S++    + +V VPKKDG  R+C+DYR LN+A+  D FPLP I
Subjt:  DMPGLSTDI----VVHRVPLKPECNPVRQKLRKMKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHI

Query:  DMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKL
        D L+        F+ +D  SGY+QI M  ++R KT FVT  G + Y VMPFGL NA +T+ R M   F D+  + + VY+DD++  S+  E+H   L  +
Subjt:  DMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGTFCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKL

Query:  FDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEA
         +RL+   L +   KC F +   + LG+ +  + I    +K +AI + P+PKT K+ + FLG +NY  RFI + +   +PI   +   +  +W E   +A
Subjt:  FDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEA

Query:  FNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRY
         +K+K  L + PVL+P        L     +   G VL + D   K    + Y SK     +  Y
Subjt:  FNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYESRY

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.9e-1537.72Show/hide
Query:  KYQLKLNPSKCTFGATSGKLLG--FIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNK
        ++Q   N  KC FG      LG   I+S EG+  DP K+ A++  P PK   E+RGFLG   Y  RF+ +      P+ +LL+KN+  KW E    AF  
Subjt:  KYQLKLNPSKCTFGATSGKLLG--FIVSEEGIKVDPYKVSAIMEMPSPKTEKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNK

Query:  IKQYLQSPPVLIPP
        +K  + + PVL  P
Subjt:  IKQYLQSPPVLIPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCTGACAAGGAAAGTGACGATGAAGATGATGTGGGAATATCCTCAGAATTGTTAAGAATGGTAGAAGAAGAAGATGAGGTTCTTGGTCCTCACCAAGGACTAGT
TGAAGCAATCAATTTGGGTTCTCAAGAGGAGTCAAAAGAGGTAAAAATTGGCACATCATTGACCAGTGAATCACGGAAAAAACTGATCAATTTGTTACATGAGTATTCAG
ATATTTTTGCTTGGAGTTATCAGGATATGCCAGGATTAAGTACGGATATTGTAGTACACCGAGTTCCTTTAAAACCAGAATGCAACCCAGTGAGACAAAAGCTACGTAAA
ATGAAACCTGATGTACTGATTAAAATAAAGGAGGAAGTACAGAAGCAAATTGAAGCAGGATTCCTCACAGTCTCAGAAAATCTTGAATGGGTGGCAAACATTGTTCCAGT
GCCCAAAAAAGATGGAAAAGTGAGAATGTGCATAGATTATAGAGATTTAAATCGAGCCAGTCCAAAAGATAACTTCCCTTTACCTCATATCGACATGTTGGTGGATAACA
CTGCAGGATACTCAACTTTCTCCTTCATGGATGGTTTTTCAGGATATAATCAGATCAAAATGGCTGAAGAAAATAGAGAAAAAACGACATTCGTTACCCTTTGGGGAACA
TTCTGCTACAAAGTAATGCCTTTTGGTTTGAAAAATGCTGGGGCAACGTATCAGCGAGCAATGGTTACACTTTTTCATGATATGATGCATAAAGAAATAGAGGTGTATGT
TGATGATATGATCGCAAAATCTAAAGTAGATGAAGATCATACAACCACACTCCAAAAATTATTTGATCGGTTGAGAAAATATCAATTGAAATTAAATCCATCCAAATGCA
CATTTGGGGCAACTTCGGGAAAGCTACTGGGATTCATCGTCAGCGAAGAAGGGATCAAAGTGGACCCATATAAAGTAAGCGCTATCATGGAAATGCCATCCCCTAAAACA
GAAAAAGAAATTCGAGGTTTTCTGGGAAGATTGAACTACATATCCAGATTTATCTCACATTTAACACCAACATGCGAACCAATCTTCAAGCTCTTACGTAAAAACAACCC
AGGAAAATGGAATGAGGATTGTGGAGAAGCTTTCAATAAAATTAAGCAGTATTTGCAGAGCCCACCAGTACTGATACCCCCAGCTCCATGGCGACCGTTAATCTTGTATT
TAACAGTATTAGAAAGTTCCACAGGTGGTGTTCTAGGACAACATGATTTATCTGGAAAGAAGGAACATGCCATTTATTATTTGAGCAAAAAGTTTACCGATTATGAATCT
AGATACTCAATGTTAGAGCGAACTTGTTGTGCTTTGGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATCTGACAAGGAAAGTGACGATGAAGATGATGTGGGAATATCCTCAGAATTGTTAAGAATGGTAGAAGAAGAAGATGAGGTTCTTGGTCCTCACCAAGGACTAGT
TGAAGCAATCAATTTGGGTTCTCAAGAGGAGTCAAAAGAGGTAAAAATTGGCACATCATTGACCAGTGAATCACGGAAAAAACTGATCAATTTGTTACATGAGTATTCAG
ATATTTTTGCTTGGAGTTATCAGGATATGCCAGGATTAAGTACGGATATTGTAGTACACCGAGTTCCTTTAAAACCAGAATGCAACCCAGTGAGACAAAAGCTACGTAAA
ATGAAACCTGATGTACTGATTAAAATAAAGGAGGAAGTACAGAAGCAAATTGAAGCAGGATTCCTCACAGTCTCAGAAAATCTTGAATGGGTGGCAAACATTGTTCCAGT
GCCCAAAAAAGATGGAAAAGTGAGAATGTGCATAGATTATAGAGATTTAAATCGAGCCAGTCCAAAAGATAACTTCCCTTTACCTCATATCGACATGTTGGTGGATAACA
CTGCAGGATACTCAACTTTCTCCTTCATGGATGGTTTTTCAGGATATAATCAGATCAAAATGGCTGAAGAAAATAGAGAAAAAACGACATTCGTTACCCTTTGGGGAACA
TTCTGCTACAAAGTAATGCCTTTTGGTTTGAAAAATGCTGGGGCAACGTATCAGCGAGCAATGGTTACACTTTTTCATGATATGATGCATAAAGAAATAGAGGTGTATGT
TGATGATATGATCGCAAAATCTAAAGTAGATGAAGATCATACAACCACACTCCAAAAATTATTTGATCGGTTGAGAAAATATCAATTGAAATTAAATCCATCCAAATGCA
CATTTGGGGCAACTTCGGGAAAGCTACTGGGATTCATCGTCAGCGAAGAAGGGATCAAAGTGGACCCATATAAAGTAAGCGCTATCATGGAAATGCCATCCCCTAAAACA
GAAAAAGAAATTCGAGGTTTTCTGGGAAGATTGAACTACATATCCAGATTTATCTCACATTTAACACCAACATGCGAACCAATCTTCAAGCTCTTACGTAAAAACAACCC
AGGAAAATGGAATGAGGATTGTGGAGAAGCTTTCAATAAAATTAAGCAGTATTTGCAGAGCCCACCAGTACTGATACCCCCAGCTCCATGGCGACCGTTAATCTTGTATT
TAACAGTATTAGAAAGTTCCACAGGTGGTGTTCTAGGACAACATGATTTATCTGGAAAGAAGGAACATGCCATTTATTATTTGAGCAAAAAGTTTACCGATTATGAATCT
AGATACTCAATGTTAGAGCGAACTTGTTGTGCTTTGGTATGA
Protein sequenceShow/hide protein sequence
MESDKESDDEDDVGISSELLRMVEEEDEVLGPHQGLVEAINLGSQEESKEVKIGTSLTSESRKKLINLLHEYSDIFAWSYQDMPGLSTDIVVHRVPLKPECNPVRQKLRK
MKPDVLIKIKEEVQKQIEAGFLTVSENLEWVANIVPVPKKDGKVRMCIDYRDLNRASPKDNFPLPHIDMLVDNTAGYSTFSFMDGFSGYNQIKMAEENREKTTFVTLWGT
FCYKVMPFGLKNAGATYQRAMVTLFHDMMHKEIEVYVDDMIAKSKVDEDHTTTLQKLFDRLRKYQLKLNPSKCTFGATSGKLLGFIVSEEGIKVDPYKVSAIMEMPSPKT
EKEIRGFLGRLNYISRFISHLTPTCEPIFKLLRKNNPGKWNEDCGEAFNKIKQYLQSPPVLIPPAPWRPLILYLTVLESSTGGVLGQHDLSGKKEHAIYYLSKKFTDYES
RYSMLERTCCALV