; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0241801 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0241801
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr09:3969985..3971250
RNA-Seq ExpressionCmc09g0241801
SyntenyCmc09g0241801
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]3.8e-20082.62Show/hide
Query:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW
        FLEE H RNHQ R+K+VLKE+ KNATD+PSSSTKVVDK     Q+H SQEL  PRRSGRVV QP+RYLGL E QIIIPDDG+EDPLT+KQ MNDVD D+W
Subjt:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW

Query:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT
        IKAM+LEMESMY N VWTLVD P+DVKPIGCKWIYKRKRDQ GKVQTFKA+LVAKGYTQKEG+D EETFSPVAM+KSIRILLSIATFY+YEIWQMDVKT 
Subjt:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT

Query:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE
        FLNGNLEESIYMVQPE FI + QEQKVCKLQKSIYGLKQAS+SWNIRFDT IKSYGFEQNVDEPCVYK+I+ S VAFL+LYVDDILLIGND+ +LTD+K+
Subjt:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE

Query:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP
        WL TQFQMKDL  AQY+LGIQIVRNRKN+TLAMSQ SYID++LSRYKM NSKKG LP+R+GIHLSKEQCPKTPQEVEDM NIPY+S VGSLMY MLCTRP
Subjt:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP

Query:  DICYLVGIVSRYQSNLGCDH
        DICY VGIVSRYQSN G DH
Subjt:  DICYLVGIVSRYQSNLGCDH

KAA0025729.1 gag/pol protein [Cucumis melo var. makuwa]9.6e-18880.95Show/hide
Query:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW
        FLEE HIRNHQTR+KLVL+EISKNATDRPSSSTK+VDKT NIGQTHPSQEL EPRRS RVVRQPD YLGLS+AQIIIPDDGIEDPLT+KQ MNDVD D+W
Subjt:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW

Query:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT
        IKA+DL+MESMYSN VWTLVDQ ++V+PIGC+WIYKRKRDQ GKVQTFKA+LVAKGYTQ EGID EETFSPV MIKSIRILLSIATFYDYEIW MDVKT 
Subjt:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT

Query:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE
        FLNGNLEESIYMVQ E FIQKGQEQK                                 NVDEPCVYKRII S VAFLVLYVDDILLIGND+GHLTDIKE
Subjt:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE

Query:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP
        WLATQFQM DLENA YV GIQIVRNRKN+TLA+SQTSYID+MLSRYKM NSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYAS VGSLM+ MLCTRP
Subjt:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP

Query:  DICYLVGIVSRYQSNLGCDH
        DICY V IVSRYQSN G DH
Subjt:  DICYLVGIVSRYQSNLGCDH

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-19788.49Show/hide
Query:  KNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLVDQ
        +N +DRPSSSTKVVDKT NIGQTH SQELGE RRSGRVVRQ +RYLGLSEAQIIIP+D IEDPLTFKQ  NDVD D+WIKAMDL+MESMYSN VWTLVDQ
Subjt:  KNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLVDQ

Query:  PNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKG
        PN+++PIGCKWIYKRKRDQ  KVQTF+A+LVAKGYTQKEGID EETFSP+AMIKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESIYMVQPE FIQKG
Subjt:  PNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKG

Query:  QEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQI
        QEQKVCKLQKSIYGLKQAS+SWNIRFDT IKSYGFEQNVDEPCVYKRII STVAFLVLYVDDILLIGN++ HLTDIKEWL TQFQMKDL +AQYVLGIQI
Subjt:  QEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQI

Query:  VRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRPDICYLVGIVSRYQ
        V+NRKN+TLAMSQTSYID+MLSRYKM NSKKGLLPYRYGIHLSKEQCPKTPQEV+DMSNIPYAS VGSLMY MLC RPDICY VGIVSRYQ
Subjt:  VRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRPDICYLVGIVSRYQ

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-21289.05Show/hide
Query:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW
        FLEE HIRNHQTR+KLVL+EISKN TDRPSS TKVVDKT NIGQTH  QELG+PRRSGRVVRQ DRYLGLSEAQIIIPDDGIEDPLT+K  MNDVD D+W
Subjt:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW

Query:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT
        IKAMDLEMESMYSN VWTLVDQPNDVKPIGCKWIYKRKRDQ GKVQTFKA+LVAKGYTQKEGID EE FS  AMIKSIRILLSIATFYDYEIWQMDVKTT
Subjt:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT

Query:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE
        FLN NLEESIYMVQPERFIQKGQEQK+CKLQKSIYGLKQAS+S NIRFDT IKSYG EQNVDEPCVYKRI+ STVAFLVLYVDDILLIGND+GHL DIK+
Subjt:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE

Query:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP
        WLA QFQMKDL NAQYVLG+QIVRNRKN+TLAMSQTSYID+MLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYAS VGSLMY MLCTRP
Subjt:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP

Query:  DICYLVGIVSRYQSNLGCDH
        +ICY VGIVSR QS  G DH
Subjt:  DICYLVGIVSRYQSNLGCDH

TYK06159.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-18881.19Show/hide
Query:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW
        FLEE HIRNHQTR+KLVL+EISKNATDRPSSSTK+VDKT NIGQTHPSQEL EPRRS RVVRQPD YLGLS+AQIIIPDDGIEDPLT+KQ MNDVD D+W
Subjt:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW

Query:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT
        IKA+DL+MESMYSN VWTLVDQ ++V+PIGC+WIYKRKRDQ GKVQTFKA+LVAKGYTQKEGID EETFSPV MIKSIRILLSIATFYDYEIW MDVKT 
Subjt:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT

Query:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE
        FLNGNLEESIYMVQ E FIQKGQEQK                                 NVDEPCVYKRII S VAFLVLYVDDILLIGND+GHLTDIKE
Subjt:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE

Query:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP
        WLATQFQM DLENA YV GIQIVRNRKN+TLA+SQTSYID+MLSRYKM NSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYAS VGSLM+ MLCTRP
Subjt:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP

Query:  DICYLVGIVSRYQSNLGCDH
        DICY V IVSRYQSN G DH
Subjt:  DICYLVGIVSRYQSNLGCDH

TrEMBL top hitse value%identityAlignment
A0A5A7SKC9 Gag/pol protein4.7e-18880.95Show/hide
Query:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW
        FLEE HIRNHQTR+KLVL+EISKNATDRPSSSTK+VDKT NIGQTHPSQEL EPRRS RVVRQPD YLGLS+AQIIIPDDGIEDPLT+KQ MNDVD D+W
Subjt:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW

Query:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT
        IKA+DL+MESMYSN VWTLVDQ ++V+PIGC+WIYKRKRDQ GKVQTFKA+LVAKGYTQ EGID EETFSPV MIKSIRILLSIATFYDYEIW MDVKT 
Subjt:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT

Query:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE
        FLNGNLEESIYMVQ E FIQKGQEQK                                 NVDEPCVYKRII S VAFLVLYVDDILLIGND+GHLTDIKE
Subjt:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE

Query:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP
        WLATQFQM DLENA YV GIQIVRNRKN+TLA+SQTSYID+MLSRYKM NSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYAS VGSLM+ MLCTRP
Subjt:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP

Query:  DICYLVGIVSRYQSNLGCDH
        DICY V IVSRYQSN G DH
Subjt:  DICYLVGIVSRYQSNLGCDH

A0A5A7TTA2 Gag/pol protein8.5e-19888.49Show/hide
Query:  KNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLVDQ
        +N +DRPSSSTKVVDKT NIGQTH SQELGE RRSGRVVRQ +RYLGLSEAQIIIP+D IEDPLTFKQ  NDVD D+WIKAMDL+MESMYSN VWTLVDQ
Subjt:  KNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLVDQ

Query:  PNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKG
        PN+++PIGCKWIYKRKRDQ  KVQTF+A+LVAKGYTQKEGID EETFSP+AMIKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESIYMVQPE FIQKG
Subjt:  PNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKG

Query:  QEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQI
        QEQKVCKLQKSIYGLKQAS+SWNIRFDT IKSYGFEQNVDEPCVYKRII STVAFLVLYVDDILLIGN++ HLTDIKEWL TQFQMKDL +AQYVLGIQI
Subjt:  QEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQI

Query:  VRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRPDICYLVGIVSRYQ
        V+NRKN+TLAMSQTSYID+MLSRYKM NSKKGLLPYRYGIHLSKEQCPKTPQEV+DMSNIPYAS VGSLMY MLC RPDICY VGIVSRYQ
Subjt:  VRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRPDICYLVGIVSRYQ

A0A5D3BX45 Gag/pol protein2.7e-21289.05Show/hide
Query:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW
        FLEE HIRNHQTR+KLVL+EISKN TDRPSS TKVVDKT NIGQTH  QELG+PRRSGRVVRQ DRYLGLSEAQIIIPDDGIEDPLT+K  MNDVD D+W
Subjt:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW

Query:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT
        IKAMDLEMESMYSN VWTLVDQPNDVKPIGCKWIYKRKRDQ GKVQTFKA+LVAKGYTQKEGID EE FS  AMIKSIRILLSIATFYDYEIWQMDVKTT
Subjt:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT

Query:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE
        FLN NLEESIYMVQPERFIQKGQEQK+CKLQKSIYGLKQAS+S NIRFDT IKSYG EQNVDEPCVYKRI+ STVAFLVLYVDDILLIGND+GHL DIK+
Subjt:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE

Query:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP
        WLA QFQMKDL NAQYVLG+QIVRNRKN+TLAMSQTSYID+MLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYAS VGSLMY MLCTRP
Subjt:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP

Query:  DICYLVGIVSRYQSNLGCDH
        +ICY VGIVSR QS  G DH
Subjt:  DICYLVGIVSRYQSNLGCDH

A0A5D3C701 Gag/pol protein1.2e-18881.19Show/hide
Query:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW
        FLEE HIRNHQTR+KLVL+EISKNATDRPSSSTK+VDKT NIGQTHPSQEL EPRRS RVVRQPD YLGLS+AQIIIPDDGIEDPLT+KQ MNDVD D+W
Subjt:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW

Query:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT
        IKA+DL+MESMYSN VWTLVDQ ++V+PIGC+WIYKRKRDQ GKVQTFKA+LVAKGYTQKEGID EETFSPV MIKSIRILLSIATFYDYEIW MDVKT 
Subjt:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT

Query:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE
        FLNGNLEESIYMVQ E FIQKGQEQK                                 NVDEPCVYKRII S VAFLVLYVDDILLIGND+GHLTDIKE
Subjt:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE

Query:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP
        WLATQFQM DLENA YV GIQIVRNRKN+TLA+SQTSYID+MLSRYKM NSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYAS VGSLM+ MLCTRP
Subjt:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP

Query:  DICYLVGIVSRYQSNLGCDH
        DICY V IVSRYQSN G DH
Subjt:  DICYLVGIVSRYQSNLGCDH

E2GK51 Gag/pol protein (Fragment)1.8e-20082.62Show/hide
Query:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW
        FLEE H RNHQ R+K+VLKE+ KNATD+PSSSTKVVDK     Q+H SQEL  PRRSGRVV QP+RYLGL E QIIIPDDG+EDPLT+KQ MNDVD D+W
Subjt:  FLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRW

Query:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT
        IKAM+LEMESMY N VWTLVD P+DVKPIGCKWIYKRKRDQ GKVQTFKA+LVAKGYTQKEG+D EETFSPVAM+KSIRILLSIATFY+YEIWQMDVKT 
Subjt:  IKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTT

Query:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE
        FLNGNLEESIYMVQPE FI + QEQKVCKLQKSIYGLKQAS+SWNIRFDT IKSYGFEQNVDEPCVYK+I+ S VAFL+LYVDDILLIGND+ +LTD+K+
Subjt:  FLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKE

Query:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP
        WL TQFQMKDL  AQY+LGIQIVRNRKN+TLAMSQ SYID++LSRYKM NSKKG LP+R+GIHLSKEQCPKTPQEVEDM NIPY+S VGSLMY MLCTRP
Subjt:  WLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRP

Query:  DICYLVGIVSRYQSNLGCDH
        DICY VGIVSRYQSN G DH
Subjt:  DICYLVGIVSRYQSNLGCDH

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-5335.61Show/hide
Query:  PLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSI
        P +F +     D   W +A++ E+ +   N  WT+  +P +   +  +W++  K +++G    +KA+LVA+G+TQK  ID EETF+PVA I S R +LS+
Subjt:  PLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSI

Query:  ATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVY---KRIIKSTVAFLVLY
           Y+ ++ QMDVKT FLNG L+E IYM  P+          VCKL K+IYGLKQA++ W   F+  +K   F  +  + C+Y   K  I   + +++LY
Subjt:  ATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVY---KRIIKSTVAFLVLY

Query:  VDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLP----YRYGIHLSKEQCPKTPQEVE
        VDD+++   D+  + + K +L  +F+M DL   ++ +GI+I    +   + +SQ++Y+ ++LS++ M N      P      Y +  S E C        
Subjt:  VDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLP----YRYGIHLSKEQCPKTPQEVE

Query:  DMSNIPYASTVGSLMYTMLCTRPDICYLVGIVSRYQS
           N P  S +G LMY MLCTRPD+   V I+SRY S
Subjt:  DMSNIPYASTVGSLMYTMLCTRPDICYLVGIVSRYQS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-8641.57Show/hide
Query:  IRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQT---------------HPSQ--ELGEP-RRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTF
        ++N    N + +   S N T   S++ +V ++    G+                HP+Q  E  +P RRS R   +  RY   S   ++I DD   +P + 
Subjt:  IRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQT---------------HPSQ--ELGEP-RRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTF

Query:  KQEMNDVDYDRWIKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFY
        K+ ++  + ++ +KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KA+LV KG+ QK+GID +E FSPV  + SIR +LS+A   
Subjt:  KQEMNDVDYDRWIKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFY

Query:  DYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVY-KRIIKSTVAFLVLYVDDILL
        D E+ Q+DVKT FL+G+LEE IYM QPE F   G++  VCKL KS+YGLKQA + W ++FD+ +KS  + +   +PCVY KR  ++    L+LYVDD+L+
Subjt:  DYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVY-KRIIKSTVAFLVLYVDDILL

Query:  IGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYAST
        +G D G +  +K  L+  F MKDL  AQ +LG++IVR R +R L +SQ  YI+R+L R+ M N+K    P    + LSK+ CP T +E  +M+ +PY+S 
Subjt:  IGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYAST

Query:  VGSLMYTMLCTRPDICYLVGIVSRYQSNLGCDH
        VGSLMY M+CTRPDI + VG+VSR+  N G +H
Subjt:  VGSLMYTMLCTRPDICYLVGIVSRYQSNLGCDH

P25600 Putative transposon Ty5-1 protein YCL074W3.3e-2131.65Show/hide
Query:  MDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGH
        MDV T FLN  ++E IY+ QP  F+ +     V +L   +YGLKQA   WN   + T+K  GF ++  E  +Y R       ++ +YVDD+L+       
Subjt:  MDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGH

Query:  LTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYT
           +K+ L   + MKDL      LG+ I     N  + +S   YI +  S  +++  K    P    +  SK     T   ++D++  PY S VG L++ 
Subjt:  LTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYT

Query:  MLCTRPDICYLVGIVSRY
            RPDI Y V ++SR+
Subjt:  MLCTRPDICYLVGIVSRY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.0e-5134.97Show/hide
Query:  SQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLV-DQPNDVKPIGCKWIYKRKRDQVGKVQ
        +  +G   ++G +   P   L +S A          +P T  Q + D   +RW  AM  E+ +   N+ W LV   P+ V  +GC+WI+ +K +  G + 
Subjt:  SQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLV-DQPNDVKPIGCKWIYKRKRDQVGKVQ

Query:  TFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNI
         +KA+LVAKGY Q+ G+D  ETFSPV    SIRI+L +A    + I Q+DV   FL G L + +YM QP  FI K +   VCKL+K++YGLKQA ++W +
Subjt:  TFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNI

Query:  RFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRY
             + + GF  +V +  ++      ++ ++++YVDDIL+ GND   L +  + L+ +F +KD E   Y LGI+    R    L +SQ  YI  +L+R 
Subjt:  RFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRY

Query:  KMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRPDICYLVGIVSRY
         M  +K    P      LS     K     E      Y   VGSL Y +  TRPDI Y V  +S++
Subjt:  KMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRPDICYLVGIVSRY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-5236.39Show/hide
Query:  DPLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLV-DQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILL
        +P T  Q M D   DRW +AM  E+ +   N+ W LV   P  V  +GC+WI+ +K +  G +  +KA+LVAKGY Q+ G+D  ETFSPV    SIRI+L
Subjt:  DPLTFKQEMNDVDYDRWIKAMDLEMESMYSNYVWTLV-DQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILL

Query:  SIATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYV
         +A    + I Q+DV   FL G L + +YM QP  F+ K +   VC+L+K+IYGLKQA ++W +   T + + GF  ++ +  ++      ++ ++++YV
Subjt:  SIATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFIQKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYV

Query:  DDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNI
        DDIL+ GND   L    + L+ +F +K+ E+  Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P      L+     K P   E     
Subjt:  DDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNI

Query:  PYASTVGSLMYTMLCTRPDICYLVGIVSRYQSNLGCDH
         Y   VGSL Y +  TRPD+ Y V  +S+Y      DH
Subjt:  PYASTVGSLMYTMLCTRPDICYLVGIVSRYQSNLGCDH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-5034.07Show/hide
Query:  WIKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKT
        W  AMD E+ +M + + W +   P + KPIGCKW+YK K +  G ++ +KA+LVAKGYTQ+EGID  ETFSPV  + S++++L+I+  Y++ + Q+D+  
Subjt:  WIKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKT

Query:  TFLNGNLEESIYMVQPERFIQKGQE----QKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHL
         FLNG+L+E IYM  P  +  +  +      VC L+KSIYGLKQAS+ W ++F  T+  +GF Q+  +   + +I  +    +++YVDDI++  N+   +
Subjt:  TFLNGNLEESIYMVQPERFIQKGQE----QKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHL

Query:  TDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTM
         ++K  L + F+++DL   +Y LG++I R+     + + Q  Y   +L    +   K   +P    +  S           + +    Y   +G LMY  
Subjt:  TDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTM

Query:  LCTRPDICYLVGIVSRY
        + TR DI + V  +S++
Subjt:  LCTRPDICYLVGIVSRY

ATMG00810.1 DNA/RNA polymerases superfamily protein1.3e-0938.52Show/hide
Query:  FLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSK--KGLLPYRYGIHLSKEQCPKTPQ
        +L+LYVDDILL G+    L  +   L++ F MKDL    Y LGIQI  +     L +SQT Y +++L+   M + K     LP +    +S  + P    
Subjt:  FLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNRTLAMSQTSYIDRMLSRYKMHNSK--KGLLPYRYGIHLSKEQCPKTPQ

Query:  EVEDMSNIPYASTVGSLMYTMLCTRPDICYLVGIV
           D S+  + S VG+L Y  L TRPDI Y V IV
Subjt:  EVEDMSNIPYASTVGSLMYTMLCTRPDICYLVGIV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.3e-1441.86Show/hide
Query:  WIKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIA
        W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KA+LVAKG+ Q+EGI   ET+SPV    +IR +L++A
Subjt:  WIKAMDLEMESMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTAGAGGAAGGCCACATAAGAAATCATCAAACTCGCAATAAACTAGTATTAAAAGAAATTTCCAAGAATGCTACAGATAGACCTAGTTCATCTACTAAAGTAGT
AGATAAAACTTGGAATATTGGTCAAACACATCCTTCTCAAGAGTTGGGAGAGCCTCGTCGTAGTGGAAGGGTTGTACGACAGCCTGATCGCTATTTGGGTTTAAGTGAAG
CTCAAATCATCATACCTGATGATGGGATAGAGGATCCATTGACCTTTAAACAGGAAATGAATGATGTGGATTATGATCGATGGATCAAAGCCATGGACCTCGAAATGGAA
TCTATGTATTCCAATTATGTCTGGACTCTAGTAGATCAACCAAATGATGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGTTGGTAAAGTACA
GACTTTCAAAGCTCAACTCGTGGCAAAAGGTTATACACAAAAGGAGGGAATAGATAATGAAGAAACTTTCTCTCCTGTTGCCATGATAAAGTCGATTAGAATACTCTTAT
CCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACCTTTCTGAACGGTAATCTTGAAGAAAGTATTTATATGGTCCAACCAGAAAGGTTTATA
CAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAAAAATCTATTTATGGATTAAAACAAGCTTCTAAATCCTGGAATATAAGGTTTGATACTACGATCAAATCTTATGG
TTTTGAACAAAATGTTGACGAACCTTGTGTTTACAAAAGGATCATTAAATCTACTGTAGCATTCTTAGTTCTGTATGTAGATGACATTCTACTCATTGGGAATGATATAG
GTCATCTAACTGATATTAAAGAATGGCTAGCTACGCAATTCCAAATGAAAGATTTGGAAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGAAACCGAAAGAATAGA
ACACTAGCCATGTCTCAAACATCTTATATAGACAGAATGTTGTCAAGATATAAGATGCATAATTCCAAAAAGGGTTTGTTGCCGTATAGATATGGAATTCATTTATCAAA
AGAACAATGTCCAAAGACACCTCAAGAAGTTGAGGATATGAGTAACATTCCCTATGCTTCTACTGTTGGGAGCCTAATGTATACAATGTTATGTACTAGACCTGACATTT
GCTATTTAGTGGGGATAGTTAGTAGATATCAGTCCAATCTTGGATGTGATCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTAGAGGAAGGCCACATAAGAAATCATCAAACTCGCAATAAACTAGTATTAAAAGAAATTTCCAAGAATGCTACAGATAGACCTAGTTCATCTACTAAAGTAGT
AGATAAAACTTGGAATATTGGTCAAACACATCCTTCTCAAGAGTTGGGAGAGCCTCGTCGTAGTGGAAGGGTTGTACGACAGCCTGATCGCTATTTGGGTTTAAGTGAAG
CTCAAATCATCATACCTGATGATGGGATAGAGGATCCATTGACCTTTAAACAGGAAATGAATGATGTGGATTATGATCGATGGATCAAAGCCATGGACCTCGAAATGGAA
TCTATGTATTCCAATTATGTCTGGACTCTAGTAGATCAACCAAATGATGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGTTGGTAAAGTACA
GACTTTCAAAGCTCAACTCGTGGCAAAAGGTTATACACAAAAGGAGGGAATAGATAATGAAGAAACTTTCTCTCCTGTTGCCATGATAAAGTCGATTAGAATACTCTTAT
CCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACCTTTCTGAACGGTAATCTTGAAGAAAGTATTTATATGGTCCAACCAGAAAGGTTTATA
CAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAAAAATCTATTTATGGATTAAAACAAGCTTCTAAATCCTGGAATATAAGGTTTGATACTACGATCAAATCTTATGG
TTTTGAACAAAATGTTGACGAACCTTGTGTTTACAAAAGGATCATTAAATCTACTGTAGCATTCTTAGTTCTGTATGTAGATGACATTCTACTCATTGGGAATGATATAG
GTCATCTAACTGATATTAAAGAATGGCTAGCTACGCAATTCCAAATGAAAGATTTGGAAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGAAACCGAAAGAATAGA
ACACTAGCCATGTCTCAAACATCTTATATAGACAGAATGTTGTCAAGATATAAGATGCATAATTCCAAAAAGGGTTTGTTGCCGTATAGATATGGAATTCATTTATCAAA
AGAACAATGTCCAAAGACACCTCAAGAAGTTGAGGATATGAGTAACATTCCCTATGCTTCTACTGTTGGGAGCCTAATGTATACAATGTTATGTACTAGACCTGACATTT
GCTATTTAGTGGGGATAGTTAGTAGATATCAGTCCAATCTTGGATGTGATCATTGA
Protein sequenceShow/hide protein sequence
MFLEEGHIRNHQTRNKLVLKEISKNATDRPSSSTKVVDKTWNIGQTHPSQELGEPRRSGRVVRQPDRYLGLSEAQIIIPDDGIEDPLTFKQEMNDVDYDRWIKAMDLEME
SMYSNYVWTLVDQPNDVKPIGCKWIYKRKRDQVGKVQTFKAQLVAKGYTQKEGIDNEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMVQPERFI
QKGQEQKVCKLQKSIYGLKQASKSWNIRFDTTIKSYGFEQNVDEPCVYKRIIKSTVAFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKDLENAQYVLGIQIVRNRKNR
TLAMSQTSYIDRMLSRYKMHNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMSNIPYASTVGSLMYTMLCTRPDICYLVGIVSRYQSNLGCDH