; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0169821 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0169821
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr06:24543604..24544770
RNA-Seq ExpressionCmc06g0169821
SyntenyCmc06g0169821
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-20793.08Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDLRFQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMVRS MSYAQL  SFWGYAVETA+HILNNVPSKSVSETPFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYTQREGVDYEETFSPVAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]2.3e-20491.79Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDL FQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMVRS MSYAQL  SFWGYAVETA+HILNNVPSKSVSETPFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-20793.08Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDLRFQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMVRS MSYAQL  SFWGYAVETA+HILNNVPSKSVSETPFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYTQREGVDYEETFSPVAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]7.2e-20692.56Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDLRFQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMVRS MSYAQL  SFWGYAVETA+HILNNVPSKSVSETPFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYTQREGVDYEETFSPVAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPE FITQGQEQKVCKLN SIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]7.2e-20692.31Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDLRFQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMV S MSYAQL  SFWGYAVETA+HILNNVPSKSVS+ PFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYTQREGVDYEETFSPVAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.1e-20491.79Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDL FQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMVRS MSYAQL  SFWGYAVETA+HILNNVPSKSVSETPFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

A0A5A7TZD0 Gag/pol protein6.4e-20893.08Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDLRFQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMVRS MSYAQL  SFWGYAVETA+HILNNVPSKSVSETPFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYTQREGVDYEETFSPVAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

A0A5A7UYE8 Gag/pol protein6.4e-20893.08Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDLRFQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMVRS MSYAQL  SFWGYAVETA+HILNNVPSKSVSETPFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYTQREGVDYEETFSPVAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

A0A5D3BUN8 Gag/pol protein3.5e-20692.56Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDLRFQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMVRS MSYAQL  SFWGYAVETA+HILNNVPSKSVSETPFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYTQREGVDYEETFSPVAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPE FITQGQEQKVCKLN SIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

A0A5D3CYF4 Gag/pol protein3.5e-20692.31Show/hide
Query:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH
        MDLRFQDYMI+HGIQSQLS PGTPQQNGV ERRNRT LDMV S MSYAQL  SFWGYAVETA+HILNNVPSKSVS+ PFELWRGRKPSLSHFRI GC AH
Subjt:  MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMR+HKPRSKLVL+EATDESTRVV EVGP SRVDETTTSGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRM

Query:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV
        PRRSGR+VSQPN YLGL ETQVVIPDDGVEDPLSYKQ MNDVDKDQ VKAMDLEMESMYFNSVWELVDLPEG  PIGCKWIYKRK+DS GKVQTFKARLV
Subjt:  PRRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--PIGCKWIYKRKKDSPGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
        AKGYTQREGVDYEETFSPVAMLKSIRILLSIA FYDYE+WQMD KTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.2e-4729.39Show/hide
Query:  QDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSV---SETPFELWRGRKPSLSHFRILGCAAHVL
        + + +K GI   L+VP TPQ NGV ER  RT  +  R+ +S A+L  SFWG AV TA +++N +PS+++   S+TP+E+W  +KP L H R+ G   +V 
Subjt:  QDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSV---SETPFELWRGRKPSLSHFRILGCAAHVL

Query:  VTNPK-KLEPRSRLCQFVGYPKETRGGLFFD-------------------------------------------PQENRVFVST----------NATFLE
        + N + K + +S    FVGY  E  G   +D                                           P ++R  + T          N  FL+
Subjt:  VTNPK-KLEPRSRLCQFVGYPKETRGGLFFD-------------------------------------------PQENRVFVST----------NATFLE

Query:  EDHMRDHK----PRSKLVLNEATDESTR----------------VVYEVGPLSRVDETTTS-GQSHPSQSLR-----------------------MPRRS
        +    ++K       K++  E  +ES                   + E     R D    S G  +P++S                         + RRS
Subjt:  EDHMRDHK----PRSKLVLNEATDESTR----------------VVYEVGPLSRVDETTTS-GQSHPSQSLR-----------------------MPRRS

Query:  GRIVSQP----NHYLGLIETQVVIPDDGVED-PLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEGP--IGCKWIYKRKKDSPGKVQTFKARL
         R+ ++P    N     +   V+       D P S+ ++    DK    +A++ E+ +   N+ W +   PE    +  +W++  K +  G    +KARL
Subjt:  GRIVSQP----NHYLGLIETQVVIPDDGVED-PLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEGP--IGCKWIYKRKKDSPGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASR
        VA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ +V QMD KTAFLN  L+E I+M  P+G         VCKLN++IYGLKQA+R
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.8e-7440.34Show/hide
Query:  FQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVS-ETPFELWRGRKPSLSHFRILGCA--AHV
        F++Y   HGI+ + +VPGTPQ NGV ER NRT ++ VRS +  A+L  SFWG AV+TA +++N  PS  ++ E P  +W  ++ S SH ++ GC   AHV
Subjt:  FQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVS-ETPFELWRGRKPSLSHFRILGCA--AHV

Query:  LVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLN--------------------EATDESTRVVYEVGPLS
              KL+ +S  C F+GY  E  G   +DP + +V  S +  F  E  +R     S+ V N                      TDE +    + G + 
Subjt:  LVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLN--------------------EATDESTRVVYEVGPLS

Query:  RVDETTTSG---QSHPSQSLRMP---RRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--P
           E    G     HP+Q        RRS R   +   Y       V+I DD   +P S K+V++  +K+Q +KAM  EMES+  N  ++LV+LP+G  P
Subjt:  RVDETTTSG---QSHPSQSLRMP---RRSGRIVSQPNHYLGLIETQVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEG--P

Query:  IGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVC
        + CKW++K KKD   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+AA  D EV Q+D KTAFL+ +LEE I+M QPEGF   G++  VC
Subjt:  IGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVC

Query:  KLNRSIYGLKQASR
        KLN+S+YGLKQA R
Subjt:  KLNRSIYGLKQASR

P92520 Uncharacterized mitochondrial protein AtMg008208.8e-1342.86Show/hide
Query:  KAMDLEMESMYFNSVWELVDLP--EGPIGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        +AM  E++++  N  W LV  P  +  +GCKW++K K  S G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  KAMDLEMESMYFNSVWELVDLP--EGPIGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-4426.03Show/hide
Query:  DYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVS-ETPFELWRGRKPSLSHFRILGCAAHVLVT-
        +Y  +HGI    S P TP+ NG+ ER++R  ++   + +S+A +  ++W YA   A++++N +P+  +  E+PF+   G  P+    R+ GCA +  +  
Subjt:  DYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVS-ETPFELWRGRKPSLSHFRILGCAAHVLVT-

Query:  -NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-----------DHMRDHKPRSKLVLNEATDESTRVVY-----------------
         N  KL+ +SR C F+GY       L    Q +R+++S +  F E              +++ +  S  V +  T   TR                    
Subjt:  -NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-----------DHMRDHKPRSKLVLNEATDESTRVVY-----------------

Query:  ----------------------------------EVGPLSRVDETTTSGQSHPS-----------------QSLRMPRRSGRIVSQPNHYLGLIETQVVI
                                          + GP      T T  Q+H S                 QSL  P +S      P        T    
Subjt:  ----------------------------------EVGPLSRVDETTTSGQSHPS-----------------QSLRMPRRSGRIVSQPNHYLGLIETQVVI

Query:  PDDGVEDPLSYKQVMND--------------------------------------------VDKDQCVKAMDLEMESMYFNSVWELVDLPEGP---IGCK
        P   +  P    Q++N+                                            +  ++   AM  E+ +   N  W+LV  P      +GC+
Subjt:  PDDGVEDPLSYKQVMND--------------------------------------------VDKDQCVKAMDLEMESMYFNSVWELVDLPEGP---IGCK

Query:  WIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNR
        WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + + Q+D   AFL   L + ++MSQP GFI + +   VCKL +
Subjt:  WIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNR

Query:  SIYGLKQASRS
        ++YGLKQA R+
Subjt:  SIYGLKQASRS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.3e-4225.97Show/hide
Query:  QDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVS-ETPFELWRGRKPSLSHFRILGCAAHVLVT
        +DY+ +HGI    S P TP+ NG+ ER++R  ++M  + +S+A +  ++W YA   A++++N +P+  +  ++PF+   G+ P+    ++ GCA +  + 
Subjt:  QDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVS-ETPFELWRGRKPSLSHFRILGCAAHVLVT

Query:  --NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE---------------EDHMRDHKPR---------SKLV------LNEATDESTR
          N  KLE +S+ C F+GY       L       R++ S +  F E               ++   D  P          + LV      L    D S R
Subjt:  --NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE---------------EDHMRDHKPR---------SKLV------LNEATDESTR

Query:  VVYEVGPL------------SRVDETTTSGQSHPSQS--------------------LRMPRRSGRIVSQPNHYLGLIETQVVIP---------------
              PL            S +   ++S  + PS +                    L  P  +    + PN    L ++ +  P               
Subjt:  VVYEVGPL------------SRVDETTTSGQSHPSQS--------------------LRMPRRSGRIVSQPNHYLGLIETQVVIP---------------

Query:  -------------------------------------DDGVEDP---LSY----------KQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEGP--
                                              DG+  P    SY          +  +  +  D+  +AM  E+ +   N  W+LV  P     
Subjt:  -------------------------------------DDGVEDP---LSY----------KQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEGP--

Query:  -IGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKV
         +GC+WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + + Q+D   AFL   L + ++MSQP GF+ + +   V
Subjt:  -IGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKV

Query:  CKLNRSIYGLKQASRS
        C+L ++IYGLKQA R+
Subjt:  CKLNRSIYGLKQASRS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.4e-3550.69Show/hide
Query:  AMDLEMESMYFNSVWELVDLP--EGPIGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFL
        AMD E+ +M     WE+  LP  + PIGCKW+YK K +S G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L+I+A Y++ + Q+D   AFL
Subjt:  AMDLEMESMYFNSVWELVDLP--EGPIGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIAAFYDYEVWQMDFKTAFL

Query:  NDNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASR
        N +L+E I+M  P G+   QG       VC L +SIYGLKQASR
Subjt:  NDNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASR

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.5e-0735.37Show/hide
Query:  NRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVS-ETPFELWRGRKPSLSHFRILGCAAHVLVTNPKKLEPRSR
        NRT ++ VRS +    L  +F   A  TA+HI+N  PS +++   P E+W    P+ S+ R  GC A++   +  KL+PR++
Subjt:  NRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVS-ETPFELWRGRKPSLSHFRILGCAAHVLVTNPKKLEPRSR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.2e-1442.86Show/hide
Query:  KAMDLEMESMYFNSVWELVDLP--EGPIGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        +AM  E++++  N  W LV  P  +  +GCKW++K K  S G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  KAMDLEMESMYFNSVWELVDLP--EGPIGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGAGATTCCAGGACTATATGATAAAACATGGAATCCAATCCCAACTCTCAGTACCTGGTACACCTCAACAAAATGGTGTATTAGAAAGGAGAAATAGAACCTC
GTTAGACATGGTTCGTTCAAAGATGAGTTACGCTCAATTGCTTTGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAATTCATATCTTGAACAATGTTCCCTCGAAGAGTG
TTTCTGAAACACCTTTCGAGTTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTTGGGTTGTGCAGCACACGTGTTAGTGACAAATCCCAAGAAGTTGGAA
CCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTTTTCGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTT
GGAAGAAGACCACATGAGAGATCATAAACCACGAAGCAAATTAGTATTAAATGAAGCTACTGATGAATCAACAAGGGTTGTTTATGAAGTTGGTCCCTTATCAAGAGTTG
ATGAAACGACCACATCAGGTCAATCTCATCCTTCTCAATCATTGAGAATGCCTCGACGCAGTGGGAGGATTGTATCACAACCTAACCATTATTTGGGTTTAATTGAAACT
CAGGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGTAATGAATGATGTAGATAAGGACCAATGCGTGAAAGCCATGGACCTTGAAATGGAGTC
TATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGCCTATAGGGTGTAAATGGATCTATAAGAGAAAGAAAGATTCACCTGGGAAAGTACAGACCTTCA
AAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCCTGTGGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCC
GCATTTTATGATTATGAAGTATGGCAAATGGATTTCAAGACTGCTTTTTTGAATGACAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGG
TCAAGAGCAAAAAGTTTGTAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTGAGATTCCAGGACTATATGATAAAACATGGAATCCAATCCCAACTCTCAGTACCTGGTACACCTCAACAAAATGGTGTATTAGAAAGGAGAAATAGAACCTC
GTTAGACATGGTTCGTTCAAAGATGAGTTACGCTCAATTGCTTTGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAATTCATATCTTGAACAATGTTCCCTCGAAGAGTG
TTTCTGAAACACCTTTCGAGTTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTTGGGTTGTGCAGCACACGTGTTAGTGACAAATCCCAAGAAGTTGGAA
CCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTTTTCGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTT
GGAAGAAGACCACATGAGAGATCATAAACCACGAAGCAAATTAGTATTAAATGAAGCTACTGATGAATCAACAAGGGTTGTTTATGAAGTTGGTCCCTTATCAAGAGTTG
ATGAAACGACCACATCAGGTCAATCTCATCCTTCTCAATCATTGAGAATGCCTCGACGCAGTGGGAGGATTGTATCACAACCTAACCATTATTTGGGTTTAATTGAAACT
CAGGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGTAATGAATGATGTAGATAAGGACCAATGCGTGAAAGCCATGGACCTTGAAATGGAGTC
TATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGCCTATAGGGTGTAAATGGATCTATAAGAGAAAGAAAGATTCACCTGGGAAAGTACAGACCTTCA
AAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCCTGTGGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCC
GCATTTTATGATTATGAAGTATGGCAAATGGATTTCAAGACTGCTTTTTTGAATGACAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGG
TCAAGAGCAAAAAGTTTGTAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGA
Protein sequenceShow/hide protein sequence
MDLRFQDYMIKHGIQSQLSVPGTPQQNGVLERRNRTSLDMVRSKMSYAQLLCSFWGYAVETAIHILNNVPSKSVSETPFELWRGRKPSLSHFRILGCAAHVLVTNPKKLE
PRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRDHKPRSKLVLNEATDESTRVVYEVGPLSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPNHYLGLIET
QVVIPDDGVEDPLSYKQVMNDVDKDQCVKAMDLEMESMYFNSVWELVDLPEGPIGCKWIYKRKKDSPGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
AFYDYEVWQMDFKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRS