CuGenDBv2

Cmc01g0010511 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0010511
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr01:5518629..5520737
RNA-Seq Expression: Cmc01g0010511
Synteny: Cmc01g0010511
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
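The genome location in this record has the shape seqid:start..end. As a minimal sketch of how such a string might be split into its parts, the snippet below uses a hypothetical helper named `parse_location` (not part of CuGenDB). The span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Hypothetical helper for illustration; the pattern is inferred from
    the single location string shown in this record.
    """
    match = re.fullmatch(
        r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location
    )
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


seqid, start, end = parse_location("CMiso1.1chr01:5518629..5520737")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

If the coordinates turn out to be 0-based or half-open, only the `+ 1` in the span calculation would need to change.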


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 0.0e+00; % identity: 89.64
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNG KGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSAL RVVH  SS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 0.0e+00; % identity: 90.07
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNGRKGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSALTRVVH GSS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 0.0e+00; % identity: 90.07
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNGRKGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSALTRVVH GSS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 0.0e+00; % identity: 90.07
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNGRKGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSALTRVVH GSS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 0.0e+00; % identity: 90.07
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNGRKGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSALTRVVH GSS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value: 0.0e+00; % identity: 90.07
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNGRKGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSALTRVVH GSS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

A0A5A7SNP8 Gag/pol protein; e value: 0.0e+00; % identity: 89.64
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNG KGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSAL RVVH  SS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

A0A5A7TZD7 Gag/pol protein; e value: 0.0e+00; % identity: 90.07
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNGRKGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSALTRVVH GSS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

A0A5D3BHG7 Gag/pol protein; e value: 0.0e+00; % identity: 90.07
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNGRKGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSALTRVVH GSS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

A0A5D3CPJ6 Gag/pol protein; e value: 0.0e+00; % identity: 90.07
Query:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
        +ILN VPSKSVSETPLKLWNGRKGSLRHFRIW CPAHVLENNPKKLEPRSK CLFVGY KGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL
Subjt:  FILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVL

Query:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
        NELSKE TEPSTR+VEEPSALTRVVH GSS RTHQPQSLREPRRSGRVTNLPIRYMSLTETL VISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY
Subjt:  NELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY

Query:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM
        FNSVWDLVDQPD VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS                            TAFLNGNLEETIYM
Subjt:  FNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYM

Query:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG
        QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE CVYKRIINKSVAFLVLYVDDILLI NDIGLLTDIKQWLATQFQMKDLG
Subjt:  QQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLG

Query:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
        EAQFVLGIQIFRDRKNK LALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
Subjt:  EAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRY

Query:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK
        QSNPGLAHW AVKTILKYLRRTRDY  VYGSKDLILTGYTDSDFQTDRD RKSTS                  GCIADSTMEAEYVAACEAAKEAVWLR 
Subjt:  QSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTS------------------GCIADSTMEAEYVAACEAAKEAVWLRK

Query:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL
         LIDLEVVPNMSKPITLYCDNSGAVANSREPRSHK GKHI+RKYHLIREIVHRGDVIVTQIASTHN ADPFTKPLTAKVFEGHLESLGLRDMPHL
Subjt:  ILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHL

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value: 6.1e-75; % identity: 31.05
Query:  SKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVT
        SK C  + + K ++       + NK F++ +     +DH+ E            SK    P+     E +   + +   +  +    + +   RRS R+ 
Subjt:  SKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVT

Query:  NLP-IRYMSLTETL--IVISDGDI--EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKG
          P I Y     +L  +V++   I  + P +F +     DK  W +A+N EL +   N+ W +  +P+    +  +W++  K    G    +KARLVA+G
Subjt:  NLP-IRYMSLTETL--IVISDGDI--EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKG

Query:  YTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYG
        +TQ   +DYEETF+                            TAFLNG L+E IYM+ P+G  I      +CKLN++IYGLKQA+R W   F+ A+K   
Subjt:  YTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYG

Query:  FDQIVDEHCVY---KRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKR
        F     + C+Y   K  IN+++ +++LYVDD+++   D+  + + K++L  +F+M DL E +  +GI+I  + +   + LSQ++Y+ KI+ K++M+N   
Subjt:  FDQIVDEHCVY---KRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKR

Query:  GLLP----FRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLI--
           P      + +  S E C           + P  S +G LMY MLCTRPD+  AV I+SRY S      W  +K +L+YL+ T D M +   K+L   
Subjt:  GLLP----FRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLI--

Query:  --LTGYTDSDFQTDRDFRKSTSG-------------------CIADSTMEAEYVAACEAAKEAVWLRKILIDLEVVPNMSKPITLYCDNSGAVANSREPR
          + GY DSD+      RKST+G                    +A S+ EAEY+A  EA +EA+WL+ +L  + +   +  PI +Y DN G ++ +  P 
Subjt:  --LTGYTDSDFQTDRDFRKSTSG-------------------CIADSTMEAEYVAACEAAKEAVWLRKILIDLEVVPNMSKPITLYCDNSGAVANSREPR

Query:  SHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGL
         HK  KHI  KYH  RE V    + +  I + +  AD FTKPL A  F    + LGL
Subjt:  SHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 3.3e-121; % identity: 36.67
Query:  FILNYVPSKSVS-ETPLKLWNGRKGSLRHFRIWCCP--AHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSK
        +++N  PS  ++ E P ++W  ++ S  H +++ C   AHV +    KL+ +S  C+F+GY     G   +DP   KV  S +  F  E  +R     S+
Subjt:  FILNYVPSKSVS-ETPLKLWNGRKGSLRHFRIWCCP--AHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSK

Query:  IVLNELSKE-ITEPST----------------------RIVEEPSAL----TRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDI
         V N +    +T PST                       ++E+   L      V H       HQP      RRS R      RY S    LI     D 
Subjt:  IVLNELSKE-ITEPST----------------------RIVEEPSAL----TRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDI

Query:  EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS------------
         +P + K+ +   +K++ +KAM  E+ES+  N  + LV+ P   +P+ CKW++K K+  D K+  +KARLV KG+ Q +G+D++E FS            
Subjt:  EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS------------

Query:  ---------------HTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVY-KRIINKSVAFLVLY
                        TAFL+G+LEE IYM+QPEGF + G++  +CKLN+S+YGLKQA R W ++FD+ +KS  + +   + CVY KR    +   L+LY
Subjt:  ---------------HTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVY-KRIINKSVAFLVLY

Query:  VDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRH
        VDD+L++  D GL+  +K  L+  F MKDLG AQ +LG++I R+R ++ L LSQ  YI++++ +++M+N+K    P    + LSK+ CP T ++   M  
Subjt:  VDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRH

Query:  IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTSG-------------
        +PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  NPG  HW AVK IL+YLR T      +G  D IL GYTD+D   D D RKS++G             
Subjt:  IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTSG-------------

Query:  -----CIADSTMEAEYVAACEAAKEAVWLRKILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGAD
             C+A ST EAEY+AA E  KE +WL++ L +L +     K   +YCD+  A+  S+    H   KHI  +YH IRE+V    + V +I++  N AD
Subjt:  -----CIADSTMEAEYVAACEAAKEAVWLRKILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGAD

Query:  PFTKPLTAKVFEGHLESLGL
          TK +    FE   E +G+
Subjt:  PFTKPLTAKVFEGHLESLGL

P25600 Putative transposon Ty5-1 protein YCL074W; e value: 3.5e-30; % identity: 30.65
Query:  TAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDI
        TAFLN  ++E IY++QP GF+       + +L   +YGLKQA   WN   +  +K  GF +   EH +Y R  +    ++ +YVDD+L+      +   +
Subjt:  TAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDI

Query:  KQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCT
        KQ L   + MKDLG+    LG+ I +   N  + LS   YI K   +  +   K    P  +   L +   P   +D+      PY S VG L++     
Subjt:  KQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCT

Query:  RPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDYMPVYGS-KDLILTGYTDSDFQTDRDFRKST-------------------SGCIADSTMEAE
        RPDI Y V ++SR+   P   H  + + +L+YL  TR     Y S   L LT Y D+      D   ST                    G I   + EAE
Subjt:  RPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDYMPVYGS-KDLILTGYTDSDFQTDRDFRKST-------------------SGCIADSTMEAE

Query:  YVAACEAAKE
        Y+ A E   E
Subjt:  YVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 3.6e-67; % identity: 31.46
Query:  PSTRIVEEPSALTRVVHAGSS--IRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDL
        P + ++  P  L ++V+  +   + TH   S+    ++G +   P +Y SL  +L   S     +P T  +A++D   + W  AM  E+ +   N  WDL
Subjt:  PSTRIVEEPSALTRVVHAGSS--IRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDL

Query:  V-DQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYMQQPEGF
        V   P  V  +GC+WI+ +K  +DG +  +KARLVAKGY Q  G+DY ETFS                           + AFL G L + +YM QP GF
Subjt:  V-DQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS---------------------------HTAFLNGNLEETIYMQQPEGF

Query:  IIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVL
        I   +   +CKL +++YGLKQA R+W +     + + GF   V +  ++     KS+ ++++YVDDIL+  ND  LL +    L+ +F +KD  E  + L
Subjt:  IIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVL

Query:  GIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL
        GI+    R    L LSQ  YI  ++ + +M  +K    P      LS     K     E      Y   VGSL Y +  TRPDI YAV  +S++   P  
Subjt:  GIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL

Query:  AHWIAVKTILKYLRRTRDY-MPVYGSKDLILTGYTDSDFQTDRDFRKSTSGCI------------------ADSTMEAEYVAACEAAKEAVWLRKILIDL
         H  A+K IL+YL  T ++ + +     L L  Y+D+D+  D+D   ST+G I                    S+ EAEY +    + E  W+  +L +L
Subjt:  AHWIAVKTILKYLRRTRDY-MPVYGSKDLILTGYTDSDFQTDRDFRKSTSGCI------------------ADSTMEAEYVAACEAAKEAVWLRKILIDL

Query:  EVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMP
         +   +++P  +YCDN GA      P  H   KHI   YH IR  V  G + V  +++    AD  TKPL+   F+     +G+  +P
Subjt:  EVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    5.0e-69    32.44
Query:  DPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLV-DQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS------------
        +P T  +AM+D   D W +AM  E+ +   N  WDLV   P  V  +GC+WI+ +K  +DG +  +KARLVAKGY Q  G+DY ETFS            
Subjt:  DPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLV-DQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS------------

Query:  ---------------HTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYV
                       + AFL G L + +YM QP GF+   +   +C+L ++IYGLKQA R+W +   T + + GF   + +  ++     +S+ ++++YV
Subjt:  ---------------HTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFLVLYV

Query:  DDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHI
        DDIL+  ND  LL      L+ +F +K+  +  + LGI+    R  + L LSQ  Y   ++ + +M  +K    P      L+     K P   E     
Subjt:  DDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHI

Query:  PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDY-MPVYGSKDLILTGYTDSDFQTDRDFRKSTSGCI-----------
         Y   VGSL Y +  TRPD+ YAV  +S+Y   P   HW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G I           
Subjt:  PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDY-MPVYGSKDLILTGYTDSDFQTDRDFRKSTSGCI-----------

Query:  -------ADSTMEAEYVAACEAAKEAVWLRKILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGAD
                 S+ EAEY +    + E  W+  +L +L +   +S P  +YCDN GA      P  H   KHI   YH IR  V  G + V  +++    AD
Subjt:  -------ADSTMEAEYVAACEAAKEAVWLRKILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGAD

Query:  PFTKPLTAKVFEGHLESLGLRDMP
          TKPL+   F+     +G+  +P
Subjt:  PFTKPLTAKVFEGHLESLGLRDMP

Arabidopsis top hits    e value    %identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    3.2e-63    32.02
Query:  EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS------------
        ++P T+ +A E +    W  AM+ E+ +M     W++   P   KPIGCKW+YK K  +DG ++ +KARLVAKGYTQ EG+D+ ETFS            
Subjt:  EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS------------

Query:  ---------------HTAFLNGNLEETIYMQQPEGFIIPGQE----QKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFL
                         AFLNG+L+E IYM+ P G+     +      +C L +SIYGLKQASR W ++F   +  +GF Q   +H  + +I       +
Subjt:  ---------------HTAFLNGNLEETIYMQQPEGFIIPGQE----QKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEHCVYKRIINKSVAFL

Query:  VLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEE
        ++YVDDI++  N+   + ++K  L + F+++DLG  ++ LG++I R      + + Q  Y   ++ +  +   K   +P    VT S      +  D  +
Subjt:  VLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEE

Query:  MRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDYMPVYGSK-DLILTGYTDSDFQTDRDFRKSTSG---------
         +   Y   +G LMY  + TR DI +AV  +S++   P LAH  AV  IL Y++ T      Y S+ ++ L  ++D+ FQ+ +D R+ST+G         
Subjt:  MRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDYMPVYGSK-DLILTGYTDSDFQTDRDFRKSTSG---------

Query:  ---------CIADSTMEAEYVAACEAAKEAVWLRKILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIRE
                  ++ S+ EAEY A   A  E +WL +   +L++   +SKP  L+CDN+ A+  +     H+  KHI+   H +RE
Subjt:  ---------CIADSTMEAEYVAACEAAKEAVWLRKILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein    2.7e-17    32.2
Query:  FLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSK--RGLLPFRHGVTLSKEQCPKTPQ
        +L+LYVDDILL  +   LL  +   L++ F MKDLG   + LGIQI        L LSQ  Y ++I+    M + K     LP +   ++S  + P  P 
Subjt:  FLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSK--RGLLPFRHGVTLSKEQCPKTPQ

Query:  DVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDY-MPVYGSKDLILTGYTDSDFQTDRDFRKSTSG-----
        D        + S VG+L Y  L TRPDI YAV IV +    P LA +  +K +L+Y++ T  + + ++ +  L +  + DSD+      R+ST+G     
Subjt:  DVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDY-MPVYGSKDLILTGYTDSDFQTDRDFRKSTSG-----

Query:  -------------CIADSTMEAEYVAACEAAKEAVW
                      ++ S+ E EY A    A E  W
Subjt:  -------------CIADSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    2.6e-12    45.07
Query:  WIKAMNLELESMYFNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS
        W +AM  EL+++  N  W LV  P     +GCKW++K K  +DG +   KARLVAKG+ Q EG+ + ET+S
Subjt:  WIKAMNLELESMYFNSVWDLVDQPDEVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFS


Sequences
CDS sequence
ATGCAGTGCAAACTGCAGTTTATTTTGAATTATGTTCCATCTAAAAGTGTTTCTGAAACACCTTTAAAATTATGGAATGGTCGTAAAGGTAGTTTACGTCATTTCAGAAT
TTGGTGTTGTCCAGCACACGTGCTTGAGAATAACCCTAAGAAATTGGAACCTCGTTCAAAATCATGTTTATTTGTAGGCTACCGCAAAGGAACTAGAGGTGGTTACTTCT
ATGATCCTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCACATAAGGGAGCACAAACCGCGTAGTAAGATAGTATTAAATGAACTTTCC
AAAGAAATTACTGAACCTTCAACAAGAATTGTTGAAGAGCCTAGTGCATTAACAAGAGTTGTTCATGCCGGTTCATCTATTAGGACACATCAACCTCAATCGTTGAGGGA
ACCTCGACGAAGTGGGAGGGTTACAAACTTACCTATTCGTTATATGAGTTTAACCGAAACCTTAATTGTCATATCTGATGGCGACATTGAGGATCCATTGACTTTTAAGA
AGGCAATGGAGGATGTGGATAAAGATGAATGGATCAAAGCTATGAATCTTGAATTGGAGTCTATGTACTTCAATTCAGTCTGGGATCTTGTAGATCAACCTGATGAGGTA
AAACCTATAGGTTGTAAATGGATCTACAAGAGAAAAAGAGGTGCAGATGGTAAGGTACAAACTTTTAAAGCTAGACTAGTGGCAAAGGGTTATACTCAAGTTGAGGGAGT
TGACTATGAGGAGACTTTCTCACATACTGCCTTTTTGAATGGCAATCTTGAGGAGACCATCTATATGCAACAACCAGAAGGATTCATAATTCCAGGTCAAGAGCAAAAGA
TTTGCAAGCTTAATCGTTCTATTTATGGATTAAAACAAGCTTCTCGATCTTGGAACATAAGATTTGATACCGCAATAAAATCTTATGGATTTGATCAAATCGTTGATGAA
CATTGTGTCTACAAAAGAATCATCAACAAATCAGTAGCTTTCTTAGTTCTGTACGTAGATGATATCCTACTCATTAAGAATGATATAGGTTTACTAACTGACATCAAACA
ATGGCTAGCAACCCAATTTCAAATGAAAGATTTGGGAGAGGCACAGTTTGTTCTGGGTATTCAGATCTTTAGAGATCGTAAGAACAAAACACTAGCTTTGTCTCAAGCAT
CATATATTGACAAAATAGTTGTTAAATATTCAATGCAAAACTCCAAGAGAGGCTTACTACCTTTCAGGCATGGAGTTACTTTGTCTAAGGAACAGTGTCCTAAGACACCT
CAAGACGTTGAGGAAATGAGACATATCCCCTATGCATCAGCTGTTGGCAGCTTGATGTATGCGATGTTATGTACTAGACCTGACATCTGTTATGCGGTGGGGATAGTCAG
TAGATATCAATCTAATCCAGGATTAGCTCATTGGATTGCCGTTAAAACTATCCTCAAGTATCTTAGGAGAACGAGGGACTACATGCCTGTGTATGGTTCTAAGGATTTGA
TTCTTACAGGATACACAGACTCTGACTTTCAGACTGATAGAGATTTTAGGAAATCTACTTCAGGATGTATTGCTGACTCCACTATGGAGGCAGAGTATGTTGCAGCTTGT
GAAGCTGCCAAAGAGGCTGTTTGGCTTAGAAAAATCTTGATTGATTTGGAAGTAGTTCCAAACATGTCAAAGCCAATTACTCTTTACTGTGATAATAGCGGGGCTGTGGC
TAATTCTAGGGAGCCTAGAAGCCACAAGCTTGGAAAGCATATTAAGCGCAAGTATCACTTGATTCGAGAGATAGTGCATCGAGGGGACGTGATCGTCACACAGATAGCTT
CGACACACAATGGTGCTGATCCGTTTACAAAGCCCCTCACGGCTAAGGTGTTTGAGGGTCACCTAGAGAGTCTAGGTCTACGTGACATGCCACATTTAATCTAG
mRNA sequence
ATGCAGTGCAAACTGCAGTTTATTTTGAATTATGTTCCATCTAAAAGTGTTTCTGAAACACCTTTAAAATTATGGAATGGTCGTAAAGGTAGTTTACGTCATTTCAGAAT
TTGGTGTTGTCCAGCACACGTGCTTGAGAATAACCCTAAGAAATTGGAACCTCGTTCAAAATCATGTTTATTTGTAGGCTACCGCAAAGGAACTAGAGGTGGTTACTTCT
ATGATCCTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCACATAAGGGAGCACAAACCGCGTAGTAAGATAGTATTAAATGAACTTTCC
AAAGAAATTACTGAACCTTCAACAAGAATTGTTGAAGAGCCTAGTGCATTAACAAGAGTTGTTCATGCCGGTTCATCTATTAGGACACATCAACCTCAATCGTTGAGGGA
ACCTCGACGAAGTGGGAGGGTTACAAACTTACCTATTCGTTATATGAGTTTAACCGAAACCTTAATTGTCATATCTGATGGCGACATTGAGGATCCATTGACTTTTAAGA
AGGCAATGGAGGATGTGGATAAAGATGAATGGATCAAAGCTATGAATCTTGAATTGGAGTCTATGTACTTCAATTCAGTCTGGGATCTTGTAGATCAACCTGATGAGGTA
AAACCTATAGGTTGTAAATGGATCTACAAGAGAAAAAGAGGTGCAGATGGTAAGGTACAAACTTTTAAAGCTAGACTAGTGGCAAAGGGTTATACTCAAGTTGAGGGAGT
TGACTATGAGGAGACTTTCTCACATACTGCCTTTTTGAATGGCAATCTTGAGGAGACCATCTATATGCAACAACCAGAAGGATTCATAATTCCAGGTCAAGAGCAAAAGA
TTTGCAAGCTTAATCGTTCTATTTATGGATTAAAACAAGCTTCTCGATCTTGGAACATAAGATTTGATACCGCAATAAAATCTTATGGATTTGATCAAATCGTTGATGAA
CATTGTGTCTACAAAAGAATCATCAACAAATCAGTAGCTTTCTTAGTTCTGTACGTAGATGATATCCTACTCATTAAGAATGATATAGGTTTACTAACTGACATCAAACA
ATGGCTAGCAACCCAATTTCAAATGAAAGATTTGGGAGAGGCACAGTTTGTTCTGGGTATTCAGATCTTTAGAGATCGTAAGAACAAAACACTAGCTTTGTCTCAAGCAT
CATATATTGACAAAATAGTTGTTAAATATTCAATGCAAAACTCCAAGAGAGGCTTACTACCTTTCAGGCATGGAGTTACTTTGTCTAAGGAACAGTGTCCTAAGACACCT
CAAGACGTTGAGGAAATGAGACATATCCCCTATGCATCAGCTGTTGGCAGCTTGATGTATGCGATGTTATGTACTAGACCTGACATCTGTTATGCGGTGGGGATAGTCAG
TAGATATCAATCTAATCCAGGATTAGCTCATTGGATTGCCGTTAAAACTATCCTCAAGTATCTTAGGAGAACGAGGGACTACATGCCTGTGTATGGTTCTAAGGATTTGA
TTCTTACAGGATACACAGACTCTGACTTTCAGACTGATAGAGATTTTAGGAAATCTACTTCAGGATGTATTGCTGACTCCACTATGGAGGCAGAGTATGTTGCAGCTTGT
GAAGCTGCCAAAGAGGCTGTTTGGCTTAGAAAAATCTTGATTGATTTGGAAGTAGTTCCAAACATGTCAAAGCCAATTACTCTTTACTGTGATAATAGCGGGGCTGTGGC
TAATTCTAGGGAGCCTAGAAGCCACAAGCTTGGAAAGCATATTAAGCGCAAGTATCACTTGATTCGAGAGATAGTGCATCGAGGGGACGTGATCGTCACACAGATAGCTT
CGACACACAATGGTGCTGATCCGTTTACAAAGCCCCTCACGGCTAAGGTGTTTGAGGGTCACCTAGAGAGTCTAGGTCTACGTGACATGCCACATTTAATCTAG
Protein sequence
MQCKLQFILNYVPSKSVSETPLKLWNGRKGSLRHFRIWCCPAHVLENNPKKLEPRSKSCLFVGYRKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELS
KEITEPSTRIVEEPSALTRVVHAGSSIRTHQPQSLREPRRSGRVTNLPIRYMSLTETLIVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDEV
KPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSHTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDE
HCVYKRIINKSVAFLVLYVDDILLIKNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKTLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTP
QDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWIAVKTILKYLRRTRDYMPVYGSKDLILTGYTDSDFQTDRDFRKSTSGCIADSTMEAEYVAAC
EAAKEAVWLRKILIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKLGKHIKRKYHLIREIVHRGDVIVTQIASTHNGADPFTKPLTAKVFEGHLESLGLRDMPHLI