; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0197181 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0197181
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr07:20451436..20452641
RNA-Seq ExpressionCmc07g0197181
SyntenyCmc07g0197181
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-21594.47Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY
        MDVKT FLNGNLEESIFMSQPEGFITQG+EQKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI ND+GY
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME
        MLCTRPDI Y V IVSRYQSNPGLDHWTAVK++LKYLRRTRDYMLVY AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL GGAVVW SIKQGCIADSTME
Subjt:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME

Query:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF
        AEY+A C AAKE VWLRKFLHDLEVVPNMNLPITLYCDN GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLT KVF
Subjt:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-21393.72Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY
        MDVKT FLNGNLEESIFMSQPEGFITQG+EQKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI ND+GY
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
        LTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME
        MLCTRPDI Y V IVSRYQSNPGLDHWTAVK+ILKYLRRTRDYMLVY AKDLILTGYT+SDFQTDKDSRKSTS SVFTL GGAVVW SIKQGCIADSTME
Subjt:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME

Query:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF
        AEY+A C AAKE VWL+KFLHDLEVVPNMNLPITLYCDN GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLT KVF
Subjt:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-21594.47Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY
        MDVKT FLNGNLEESIFMSQPEGFITQG+EQKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI ND+GY
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME
        MLCTRPDI Y V IVSRYQSNPGLDHWTAVK++LKYLRRTRDYMLVY AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL GGAVVW SIKQGCIADSTME
Subjt:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME

Query:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF
        AEY+A C AAKE VWLRKFLHDLEVVPNMNLPITLYCDN GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLT KVF
Subjt:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF

KAA0062926.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-21397.63Show/hide
Query:  QPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGE
        +PEGFITQG++QKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRND+GYLTDVKAWLAAQFQMKDLGE
Subjt:  QPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGE

Query:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ
        AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ
Subjt:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ

Query:  SNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVWLRKF
        SNPGLDHWT VKMILKYLRRTRDYMLVY AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL GGAVVWCSIKQGCIADSTMEAEYIATC AAKEVVWLRKF
Subjt:  SNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVWLRKF

Query:  LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG
        LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG
Subjt:  LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG

TYK16417.1 gag/pol protein [Cucumis melo var. makuwa]7.4e-21497.89Show/hide
Query:  QPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGE
        +PEGFITQG+EQKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRND+GYLTDVKAWLAAQFQMKDLGE
Subjt:  QPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGE

Query:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ
        AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ
Subjt:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ

Query:  SNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVWLRKF
        SNPGLDHWT VKMILKYLRRTRDYMLVY AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL GGAVVWCSIKQGCIADSTMEAEYIATC AAKEVVWLRKF
Subjt:  SNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVWLRKF

Query:  LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG
        LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG
Subjt:  LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.4e-21393.72Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY
        MDVKT FLNGNLEESIFMSQPEGFITQG+EQKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI ND+GY
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
        LTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME
        MLCTRPDI Y V IVSRYQSNPGLDHWTAVK+ILKYLRRTRDYMLVY AKDLILTGYT+SDFQTDKDSRKSTS SVFTL GGAVVW SIKQGCIADSTME
Subjt:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME

Query:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF
        AEY+A C AAKE VWL+KFLHDLEVVPNMNLPITLYCDN GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLT KVF
Subjt:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF

A0A5A7TZD0 Gag/pol protein8.6e-21694.47Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY
        MDVKT FLNGNLEESIFMSQPEGFITQG+EQKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI ND+GY
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME
        MLCTRPDI Y V IVSRYQSNPGLDHWTAVK++LKYLRRTRDYMLVY AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL GGAVVW SIKQGCIADSTME
Subjt:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME

Query:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF
        AEY+A C AAKE VWLRKFLHDLEVVPNMNLPITLYCDN GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLT KVF
Subjt:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF

A0A5A7UYE8 Gag/pol protein8.6e-21694.47Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY
        MDVKT FLNGNLEESIFMSQPEGFITQG+EQKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI ND+GY
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME
        MLCTRPDI Y V IVSRYQSNPGLDHWTAVK++LKYLRRTRDYMLVY AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL GGAVVW SIKQGCIADSTME
Subjt:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTME

Query:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF
        AEY+A C AAKE VWLRKFLHDLEVVPNMNLPITLYCDN GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLT KVF
Subjt:  AEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF

A0A5A7V901 Gag/pol protein1.0e-21397.63Show/hide
Query:  QPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGE
        +PEGFITQG++QKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRND+GYLTDVKAWLAAQFQMKDLGE
Subjt:  QPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGE

Query:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ
        AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ
Subjt:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ

Query:  SNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVWLRKF
        SNPGLDHWT VKMILKYLRRTRDYMLVY AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL GGAVVWCSIKQGCIADSTMEAEYIATC AAKEVVWLRKF
Subjt:  SNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVWLRKF

Query:  LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG
        LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG
Subjt:  LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG

A0A5D3CWZ1 Gag/pol protein3.6e-21497.89Show/hide
Query:  QPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGE
        +PEGFITQG+EQKVCKLNRSIYGLKQASRSWNI FDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRND+GYLTDVKAWLAAQFQMKDLGE
Subjt:  QPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGE

Query:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ
        AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ
Subjt:  AQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQ

Query:  SNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVWLRKF
        SNPGLDHWT VKMILKYLRRTRDYMLVY AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL GGAVVWCSIKQGCIADSTMEAEYIATC AAKEVVWLRKF
Subjt:  SNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVWLRKF

Query:  LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG
        LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG
Subjt:  LHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFG

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-6836.27Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIRNDM
        MDVKT FLNG L+E I+M  P+G     +   VCKLN++IYGLKQA+R W  +F+ A+K   F  +  + C+Y   K N  +  +++LYVDD+++   DM
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIRNDM

Query:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHL----SKEQCPKTPQEVEDMRRIPYASAV
          + + K +L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ K+L +++M+N      P    ++     S E C             P  S +
Subjt:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHL----SKEQCPKTPQEVEDMRRIPYASAV

Query:  GSLMYVMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYR---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYG-GAVVWCSIK
        G LMY+MLCTRPD++  V I+SRY S    + W  +K +L+YL+ T D  L+++   A +  + GY DSD+   +  RKST+G +F ++    + W + +
Subjt:  GSLMYVMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYR---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYG-GAVVWCSIK

Query:  QGCIADSTMEAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFT
        Q  +A S+ EAEY+A   A +E +WL+  L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ +AD FT
Subjt:  QGCIADSTMEAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFT

Query:  KTLTTKVF
        K L    F
Subjt:  KTLTTKVF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-10046.62Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIRNDMG
        +DVKT FL+G+LEE I+M QPEGF   G++  VCKLN+S+YGLKQA R W + FD+ +KS  + +   +PCVY K+ ++     L+LYVDD+L++  D G
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIRNDMG

Query:  YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMY
         +  +K  L+  F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N+K    P    + LSK+ CP T +E  +M ++PY+SAVGSLMY
Subjt:  YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMY

Query:  VMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTM
         M+CTRPDI++ V +VSR+  NPG +HW AVK IL+YLR T    L +   D IL GYTD+D   D D+RKS++G +FT  GGA+ W S  Q C+A ST 
Subjt:  VMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTM

Query:  EAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF
        EAEYIA     KE++WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH IRE+V    + V KI++  N AD  TK +    F
Subjt:  EAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF

P25600 Putative transposon Ty5-1 protein YCL074W7.6e-3633.33Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY
        MDV T FLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+       
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
           VK  L   + MKDLG+    LG+  I    N  + LS   YI K      +   K    P  +    SK     T   ++D+   PY S VG L++ 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRA-KDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIK-QGCIADST
            RPDISY V ++SR+   P   H  + + +L+YL  TR   L YR+   L LT Y D+      D   ST G V  L G  V W S K +G I   +
Subjt:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRA-KDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIK-QGCIADST

Query:  MEAEYIATCGAAKEV
         EAEYI       E+
Subjt:  MEAEYIATCGAAKEV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.8e-5734.59Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY
        +DV   FL G L + ++MSQP GFI +     VCKL +++YGLKQA R+W +     + + GF  +V +  ++       + ++++YVDDIL+  ND   
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
        L +    L+ +F +KD  E  Y LGI+    R    L LSQ  YI  +L R +M  +K    P      LS     K     E      Y   VGSL Y+
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDY-MLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTM
           TRPDISY V  +S++   P  +H  A+K IL+YL  T ++ + + +   L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ 
Subjt:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDY-MLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTM

Query:  EAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF
        EAEY +    + E+ W+   L +L +   +  P  +YCDN+GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  TK L+   F
Subjt:  EAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-5833.83Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY
        +DV   FL G L + ++MSQP GF+ +     VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++       + ++++YVDDIL+  ND   
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
        L      L+ +F +K+  +  Y LGI+    R  + L LSQ  Y   +L R +M  +K    P      L+     K P   E      Y   VGSL Y+
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDY-MLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTM
           TRPD+SY V  +S+Y   P  DHW A+K +L+YL  T D+ + + +   L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +  S+ 
Subjt:  MLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDY-MLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTM

Query:  EAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF
        EAEY +    + E+ W+   L +L +   ++ P  +YCDN+GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  TK L+   F
Subjt:  EAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.3e-5033.06Show/hide
Query:  MDVKTTFLNGNLEESIFMSQPEGFIT-QGEE---QKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRN
        +D+   FLNG+L+E I+M  P G+   QG+      VC L +SIYGLKQASR W + F   +  +GF Q+  +   + KI       +++YVDDI++  N
Subjt:  MDVKTTFLNGNLEESIFMSQPEGFIT-QGEE---QKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRN

Query:  DMGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGS
        +   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   +L    +   K   +P    V  S      +  +  D +   Y   +G 
Subjt:  DMGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGS

Query:  LMYVMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIA
        LMY+ + TR DIS+ V  +S++   P L H  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L    + W S KQ  ++
Subjt:  LMYVMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIA

Query:  DSTMEAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIRE
         S+ EAEY A   A  E++WL +F  +L++   ++ P  L+CDN  A+  +     H+R KHIE   H +RE
Subjt:  DSTMEAEYIATCGAAKEVVWLRKFLHDLEVVPNMNLPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein4.6e-0434.62Show/hide
Query:  MYVMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRA-KDLILTGYTDSDFQTDKDSRKSTSG
        MY+ + TRPD+++ V  +S++ S        AV  +L Y++ T    L Y A  DL L  + DSD+ +  D+R+S +G
Subjt:  MYVMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDYMLVYRA-KDLILTGYTDSDFQTDKDSRKSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.4e-2133.05Show/hide
Query:  FLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSK--KGLLPFRHGVHLSKEQCPKTPQ
        +L+LYVDDILL  +    L  +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L    M + K     LP +    +S  + P    
Subjt:  FLVLYVDDILLIRNDMGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSK--KGLLPFRHGVHLSKEQCPKTPQ

Query:  EVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDY-MLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL
        +  D R     S VG+L Y+ L TRPDISY V IV +    P L  +  +K +L+Y++ T  + + +++   L +  + DSD+     +R+ST+G    L
Subjt:  EVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQSNPGLDHWTAVKMILKYLRRTRDY-MLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL

Query:  YGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVW
            + W + +Q  ++ S+ E EY A    A E+ W
Subjt:  YGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTCAAGACTACTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTGAAGAGCAAAAAGTTTGCAAGCTGAA
TCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTATGTTTGATACTGCAATCAAATCCTACGGTTTTGACCAAAACGTTGATGAACCTTGTGTATATA
AGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTAGGAATGATATGGGATACCTTACTGACGTTAAAGCTTGGCTAGCAGCC
CAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAA
AATGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGG
ATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGTTATGCTCTGCACTAGGCCAGACATTTCTTATGTAGTGGAAATAGTTAGTAGGTATCAGTCC
AATCCAGGGTTAGACCACTGGACGGCGGTTAAAATGATTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATAGAGCTAAGGATTTGATCCTTACAGGATA
CACTGATTCTGATTTTCAAACCGATAAGGATTCTAGAAAATCTACATCGGGATCAGTGTTCACCCTATATGGAGGAGCTGTAGTATGGTGTAGCATCAAGCAAGGATGCA
TTGCAGACTCTACAATGGAGGCTGAATACATCGCTACTTGTGGAGCAGCAAAAGAAGTAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAACATGAAC
TTGCCCATCACTCTATATTGTGATAACATTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCACCTGATACGGGA
GATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGACTAAAGTGTTTGGGGTCATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGTCAAGACTACTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTGAAGAGCAAAAAGTTTGCAAGCTGAA
TCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTATGTTTGATACTGCAATCAAATCCTACGGTTTTGACCAAAACGTTGATGAACCTTGTGTATATA
AGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTAGGAATGATATGGGATACCTTACTGACGTTAAAGCTTGGCTAGCAGCC
CAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAA
AATGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGG
ATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGTTATGCTCTGCACTAGGCCAGACATTTCTTATGTAGTGGAAATAGTTAGTAGGTATCAGTCC
AATCCAGGGTTAGACCACTGGACGGCGGTTAAAATGATTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATAGAGCTAAGGATTTGATCCTTACAGGATA
CACTGATTCTGATTTTCAAACCGATAAGGATTCTAGAAAATCTACATCGGGATCAGTGTTCACCCTATATGGAGGAGCTGTAGTATGGTGTAGCATCAAGCAAGGATGCA
TTGCAGACTCTACAATGGAGGCTGAATACATCGCTACTTGTGGAGCAGCAAAAGAAGTAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAACATGAAC
TTGCCCATCACTCTATATTGTGATAACATTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCACCTGATACGGGA
GATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGACTAAAGTGTTTGGGGTCATCTAG
Protein sequenceShow/hide protein sequence
MDVKTTFLNGNLEESIFMSQPEGFITQGEEQKVCKLNRSIYGLKQASRSWNIMFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNDMGYLTDVKAWLAA
QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDISYVVEIVSRYQS
NPGLDHWTAVKMILKYLRRTRDYMLVYRAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLYGGAVVWCSIKQGCIADSTMEAEYIATCGAAKEVVWLRKFLHDLEVVPNMN
LPITLYCDNIGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTTKVFGVI