; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0102941 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0102941
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:19824525..19825765
RNA-Seq ExpressionCmc04g0102941
SyntenyCmc04g0102941
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]6.9e-22094.67Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSLMYA
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
        MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME

Query:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
        AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
Subjt:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG

Query:  HLESLGLRDMYIR
        HLESLGLRDMYIR
Subjt:  HLESLGLRDMYIR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-21793.46Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA
        LTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSLMYA
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
        MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTME
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME

Query:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
        AEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
Subjt:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG

Query:  HLESLGLRDMYIR
        HLESLGLRDMYIR
Subjt:  HLESLGLRDMYIR

KAA0040367.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-20588.62Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDVKTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ+SRSWN+RFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA
        LTDVKAWLA QFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDK+L   S                L   Q PKTPQE+EDMRRI YASAVGSLMY 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
        ML TRPDICYAVGIVSRY  NPGLDHWTAVKI+LKYLRRTRDYMLVYG KDLILTGYTDSDFQTDKDSRKSTSGSVFTLN GAVVW SIKQGCIADSTME
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME

Query:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
        AEY+AACEAAKE VWLRKFLHDLEVVPNMNL ITLYCDNSGAVANSKEPR+HKRGKHIERKYHLIREIVQR DVIVTKI SEH I DPFTKTLTAKVFEG
Subjt:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG

Query:  HLESLGLRDMYIR
        HLESLGLRDMYIR
Subjt:  HLESLGLRDMYIR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]6.9e-22094.67Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSLMYA
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
        MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME

Query:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
        AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
Subjt:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG

Query:  HLESLGLRDMYIR
        HLESLGLRDMYIR
Subjt:  HLESLGLRDMYIR

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa]7.4e-20692.68Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL
        MSQPEGFITQ QEQKVCKLNRSIYG KQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGND GYLTDVKAWLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL

Query:  GEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSR
        GEAQYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSR
Subjt:  GEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSR

Query:  YQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHWT VKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTK LTAKVFEGHLESLGLRDMYIR
Subjt:  KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein9.1e-21893.46Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA
        LTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSLMYA
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
        MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTME
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME

Query:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
        AEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
Subjt:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG

Query:  HLESLGLRDMYIR
        HLESLGLRDMYIR
Subjt:  HLESLGLRDMYIR

A0A5A7TZD0 Gag/pol protein3.3e-22094.67Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSLMYA
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
        MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME

Query:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
        AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
Subjt:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG

Query:  HLESLGLRDMYIR
        HLESLGLRDMYIR
Subjt:  HLESLGLRDMYIR

A0A5A7UYE8 Gag/pol protein3.3e-22094.67Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA
        LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSLMYA
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
        MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME

Query:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
        AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
Subjt:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG

Query:  HLESLGLRDMYIR
        HLESLGLRDMYIR
Subjt:  HLESLGLRDMYIR

A0A5A7V1F5 Gag/pol protein3.6e-20692.68Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL
        MSQPEGFITQ QEQKVCKLNRSIYG KQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGND GYLTDVKAWLAAQFQMKDL
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDL

Query:  GEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSR
        GEAQYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSR
Subjt:  GEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSR

Query:  YQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHWT VKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTK LTAKVFEGHLESLGLRDMYIR
Subjt:  KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

A0A5D3DI92 Gag/pol protein1.0e-20588.62Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDVKTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ+SRSWN+RFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA
        LTDVKAWLA QFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDK+L   S                L   Q PKTPQE+EDMRRI YASAVGSLMY 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
        ML TRPDICYAVGIVSRY  NPGLDHWTAVKI+LKYLRRTRDYMLVYG KDLILTGYTDSDFQTDKDSRKSTSGSVFTLN GAVVW SIKQGCIADSTME
Subjt:  MLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME

Query:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG
        AEY+AACEAAKE VWLRKFLHDLEVVPNMNL ITLYCDNSGAVANSKEPR+HKRGKHIERKYHLIREIVQR DVIVTKI SEH I DPFTKTLTAKVFEG
Subjt:  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG

Query:  HLESLGLRDMYIR
        HLESLGLRDMYIR
Subjt:  HLESLGLRDMYIR

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.7e-6837.44Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGNDV
        MDVKTAFLNG L+E I+M  P+G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N  +  +++LYVDD+++   D+
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGNDV

Query:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTP---------QEVEDMRRIPYASAVGSLMYAMLCTR
          + + K +L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ K+L+  ++    +  TP            ++    P  S +G LMY MLCTR
Subjt:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTP---------QEVEDMRRIPYASAVGSLMYAMLCTR

Query:  PDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEA
        PD+  AV I+SRY S    + W  +K VL+YL+ T D  L++    A +  + GY DSD+   +  RKST+G +F + +   + W + +Q  +A S+ EA
Subjt:  PDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEA

Query:  EYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGH
        EY+A  EA +EA+WL+  L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ +AD FTK L A  F   
Subjt:  EYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGH

Query:  LESLGL
         + LGL
Subjt:  LESLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.7e-9945.83Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDVG
        +DVKTAFL+G+LEE I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++     L+LYVDD+L++G D G
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDVG

Query:  YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLL----------------AWSSLVLGQSPKTPQEVEDMRRIPYASAVGSLMY
         +  +K  L+  F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L                    L     P T +E  +M ++PY+SAVGSLMY
Subjt:  YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLL----------------AWSSLVLGQSPKTPQEVEDMRRIPYASAVGSLMY

Query:  AMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTM
        AM+CTRPDI +AVG+VSR+  NPG +HW AVK +L+YLR T    L +G  D IL GYTD+D   D D+RKS++G +FT +GGA+ W+S  Q C+A ST 
Subjt:  AMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTM

Query:  EAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFE
        EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH IRE+V    + V KI++  N AD  TK +    FE
Subjt:  EAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFE

Query:  GHLESLGL
           E +G+
Subjt:  GHLESLGL

P25600 Putative transposon Ty5-1 protein YCL074W2.6e-3634.19Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSL---VLGQSP---------KTPQEVEDMRRIPYASAVGSLMYAMLCT
           VK  L   + MKDLG+    LG+  I    N  + LS   YI K  + S +    L Q+P          T   ++D+   PY S VG L++     
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSL---VLGQSP---------KTPQEVEDMRRIPYASAVGSLMYAMLCT

Query:  RPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADSTMEAE
        RPDI Y V ++SR+   P   H  + + VL+YL  TR   L Y     L LT Y D+      D   ST G V  L G  V W S K +G I   + EAE
Subjt:  RPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADSTMEAE

Query:  YVAACEAAKE
        Y+ A E   E
Subjt:  YVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.2e-5833.83Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        +DV  AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA R+W +     + + GF  +V +  ++       + ++++YVDDIL+ GND   
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEVEDMRRI----------PYASAVGSLMYAMLCTRP
        L +    L+ +F +KD  E  Y LGI+    R    L LSQ  YI  LLA ++++  +   TP        +           Y   VGSL Y +  TRP
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEVEDMRRI----------PYASAVGSLMYAMLCTRP

Query:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVA
        DI YAV  +S++   P  +H  A+K +L+YL  T ++ + +     L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ EAEY +
Subjt:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVA

Query:  ACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESL
            + E  W+   L +L +   +  P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  TK L+   F+     +
Subjt:  ACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESL

Query:  GL
        G+
Subjt:  GL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.9e-5933.83Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY
        +DV  AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++       + ++++YVDDIL+ GND   
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQ------EVEDMRRIP----YASAVGSLMYAMLCTRP
        L      L+ +F +K+  +  Y LGI+    R  + L LSQ  Y   LLA ++++  +   TP        +    ++P    Y   VGSL Y +  TRP
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQ------EVEDMRRIP----YASAVGSLMYAMLCTRP

Query:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVA
        D+ YAV  +S+Y   P  DHW A+K VL+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +  S+ EAEY +
Subjt:  DICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVA

Query:  ACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESL
            + E  W+   L +L +   ++ P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  TK L+   F+     +
Subjt:  ACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESL

Query:  GL
        G+
Subjt:  GL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-5033.06Show/hide
Query:  MDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGN
        +D+  AFLNG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF Q+  +   + KI       +++YVDDI++  N
Subjt:  MDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGN

Query:  DVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEV----------EDMRRIPYASAVGSLMYAML
        +   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   LL  + L+  +    P +           + +    Y   +G LMY  +
Subjt:  DVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEV----------EDMRRIPYASAVGSLMYAML

Query:  CTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEA
         TR DI +AV  +S++   P L H  AV  +L Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L    + W+S KQ  ++ S+ EA
Subjt:  CTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEA

Query:  EYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        EY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  EYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.7e-0437.5Show/hide
Query:  TRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGA-KDLILTGYTDSDFQTDKDSRKSTSG
        TRPD+ +AV  +S++ S        AV  VL Y++ T    L Y A  DL L  + DSD+ +  D+R+S +G
Subjt:  TRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGA-KDLILTGYTDSDFQTDKDSRKSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.9e-2133.04Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEVE-----DMRRIP----
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L  + ++  +   TP  ++        + P    
Subjt:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEVE-----DMRRIP----

Query:  YASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRS
        + S VG+L Y  L TRPDI YAV IV +    P L  +  +K VL+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L    + W +
Subjt:  YASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRS

Query:  IKQGCIADSTMEAEYVAACEAAKEAVW
         +Q  ++ S+ E EY A    A E  W
Subjt:  IKQGCIADSTMEAEYVAACEAAKEAVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAA
TCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTGTATATA
AGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTAGCAGCA
CAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAA
ATTGTTGGCATGGAGTTCACTTGTCTTAGGACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATG
CTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTAT
CTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATC
GGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGTATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAG
CAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCT
AAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCA
CAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAA
TCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTGTATATA
AGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTAGCAGCA
CAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAA
ATTGTTGGCATGGAGTTCACTTGTCTTAGGACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATG
CTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTAT
CTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATC
GGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGTATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAG
CAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCT
AAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCA
CAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA
Protein sequenceShow/hide protein sequence
MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA
QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKY
LRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANS
KEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR