CuGenDBv2

Cmc06g0168171 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc06g0168171
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr06:21815288..21817744
RNA-Seq Expression: Cmc06g0168171
Synteny: Cmc06g0168171
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
  IPR043502 - DNA/RNA polymerase superfamily
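
The Genome location field above packs a sequence identifier and a coordinate range into one string. As a minimal sketch, the snippet below splits that string and derives the gene span. The names `Location` and `parse_location` are hypothetical helpers introduced only for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed location of the form seqid:start..end (hypothetical helper)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("CMiso1.1chr06:21815288..21817744")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span is 2457 bp. With half-open coordinates the `+ 1` would be dropped.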


Homology

GenBank top hits (e value, % identity, alignment)
KAA0042206.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 4.4e-238; % identity: 68.64)
Query:  CVVTDKNNQV----FMSGKREADNCYHWNSNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKE
        C +T +NN V     +   + ++ C        N+   T       W  KL +ISLRSLDKVI NEAVVGIPSLDINGKFFCGDC+VGKQTKTSHRRLKE
Subjt:  CVVTDKNNQV----FMSGKREADNCYHWNSNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKE

Query:  CYTIRVLKLLHLDLVGLMQTESLRGKKYVLVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFA
        CY IRVL+LLHLDL+G MQTESLRGKKYVLV+                                     IIK+RSDHGKEFDNEDLNNFCQT+GIHHEF 
Subjt:  CYTIRVLKLLHLDLVGLMQTESLRGKKYVLVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFA

Query:  APITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIK
        APIT QQNGVVERKNRTLQEMARVMIHA NLPLNF AEAVNT CHI  +      TI T Y L       YHRKWDVKSDQGIFLGYS NSRAYRVFNIK
Subjt:  APITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIK

Query:  SRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD---------------------------------------
        S TVME INV+VNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQ D AKT+SNITD                                       
Subjt:  SRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD---------------------------------------

Query:  --------------ELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKL
                      ELLQFK NN+WTLVPKPD ANIIGTKWIFKNKTDES SVIRN+ARLVAQGYAQV+GVD ++TFA VAR EAI LL SI+CFRKFKL
Subjt:  --------------ELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKL

Query:  FQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPK
        FQMDVKSAFLNGYLNEEVYVAQ + FVD EFPQYVYK NKALYGLKQAPRAWY+ LTMYL ERGYSRGE DKTLFINRTST LIVAQIYVDDIIFGGFPK
Subjt:  FQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPK

Query:  TLV--IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTV
        TLV   I++  KSEFEMSLVGELSCFL LQIKQR+EGIFISQEKY KNLVKKFGLD SQHKR    THAKI KDTV
Subjt:  TLV--IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTV

KAA0042877.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 95.89)
Query:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN
        MTNNRSFFTELEECA GRVTFGDRAKGK+IAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN
Subjt:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN

Query:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV
        SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSH+RLKECYTIRVLKLLHLDLVGLMQTE  +     
Subjt:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV

Query:  LVIIIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKG
           IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTL      
Subjt:  LVIIIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKG

Query:  RKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITDELLQF
            YHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITDE   F
Subjt:  RKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITDELLQF

Query:  KCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVY
        KCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVY
Subjt:  KCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVY

Query:  VAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSEFEMSLVG
        VAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSEFEMSLVG
Subjt:  VAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSEFEMSLVG

Query:  ELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVGV
        ELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVGV
Subjt:  ELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVGV

KAA0054435.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.1e-244; % identity: 67.14)
Query:  TNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNS
        T+ +     LE+     +TFGD A GK+IAKGNIDK                                        +TDKNNQV MSG+RE+DNCYHW+S
Subjt:  TNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNS

Query:  NGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYVL
        NGSNICHLTK  QTWLWHRKLG+ISLRSLDKVI NEAVVGIPSLDIN KFFCGDC+VGKQTK+SH +LKECYTIRVL+LLHLDL+G MQTESL GKKYVL
Subjt:  NGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYVL

Query:  VI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKN
        V+                                     II+IRSDH KEFDNEDLNNFCQ EGIHHE AAPITPQQNGVVERKNRTLQEMARVMIHAKN
Subjt:  VI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKN

Query:  LPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETH
        LPLNFWAEAVNTACHIHNRVTTRS T VTLYELWKGRKPN                    +R YRVFNIKS TVMETINV+VNDFE N+NQFNIEDDETH
Subjt:  LPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETH

Query:  VTPEVTSTPLDEMPKG--DSQLDHAKTD----SNITDELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETF
        VTPEVTSTPLDEMPK    + +++A  D    + + +ELLQFK NNVWTLVPKP+ AN                                VEGVD DETF
Subjt:  VTPEVTSTPLDEMPKG--DSQLDHAKTD----SNITDELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETF

Query:  ASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFIN
        A VAR EAI LL SI+CFRKFKLFQM VKSAFLNGYLNEEVYV QP+ FVD EFPQYVYKLNKALY LKQAP+AWY+ LTMYLGER YSRGETDKTLFIN
Subjt:  ASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFIN

Query:  RTSTDLIVAQIYVDDIIFGGFPKTLV--IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVG
        RTST+LIVAQIYVDDIIFGGFPKTLV   I++  KSEFEMSLVGELSCFLGLQIKQRSEG+FISQEKYAKNLVKKFGLDQSQHKRT   THAKITKD VG
Subjt:  RTSTDLIVAQIYVDDIIFGGFPKTLV--IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVG

KAA0059174.1 F5J5.1 [Cucumis melo var. makuwa] (e value: 3.4e-214; % identity: 56.9)
Query:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN
        MT+NRSFFTELEECASG V F D AKGK+IAKGNIDKSNLPCL++VRYVDGLK NLIS SQLCDQGYSVNFNNTGCVVT+KNNQVF+SG READNCYHW+
Subjt:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN

Query:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV
        SNGSNICHLTK  QTWLWHRKLG+ISLRSLDKVI NEA++GIPSLDINGKFFCGDC+VGKQTKTSHRRL ECYTI  L+LLHLDL+ LMQ ESL GKKYV
Subjt:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV

Query:  LVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAK
         V+                                     II+IRSDHGK+FDNE+LNNFCQTEGIHHEFAAPITPQQN VVERKNRTLQEMAR      
Subjt:  LVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAK

Query:  NLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDET
                                                                    N+RAYRVFNIKS TVMETINV+VNDFESNVNQFNIEDDET
Subjt:  NLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDET

Query:  HVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD--------------------------------------------------------------------
        HVTP+VTSTPLDEMPKGDSQ D+AKTDS ITD                                                                    
Subjt:  HVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD--------------------------------------------------------------------

Query:  -------ELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKS
               ELLQFK NNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQ                                        DV S
Subjt:  -------ELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKS

Query:  AFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISL
         FLNGYLNEEVYVAQP+ FVD+EFPQY+YKLNKALYGLKQAP                            R  TDLIVAQIYVDDIIFGGFPKTL     
Subjt:  AFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISL

Query:  TTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVG
                             IKQRSE +FISQEKY KNLVKKFGLD SQ+KRT   THAKITKDTVG
Subjt:  TTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVG

TYK26041.1 gag/pol polyprotein [Cucumis melo var. makuwa] (e value: 1.1e-238; % identity: 62.38)
Query:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN
        MT NRSFFTELEECASG VTFGD AKGK+IAKGN+DKSNLP ++EVRYVDGLK NLISVSQLCDQGYSVNFNNT CV TDKNNQVF+SG+REA+NC HW+
Subjt:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN

Query:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV
        SNGSNICHLTK  QTWLWHRKLG+ISLRSLDKVI N+AVVGIPSLDINGKFFCGDC+VGKQTK SHRRLKECYTIRVL+LLHLDL+G M+TESL  KKYV
Subjt:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV

Query:  LVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAK
        LV+                                     II+IRSDHGKEFDNEDLNNFCQTEGIHHEF APITPQQNGVVERKNRT            
Subjt:  LVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAK

Query:  NLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPN------------------YHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVM
                            VTTRS T + LYELWKGRKPN                  YHRKWDVKSDQ IFLGYSQNSR YRVFNIKS TVMETINV+
Subjt:  NLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPN------------------YHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVM

Query:  VNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD----------------------------------------------ELLQ
        VNDF+SNVNQFNIEDDETHVTPEVTS+PLDEMPKGDSQ D AKT+SNI D                                              ELLQ
Subjt:  VNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD----------------------------------------------ELLQ

Query:  FKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEV
        FK NNVWTLVPKPD AN+IGTKWIFKNKTDESG++                            +  A C ++ ++       F+  +K            
Subjt:  FKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEV

Query:  YVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLV--IISLTTKSEFEMS
                       YVYKLNKALYGLKQAP AWY+ LTMYLGERGYSRGETDKT+F+NRT+ DLIVAQIYVDDIIFGGFPKTLV   I++  KSEFEMS
Subjt:  YVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLV--IISLTTKSEFEMS

Query:  LVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQ
        LVGELSC LGLQIKQRSEG+FISQEKYA NLVKKFGLD+
Subjt:  LVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQ

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UVR7 F5J5.1 (e value: 1.6e-214; % identity: 56.9)
Query:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN
        MT+NRSFFTELEECASG V F D AKGK+IAKGNIDKSNLPCL++VRYVDGLK NLIS SQLCDQGYSVNFNNTGCVVT+KNNQVF+SG READNCYHW+
Subjt:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN

Query:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV
        SNGSNICHLTK  QTWLWHRKLG+ISLRSLDKVI NEA++GIPSLDINGKFFCGDC+VGKQTKTSHRRL ECYTI  L+LLHLDL+ LMQ ESL GKKYV
Subjt:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV

Query:  LVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAK
         V+                                     II+IRSDHGK+FDNE+LNNFCQTEGIHHEFAAPITPQQN VVERKNRTLQEMAR      
Subjt:  LVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAK

Query:  NLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDET
                                                                    N+RAYRVFNIKS TVMETINV+VNDFESNVNQFNIEDDET
Subjt:  NLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDET

Query:  HVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD--------------------------------------------------------------------
        HVTP+VTSTPLDEMPKGDSQ D+AKTDS ITD                                                                    
Subjt:  HVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD--------------------------------------------------------------------

Query:  -------ELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKS
               ELLQFK NNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQ                                        DV S
Subjt:  -------ELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKS

Query:  AFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISL
         FLNGYLNEEVYVAQP+ FVD+EFPQY+YKLNKALYGLKQAP                            R  TDLIVAQIYVDDIIFGGFPKTL     
Subjt:  AFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISL

Query:  TTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVG
                             IKQRSE +FISQEKY KNLVKKFGLD SQ+KRT   THAKITKDTVG
Subjt:  TTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVG

A0A5D3C1P5 Gag-pol polyprotein (e value: 0.0e+00; % identity: 95.89)
Query:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN
        MTNNRSFFTELEECA GRVTFGDRAKGK+IAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN
Subjt:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN

Query:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV
        SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSH+RLKECYTIRVLKLLHLDLVGLMQTE  +     
Subjt:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV

Query:  LVIIIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKG
           IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTL      
Subjt:  LVIIIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKG

Query:  RKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITDELLQF
            YHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITDE   F
Subjt:  RKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITDELLQF

Query:  KCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVY
        KCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVY
Subjt:  KCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVY

Query:  VAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSEFEMSLVG
        VAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSEFEMSLVG
Subjt:  VAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSEFEMSLVG

Query:  ELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVGV
        ELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVGV
Subjt:  ELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVGV

A0A5D3CS19 Gag-pol polyprotein (e value: 5.2e-245; % identity: 67.14)
Query:  TNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNS
        T+ +     LE+     +TFGD A GK+IAKGNIDK                                        +TDKNNQV MSG+RE+DNCYHW+S
Subjt:  TNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNS

Query:  NGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYVL
        NGSNICHLTK  QTWLWHRKLG+ISLRSLDKVI NEAVVGIPSLDIN KFFCGDC+VGKQTK+SH +LKECYTIRVL+LLHLDL+G MQTESL GKKYVL
Subjt:  NGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYVL

Query:  VI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKN
        V+                                     II+IRSDH KEFDNEDLNNFCQ EGIHHE AAPITPQQNGVVERKNRTLQEMARVMIHAKN
Subjt:  VI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKN

Query:  LPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETH
        LPLNFWAEAVNTACHIHNRVTTRS T VTLYELWKGRKPN                    +R YRVFNIKS TVMETINV+VNDFE N+NQFNIEDDETH
Subjt:  LPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVNDFESNVNQFNIEDDETH

Query:  VTPEVTSTPLDEMPKG--DSQLDHAKTD----SNITDELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETF
        VTPEVTSTPLDEMPK    + +++A  D    + + +ELLQFK NNVWTLVPKP+ AN                                VEGVD DETF
Subjt:  VTPEVTSTPLDEMPKG--DSQLDHAKTD----SNITDELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETF

Query:  ASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFIN
        A VAR EAI LL SI+CFRKFKLFQM VKSAFLNGYLNEEVYV QP+ FVD EFPQYVYKLNKALY LKQAP+AWY+ LTMYLGER YSRGETDKTLFIN
Subjt:  ASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFIN

Query:  RTSTDLIVAQIYVDDIIFGGFPKTLV--IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVG
        RTST+LIVAQIYVDDIIFGGFPKTLV   I++  KSEFEMSLVGELSCFLGLQIKQRSEG+FISQEKYAKNLVKKFGLDQSQHKRT   THAKITKD VG
Subjt:  RTSTDLIVAQIYVDDIIFGGFPKTLV--IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVG

A0A5D3DQT9 Gag/pol polyprotein (e value: 5.6e-239; % identity: 62.38)
Query:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN
        MT NRSFFTELEECASG VTFGD AKGK+IAKGN+DKSNLP ++EVRYVDGLK NLISVSQLCDQGYSVNFNNT CV TDKNNQVF+SG+REA+NC HW+
Subjt:  MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWN

Query:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV
        SNGSNICHLTK  QTWLWHRKLG+ISLRSLDKVI N+AVVGIPSLDINGKFFCGDC+VGKQTK SHRRLKECYTIRVL+LLHLDL+G M+TESL  KKYV
Subjt:  SNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYV

Query:  LVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAK
        LV+                                     II+IRSDHGKEFDNEDLNNFCQTEGIHHEF APITPQQNGVVERKNRT            
Subjt:  LVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAK

Query:  NLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPN------------------YHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVM
                            VTTRS T + LYELWKGRKPN                  YHRKWDVKSDQ IFLGYSQNSR YRVFNIKS TVMETINV+
Subjt:  NLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPN------------------YHRKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVM

Query:  VNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD----------------------------------------------ELLQ
        VNDF+SNVNQFNIEDDETHVTPEVTS+PLDEMPKGDSQ D AKT+SNI D                                              ELLQ
Subjt:  VNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD----------------------------------------------ELLQ

Query:  FKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEV
        FK NNVWTLVPKPD AN+IGTKWIFKNKTDESG++                            +  A C ++ ++       F+  +K            
Subjt:  FKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEV

Query:  YVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLV--IISLTTKSEFEMS
                       YVYKLNKALYGLKQAP AWY+ LTMYLGERGYSRGETDKT+F+NRT+ DLIVAQIYVDDIIFGGFPKTLV   I++  KSEFEMS
Subjt:  YVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLV--IISLTTKSEFEMS

Query:  LVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQ
        LVGELSC LGLQIKQRSEG+FISQEKYA NLVKKFGLD+
Subjt:  LVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQ

A0A5D3DSN1 Gag-pol polyprotein (e value: 2.1e-238; % identity: 68.64)
Query:  CVVTDKNNQV----FMSGKREADNCYHWNSNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKE
        C +T +NN V     +   + ++ C        N+   T       W  KL +ISLRSLDKVI NEAVVGIPSLDINGKFFCGDC+VGKQTKTSHRRLKE
Subjt:  CVVTDKNNQV----FMSGKREADNCYHWNSNGSNICHLTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKE

Query:  CYTIRVLKLLHLDLVGLMQTESLRGKKYVLVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFA
        CY IRVL+LLHLDL+G MQTESLRGKKYVLV+                                     IIK+RSDHGKEFDNEDLNNFCQT+GIHHEF 
Subjt:  CYTIRVLKLLHLDLVGLMQTESLRGKKYVLVI-------------------------------------IIKIRSDHGKEFDNEDLNNFCQTEGIHHEFA

Query:  APITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIK
        APIT QQNGVVERKNRTLQEMARVMIHA NLPLNF AEAVNT CHI  +      TI T Y L       YHRKWDVKSDQGIFLGYS NSRAYRVFNIK
Subjt:  APITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRVFNIK

Query:  SRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD---------------------------------------
        S TVME INV+VNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQ D AKT+SNITD                                       
Subjt:  SRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITD---------------------------------------

Query:  --------------ELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKL
                      ELLQFK NN+WTLVPKPD ANIIGTKWIFKNKTDES SVIRN+ARLVAQGYAQV+GVD ++TFA VAR EAI LL SI+CFRKFKL
Subjt:  --------------ELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKL

Query:  FQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPK
        FQMDVKSAFLNGYLNEEVYVAQ + FVD EFPQYVYK NKALYGLKQAPRAWY+ LTMYL ERGYSRGE DKTLFINRTST LIVAQIYVDDIIFGGFPK
Subjt:  FQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPK

Query:  TLV--IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTV
        TLV   I++  KSEFEMSLVGELSCFL LQIKQR+EGIFISQEKY KNLVKKFGLD SQHKR    THAKI KDTV
Subjt:  TLV--IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTV

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.4e-53; % identity: 24.66)
Query:  LDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNSNGSNICHLTKGYQTWLWHRKLGYIS----LRSLDKVIINEA
        L++V +      NL+SV +L + G S+ F+ +G V   KN  + +      +N    N    +I    K     LWH + G+IS    L    K + ++ 
Subjt:  LDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNSNGSNICHLTKGYQTWLWHRKLGYIS----LRSLDKVIINEA

Query:  VVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKE-CYTIRVLKLLHLDLVGLMQTESLRGKKYVLVII-----------IKIRS-----------------
         + + +L+++ +  C  C  GKQ +   ++LK+  +  R L ++H D+ G +   +L  K Y ++ +           IK +S                 
Subjt:  VVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKE-CYTIRVLKLLHLDLVGLMQTESLRGKKYVLVII-----------IKIRS-----------------

Query:  ---------DHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRS--DTIVTLYELW
                 D+G+E+ + ++  FC  +GI +    P TPQ NGV ER  RT+ E AR M+    L  +FW EAV TA ++ NR+ +R+  D+  T YE+W
Subjt:  ---------DHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRS--DTIVTLYELW

Query:  KGRKP-----------------NYHRKWDVKSDQGIFLGYSQ---------------------------NSRAYRV------------------------
          +KP                 N   K+D KS + IF+GY                             NSRA +                         
Subjt:  KGRKP-----------------NYHRKWDVKSDQGIFLGYSQ---------------------------NSRAYRV------------------------

Query:  ---------------------------FNIKSRTVMET-----------INVMVNDFESN----------------------------------------
                                   F   SR +++T           I  + +  ESN                                        
Subjt:  ---------------------------FNIKSRTVMET-----------INVMVNDFESN----------------------------------------

Query:  -------------VNQ------------FNIEDDETHVTPEVTSTPLDEMPKGDSQL----DHAKTDSNITDELLQFKCNNVWTLVPKPDEANIIGTKWI
                     +N+            +N ED+  +       T  +++P    ++    D +  +  I  EL   K NN WT+  +P+  NI+ ++W+
Subjt:  -------------VNQ------------FNIEDDETHVTPEVTSTPLDEMPKGDSQL----DHAKTDSNITDELLQFKCNNVWTLVPKPDEANIIGTKWI

Query:  FKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKAL
        F  K +E G+ IR KARLVA+G+ Q   +D +ETFA VAR  +   + S+      K+ QMDVK+AFLNG L EE+Y+  P+          V KLNKA+
Subjt:  FKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKAL

Query:  YGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFI--NRTSTDLIVAQIYVDDIIFG-GFPKTLVIISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFIS
        YGLKQA R W++     L E  +     D+ ++I       + I   +YVDD++   G    +         +F M+ + E+  F+G++I+ + + I++S
Subjt:  YGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFI--NRTSTDLIVAQIYVDDIIFG-GFPKTLVIISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFIS

Query:  QEKYAKNLVKKFGLD
        Q  Y K ++ KF ++
Subjt:  QEKYAKNLVKKFGLD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.3e-63; % identity: 26.82)
Query:  GRVTFGDRAKGKLIAKGNI-DKSNLPC---LDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNSNGSNICH----
        G V  G+ +  K+   G+I  K+N+ C   L +VR+V  L+ NLIS   L   GY   F N    +T K + V   G       Y  N   + IC     
Subjt:  GRVTFGDRAKGKLIAKGNI-DKSNLPC---LDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNSNGSNICH----

Query:  -LTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYVLVII---
                 LWH+++G++S + L  +     +       +     C  C  GKQ + S +   E   + +L L++ D+ G M+ ES+ G KY +  I   
Subjt:  -LTKGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYVLVII---

Query:  ----------------------------------IKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFW
                                           ++RSD+G E+ + +   +C + GI HE   P TPQ NGV ER NRT+ E  R M+    LP +FW
Subjt:  ----------------------------------IKIRSDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFW

Query:  AEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNY------------------HRKWDVKSDQGIFLGYSQNSRAYRVFN------IKSRTVM-------
         EAV TAC++ NR  +          +W  ++ +Y                    K D KS   IF+GY      YR+++      I+SR V+       
Subjt:  AEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNY------------------HRKWDVKSDQGIFLGYSQNSRAYRVFN------IKSRTVM-------

Query:  -------ETINVMVNDF-------------ESNVNQFN-------------------IEDDETHVTPEVTSTPL--DEMPKGDSQ---------------
               +  N ++ +F             ES  ++ +                   +E+ E     E    PL   E P+ +S+               
Subjt:  -------ETINVMVNDF-------------ESNVNQFN-------------------IEDDETHVTPEVTSTPL--DEMPKGDSQ---------------

Query:  -------LDHAKTD---SNITDELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSI
               L H + +     + +E+   + N  + LV  P     +  KW+FK K D    ++R KARLV +G+ Q +G+D DE F+ V +  +I  + S+
Subjt:  -------LDHAKTD---SNITDELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSI

Query:  ACFRKFKLFQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTS-TDLIVAQIYVD
        A     ++ Q+DVK+AFL+G L EE+Y+ QP  F        V KLNK+LYGLKQAPR WY     ++  + Y +  +D  ++  R S  + I+  +YVD
Subjt:  ACFRKFKLFQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTS-TDLIVAQIYVD

Query:  DIIFGGFPKTLVI-ISLTTKSEFEMSLVGELSCFLGLQI--KQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITK
        D++  G  K L+  +       F+M  +G     LG++I  ++ S  +++SQEKY + ++++F +  ++   T    H K++K
Subjt:  DIIFGGFPKTLVI-ISLTTKSEFEMSLVGELSCFLGLQI--KQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITK

P25600 Putative transposon Ty5-1 protein YCL074W (e value: 6.7e-16; % identity: 37.06)
Query:  MDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGG-FPKT
        MDV +AFLN  ++E +YV QP  FV+   P YV++L   +YGLKQAP  W + +   L + G+ R E +  L+   TS   I   +YVDD++     PK 
Subjt:  MDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGG-FPKT

Query:  LVIISLTTKSEFEMSLVGELSCFLGLQIKQRSEG-IFISQEKY
           +       + M  +G++  FLGL I Q S G I +S + Y
Subjt:  LVIISLTTKSEFEMSLVGELSCFLGLQIKQRSEG-IFISQEKY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.6e-41; % identity: 38.65)
Query:  NNVWTLV-PKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVYV
        N+ W LV P P    I+G +WIF  K +  GS+ R KARLVA+GY Q  G+D  ETF+ V +  +I ++  +A  R + + Q+DV +AFL G L ++VY+
Subjt:  NNVWTLV-PKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVYV

Query:  AQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSE-FEMSLVG
        +QP  F+D + P YV KL KALYGLKQAPRAWY  L  YL   G+    +D +LF+ +    ++   +YVDDI+  G   TL+  +L   S+ F +    
Subjt:  AQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSE-FEMSLVG

Query:  ELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKIT
        EL  FLG++ K+   G+ +SQ +Y  +L+ +  +  ++   T      K++
Subjt:  ELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKIT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.4e-15; % identity: 25.85)
Query:  LDEVRYVDGLKANLISVSQLCD-QGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNSNGSNICHLTKGYQTWL----WHRKLGYISLRSLDKVIINE
        L  + YV  +  NLISV +LC+  G SV F      V D N  V +   +  D  Y W    S    L     +      WH +LG+ +   L+ VI N 
Subjt:  LDEVRYVDGLKANLISVSQLCD-QGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNSNGSNICHLTKGYQTWL----WHRKLGYISLRSLDKVIINE

Query:  AVVGIPSLDINGKFF-CGDCKVGKQTK--------TSHRRLK--------------ECYTIRVLKLLHLD----LVGLMQTESLRGKKYVLVIIIKIR--
            +  L+ + KF  C DC + K  K         S R L+              + Y   V+ + H      L  L Q   ++        +++ R  
Subjt:  AVVGIPSLDINGKFF-CGDCKVGKQTK--------TSHRRLK--------------ECYTIRVLKLLHLD----LVGLMQTESLRGKKYVLVIIIKIR--

Query:  -------SDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGR
               SD+G EF    L  +    GI H  + P TP+ NG+ ERK+R + E    ++   ++P  +W  A   A ++ NR+ T    + + ++   G 
Subjt:  -------SDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGR

Query:  KPNYHR------------------KWDVKSDQGIFLGYSQNSRAYRVFNIKS
         PNY +                  K D KS Q +FLGYS    AY   ++++
Subjt:  KPNYHR------------------KWDVKSDQGIFLGYSQNSRAYRVFNIKS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2  e value: 1.1e-47  %identity: 23.83
Query:  LDEVRYVDGLKANLISVSQLCDQG-YSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNSNGSNICHL-----TKGYQTWLWHRKLGYISLRSLDKVIIN
        L++V YV  +  NLISV +LC+    SV F      V D N  V +   +  D  Y W    S    +     +K   +  WH +LG+ SL  L+ VI N
Subjt:  LDEVRYVDGLKANLISVSQLCDQG-YSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNSNGSNICHL-----TKGYQTWLWHRKLGYISLRSLDKVIIN

Query:  EAVVGIPSLDINGKFF-CGDCKVGKQTK--------TSHRRLKECYT-IRVLKLLHLD-----------------LVGLMQTESLRGKKYVLVIIIKIR-
         +   +P L+ + K   C DC + K  K        TS + L+  Y+ +    +L +D                 L  L Q   ++    +   +++ R 
Subjt:  EAVVGIPSLDINGKFF-CGDCKVGKQTK--------TSHRRLKECYT-IRVLKLLHLD-----------------LVGLMQTESLRGKKYVLVIIIKIR-

Query:  --------SDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKG
                SD+G EF    L ++    GI H  + P TP+ NG+ ERK+R + EM   ++   ++P  +W  A + A ++ NR+ T    + + ++   G
Subjt:  --------SDHGKEFDNEDLNNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKG

Query:  RKPNYHR------------------KWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVND--FESNVNQFNIEDDE------------------
        + PNY +                  K + KS Q  F+GYS    AY   +I +  +  + +V  ++  F  +   F +   +                  
Subjt:  RKPNYHR------------------KWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVMVND--FESNVNQFNIEDDE------------------

Query:  ------------------------------------------------------THVTPEVTSTP----------------------------LDEMPKG
                                                              +H  P+ T+ P                               +P+ 
Subjt:  ------------------------------------------------------THVTPEVTSTP----------------------------LDEMPKG

Query:  DSQLDHAKTDSNITDE----------------------LLQFKC--------------------------------------------------------
             H  T S    E                      ++Q                                                           
Subjt:  DSQLDHAKTDSNITDE----------------------LLQFKC--------------------------------------------------------

Query:  -----NNVWTLV-PKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLN
             N+ W LV P P    I+G +WIF  K +  GS+ R KARLVA+GY Q  G+D  ETF+ V +  +I ++  +A  R + + Q+DV +AFL G L 
Subjt:  -----NNVWTLV-PKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLN

Query:  EEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSE-FE
        +EVY++QP  FVD + P YV +L KA+YGLKQAPRAWY  L  YL   G+    +D +LF+ +    +I   +YVDDI+  G    L+  +L   S+ F 
Subjt:  EEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSE-FE

Query:  MSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKIT
        +    +L  FLG++ K+  +G+ +SQ +Y  +L+ +  +  ++   T   T  K+T
Subjt:  MSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKIT

Arabidopsis top hits (e value, %identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8  e value: 8.4e-38  %identity: 37.14
Query:  DELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGY
        DE+   +  + W +   P     IG KW++K K +  G++ R KARLVA+GY Q EG+D  ETF+ V +  ++ L+ +I+    F L Q+D+ +AFLNG 
Subjt:  DELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGY

Query:  LNEEVYVAQPREFV----DYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLV-IISLT
        L+EE+Y+  P  +     D   P  V  L K++YGLKQA R W+   ++ L   G+ +  +D T F+  T+T  +   +YVDDII        V  +   
Subjt:  LNEEVYVAQPREFV----DYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGETDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLV-IISLT

Query:  TKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGL
         KS F++  +G L  FLGL+I + + GI I Q KYA +L+ + GL
Subjt:  TKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGL

ATMG00810.1 DNA/RNA polymerases superfamily protein  e value: 6.5e-06  %identity: 38.81
Query:  IYVDDIIFGGFPKTLV-IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGL
        +YVDDI+  G   TL+ ++     S F M  +G +  FLG+QIK    G+F+SQ KYA+ ++   G+
Subjt:  IYVDDIIFGGFPKTLV-IISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)  e value: 1.4e-11  %identity: 42.5
Query:  DELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIA
        +EL     N  W LVP P   NI+G KW+FK K    G++ R KARLVA+G+ Q EG+   ET++ V R   I  + ++A
Subjt:  DELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKARLVAQGYAQVEGVDLDETFASVARFEAICLLFSIA


Sequences
CDS sequence
ATGACTAACAATCGATCTTTCTTTACTGAGTTAGAAGAATGTGCCTCAGGTCGTGTCACTTTTGGAGATAGGGCCAAAGGAAAACTTATTGCAAAAGGAAACATTGATAA
AAGTAATCTACCCTGTCTTGATGAAGTTAGATATGTGGATGGACTGAAGGCAAACTTGATTAGTGTAAGTCAACTATGTGACCAAGGATACAGTGTAAACTTTAACAATA
CTGGTTGTGTAGTTACAGACAAGAATAATCAAGTGTTCATGAGTGGCAAACGGGAAGCAGATAACTGTTATCATTGGAATTCCAATGGTTCAAACATATGTCACTTAACT
AAAGGTTATCAAACTTGGTTGTGGCATAGGAAATTGGGGTACATCAGCTTGAGAAGTTTAGATAAAGTTATCATAAACGAGGCGGTTGTAGGCATTCCTTCGTTAGACAT
AAATGGAAAATTCTTTTGTGGTGACTGTAAAGTTGGAAAGCAAACCAAAACCTCCCATAGAAGGCTAAAGGAATGTTATACAATCAGAGTACTTAAGCTTCTACATCTTG
ACCTCGTGGGTCTCATGCAAACTGAAAGTTTGAGGGGAAAGAAGTATGTGTTAGTTATTATAATCAAGATTCGTAGTGATCATGGGAAGGAATTTGATAATGAAGATCTG
AATAACTTCTGTCAGACTGAAGGAATCCATCATGAATTTGCAGCTCCCATAACTCCTCAGCAAAATGGAGTAGTTGAACGGAAGAACAGAACGTTACAAGAAATGGCTCG
AGTTATGATACATGCCAAAAATTTACCTTTGAATTTTTGGGCAGAAGCTGTAAACACAGCATGTCATATTCACAACAGGGTCACTACACGATCTGATACGATAGTTACTT
TGTATGAATTATGGAAGGGAAGGAAACCAAATTATCATCGTAAGTGGGATGTGAAATCTGATCAAGGGATCTTTCTTGGTTATTCTCAGAATAGTCGAGCGTACAGAGTC
TTCAATATTAAATCCAGAACAGTCATGGAAACAATCAATGTTATGGTTAATGATTTTGAGTCTAATGTCAATCAATTTAATATTGAGGATGATGAGACCCATGTGACACC
CGAAGTTACTTCTACTCCCCTTGACGAAATGCCTAAAGGTGATTCACAGCTAGACCATGCTAAGACCGATTCAAACATAACTGATGAGTTATTACAGTTCAAGTGTAACA
ACGTTTGGACTTTGGTTCCTAAACCTGATGAGGCAAACATCATAGGAACTAAGTGGATCTTTAAAAATAAAACTGATGAATCTGGAAGTGTAATAAGGAACAAGGCCCGT
TTGGTAGCTCAAGGTTATGCACAGGTAGAAGGTGTTGATTTAGATGAAACTTTTGCATCTGTGGCTAGATTTGAAGCTATTTGCCTCTTGTTCAGTATAGCATGTTTTCG
AAAATTTAAATTATTTCAAATGGACGTTAAAAGTGCCTTCCTGAATGGATACTTAAATGAGGAAGTCTATGTAGCACAACCTAGAGAGTTTGTTGATTATGAATTTCCTC
AGTATGTCTACAAACTGAATAAAGCTCTATATGGGTTAAAGCAAGCTCCTCGGGCTTGGTATAAAGGACTAACAATGTATCTTGGTGAAAGAGGATATTCTAGAGGTGAG
ACTGACAAGACACTATTCATAAATAGAACCAGCACTGATCTCATTGTAGCTCAAATTTATGTTGATGACATCATCTTTGGTGGATTTCCTAAAACACTTGTCATAATTTC
ATTAACAACGAAATCAGAATTCGAAATGAGCCTAGTAGGTGAACTGTCCTGCTTCCTGGGATTGCAGATCAAACAACGAAGTGAGGGAATATTTATATCACAAGAGAAGT
ATGCCAAGAACTTAGTCAAGAAGTTTGGTCTAGATCAGTCACAACACAAAAGGACTTCTACTACGACTCATGCTAAAATTACGAAGGATACGGTAGGTGTTGGATTTTAT
GTCCTAAAACTCGTAGATAGTAAATATAATCAATTGACTGCCATTAATAAAGTGTTTTATTATTATAATTTCAATAAGTGTTATTGA
mRNA sequence
ATGACTAACAATCGATCTTTCTTTACTGAGTTAGAAGAATGTGCCTCAGGTCGTGTCACTTTTGGAGATAGGGCCAAAGGAAAACTTATTGCAAAAGGAAACATTGATAA
AAGTAATCTACCCTGTCTTGATGAAGTTAGATATGTGGATGGACTGAAGGCAAACTTGATTAGTGTAAGTCAACTATGTGACCAAGGATACAGTGTAAACTTTAACAATA
CTGGTTGTGTAGTTACAGACAAGAATAATCAAGTGTTCATGAGTGGCAAACGGGAAGCAGATAACTGTTATCATTGGAATTCCAATGGTTCAAACATATGTCACTTAACT
AAAGGTTATCAAACTTGGTTGTGGCATAGGAAATTGGGGTACATCAGCTTGAGAAGTTTAGATAAAGTTATCATAAACGAGGCGGTTGTAGGCATTCCTTCGTTAGACAT
AAATGGAAAATTCTTTTGTGGTGACTGTAAAGTTGGAAAGCAAACCAAAACCTCCCATAGAAGGCTAAAGGAATGTTATACAATCAGAGTACTTAAGCTTCTACATCTTG
ACCTCGTGGGTCTCATGCAAACTGAAAGTTTGAGGGGAAAGAAGTATGTGTTAGTTATTATAATCAAGATTCGTAGTGATCATGGGAAGGAATTTGATAATGAAGATCTG
AATAACTTCTGTCAGACTGAAGGAATCCATCATGAATTTGCAGCTCCCATAACTCCTCAGCAAAATGGAGTAGTTGAACGGAAGAACAGAACGTTACAAGAAATGGCTCG
AGTTATGATACATGCCAAAAATTTACCTTTGAATTTTTGGGCAGAAGCTGTAAACACAGCATGTCATATTCACAACAGGGTCACTACACGATCTGATACGATAGTTACTT
TGTATGAATTATGGAAGGGAAGGAAACCAAATTATCATCGTAAGTGGGATGTGAAATCTGATCAAGGGATCTTTCTTGGTTATTCTCAGAATAGTCGAGCGTACAGAGTC
TTCAATATTAAATCCAGAACAGTCATGGAAACAATCAATGTTATGGTTAATGATTTTGAGTCTAATGTCAATCAATTTAATATTGAGGATGATGAGACCCATGTGACACC
CGAAGTTACTTCTACTCCCCTTGACGAAATGCCTAAAGGTGATTCACAGCTAGACCATGCTAAGACCGATTCAAACATAACTGATGAGTTATTACAGTTCAAGTGTAACA
ACGTTTGGACTTTGGTTCCTAAACCTGATGAGGCAAACATCATAGGAACTAAGTGGATCTTTAAAAATAAAACTGATGAATCTGGAAGTGTAATAAGGAACAAGGCCCGT
TTGGTAGCTCAAGGTTATGCACAGGTAGAAGGTGTTGATTTAGATGAAACTTTTGCATCTGTGGCTAGATTTGAAGCTATTTGCCTCTTGTTCAGTATAGCATGTTTTCG
AAAATTTAAATTATTTCAAATGGACGTTAAAAGTGCCTTCCTGAATGGATACTTAAATGAGGAAGTCTATGTAGCACAACCTAGAGAGTTTGTTGATTATGAATTTCCTC
AGTATGTCTACAAACTGAATAAAGCTCTATATGGGTTAAAGCAAGCTCCTCGGGCTTGGTATAAAGGACTAACAATGTATCTTGGTGAAAGAGGATATTCTAGAGGTGAG
ACTGACAAGACACTATTCATAAATAGAACCAGCACTGATCTCATTGTAGCTCAAATTTATGTTGATGACATCATCTTTGGTGGATTTCCTAAAACACTTGTCATAATTTC
ATTAACAACGAAATCAGAATTCGAAATGAGCCTAGTAGGTGAACTGTCCTGCTTCCTGGGATTGCAGATCAAACAACGAAGTGAGGGAATATTTATATCACAAGAGAAGT
ATGCCAAGAACTTAGTCAAGAAGTTTGGTCTAGATCAGTCACAACACAAAAGGACTTCTACTACGACTCATGCTAAAATTACGAAGGATACGGTAGGTGTTGGATTTTAT
GTCCTAAAACTCGTAGATAGTAAATATAATCAATTGACTGCCATTAATAAAGTGTTTTATTATTATAATTTCAATAAGTGTTATTGA
Protein sequence
MTNNRSFFTELEECASGRVTFGDRAKGKLIAKGNIDKSNLPCLDEVRYVDGLKANLISVSQLCDQGYSVNFNNTGCVVTDKNNQVFMSGKREADNCYHWNSNGSNICHLT
KGYQTWLWHRKLGYISLRSLDKVIINEAVVGIPSLDINGKFFCGDCKVGKQTKTSHRRLKECYTIRVLKLLHLDLVGLMQTESLRGKKYVLVIIIKIRSDHGKEFDNEDL
NNFCQTEGIHHEFAAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLNFWAEAVNTACHIHNRVTTRSDTIVTLYELWKGRKPNYHRKWDVKSDQGIFLGYSQNSRAYRV
FNIKSRTVMETINVMVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGDSQLDHAKTDSNITDELLQFKCNNVWTLVPKPDEANIIGTKWIFKNKTDESGSVIRNKAR
LVAQGYAQVEGVDLDETFASVARFEAICLLFSIACFRKFKLFQMDVKSAFLNGYLNEEVYVAQPREFVDYEFPQYVYKLNKALYGLKQAPRAWYKGLTMYLGERGYSRGE
TDKTLFINRTSTDLIVAQIYVDDIIFGGFPKTLVIISLTTKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNLVKKFGLDQSQHKRTSTTTHAKITKDTVGVGFY
VLKLVDSKYNQLTAINKVFYYYNFNKCY