CuGenDBv2

Tan0019651 (gene) of Snake gourd v1 genome

Gene ID: Tan0019651
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG01:99041005..99043744
RNA-Seq Expression: Tan0019651
Synteny: Tan0019651
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR001878 - Zinc finger, CCHC-type
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR036397 - Ribonuclease H superfamily
  IPR036875 - Zinc finger, CCHC-type superfamily
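
The genome location in this record is a single `SEQID:START..END` string. As a minimal sketch, the span it covers could be derived as below. The helper names `parse_location` and `span_length` are hypothetical, and the calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG01:99041005..99043744")
print(seqid, start, end, span_length(start, end))
```

If the database instead used 0-based, half-open coordinates, the `+ 1` in `span_length` would have to be dropped.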


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] | e value: 8.8e-258 | identity: 55.03%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        M++SI+ LL S++L  +N++ WKSNLNTILVVDDLRF+LTEECPQ PA NA ++V+EAYDRW+KANDKA+VYILAS+++VL KKH+ + +A+ IM SL++
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S  +RHE++K+IY  RMKEG+ VREHVLD+M+HFN+AE+NG  ID+                    NA +NKIE+NLTTLLNELQ FQ+L  +KG
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG

Query:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------
        + + EAN+ V  R+F +GSSS  K      G  K Q KK  GKGKAP   K K   K  DKGKCFHCN DGHWKRNCPKYLAE K +K            
Subjt:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------

Query:  ----------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-----------------------------------------
                        ATNH+C S +ETSS+K+L+EGE+TL+VGTG+VVSA AVGD  L                                         
Subjt:  ----------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-----------------------------------------

Query:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF
                                                                  LGHINLNRIERL K+G+LN+LED SLPPCESCLEGKMTKR F
Subjt:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF

Query:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK
        TGKG RA  PLELVHSDLCGP+NVKAR GYEYFISFIDD+SRYG++           +FK YK EVEN +GKTIKTLRSDRGGEYMD +FQDY+IE GI+
Subjt:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK

Query:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL
        SQLS P+TP QNGVSERRNRTLLDM  ++          +G  L                                        HV V NPKKLEPRS+L
Subjt:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL

Query:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRY
        C FVGYPKE+RGGLFY PQENKV VSTN TFLEEDH RNH+P SK+VL E     T   D+   S++V + AN S QS  SQ L +PRRSGRVV QPNRY
Subjt:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRY

Query:  LGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEE
        LGL ETQ++IPDD VEDPL+YKQ M+DVD+DQWIKAM+LEMESM FNSVW LVD P  V+PIGCKWIYKRKRD AGKVQT+KARLVAKG+TQ+E VDYEE
Subjt:  LGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEE

Query:  TFFPVAMLKSIRILLSIAAFYDYEI
        TF PVAMLKSIRILLSIA FY+YEI
Subjt:  TFFPVAMLKSIRILLSIAAFYDYEI

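The three-line blocks in these alignments appear to follow the usual BLAST-style layout, in which the unlabeled middle line carries a letter where the two residues are identical, a `+` where the substitution is conservative, and a space otherwise. Assuming that convention holds here (the page does not say so), a per-block tally could be sketched as follows. The function name `midline_stats` is hypothetical, and a per-block figure will generally differ from the reported % identity, which is presumably computed over the full alignment.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    ASSUMES a BLAST-style midline: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the first GenBank hit, copied from the alignment above.
query = "TFFPVAMLKSIRILLSIAAFYDYEI"
midline = "TF PVAMLKSIRILLSIA FY+YEI"
identities, positives, columns = midline_stats(query, midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

Midline rows whose trailing spaces were stripped during extraction would need to be right-padded to the query length before being passed in.
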
KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.4e-250 | identity: 53.5%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L   N+ +WK+ +NT+L++DDLRF+L EECPQVPA NA ++V+E Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM SLQ+
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKYIYN+RM EG+ VREHVL++MVHFNVAEMNGAVID+                    NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG

Query:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------
        Q  GEAN+   +R+F +GS+SGTKS  S SG KK +KKK G   KA  A     K     KG CFHCN +GHWKRNCPKYLAE K+ K            
Subjt:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------

Query:  --------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-------------------------------------------
                      ATNHVCSS +  SS+++LE GEMT+RVGTG VVSA AVG  +L                                           
Subjt:  --------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-------------------------------------------

Query:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF
                                                                  LGHINLNRIERL KNGLL++LE+ SLP CESCLEGKMTKR F
Subjt:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF

Query:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK
        TGKG+RA EPLELVHSDLCGP+NVKAR G+EYFI+F DDYSRYGY+           +FK YK EVENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI 
Subjt:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK

Query:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL
        SQLS P TP QNGVSERRNRTLLDM  ++          +G  ++                                       HV  +NPKKLEPRS+L
Subjt:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL

Query:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----EGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQ
        C FVGYPK TRGG FYDP++NKV VSTN TFLEEDH+R HKP SK+VLN    E T+  TRVV++    +RV     +S+++   QSL  PRRSGRV   
Subjt:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----EGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQ

Query:  PNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEV
        P RY+ L ET  VI D D+EDPL++K+ M DVDKD+WIKAM+LE+ESM FNSVW+LVDQP+GV+PIGCKWIYKRKR A GKVQT+KARLVAKG+TQ E V
Subjt:  PNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEV

Query:  DYEETFFPVAMLKSIRILLSIAAFYDYEI
        DYEETF PVAMLKSIRILLSIAA++DYEI
Subjt:  DYEETFFPVAMLKSIRILLSIAAFYDYEI

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 4.0e-258 | identity: 64.1%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        MSSSIIALLK D+LT EN+ TWKS LN ILV+ DL F+L EECP  P ++A QSV++AYDRW KANDKA+++ILAS+S++L KKHE MV+AR+IM SL++
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDDNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLCVHSRRFQKGSS
        MFGQ S QI+ E+                          NVA                                                HS+R      
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDDNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLCVHSRRFQKGSS

Query:  SGTKSCGSFSGLKKTQKKKIGGKGKAPA-ADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK-ATNHVCSSLRETSSFKELEEGEMTLRVGTG
           +   S SG +K QK+K  GKGK P  A + KGK KV  K KCFHCNVD HWK NCPKYL + KEK+ ATNHVCSSL+ETSSFK+LE+ EMTL+VGTG
Subjt:  SGTKSCGSFSGLKKTQKKKIGGKGKAPA-ADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK-ATNHVCSSLRETSSFKELEEGEMTLRVGTG

Query:  DVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGY
        DV+SARAVGDAK LGHINL+RI RL KNGLLNKL+DVSLPPCESCLEGKMTKR FTGKGYRA EPLEL+HSDLCGP+NVKAR G+EYFISFIDDYSRYGY
Subjt:  DVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGY

Query:  L-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTL---------------E
        L           +FK YK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+SQLS P TP QNGVSERRNRTLLDM  ++               E
Subjt:  L-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTL---------------E

Query:  FGVVL---------------------------------RHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSK
          V +                                  HV V+NPKKLEPRSRLCQFVGYPKETRGGLF+DPQEN+V VSTN TFLEEDHMRNHKP SK
Subjt:  FGVVL---------------------------------RHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSK

Query:  LVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMD
        LVL+E TDE TRVVD+ GPSSRVDE   TS QS PSQSL MPRRSGRVV QPNRYLGL ETQVVIPDD VEDPLSYKQ M+DVDKDQW+KAMDLEMESM 
Subjt:  LVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMD

Query:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI
        FNSVWELVD PEGV+PIGCKWIYKRKRD+AGKVQT+KARLVAKG+TQRE VDYEETF PVAMLKSIRILLSIA FYDYEI
Subjt:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 4.0e-258 | identity: 64.1%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        MSSSIIALLK D+LT EN+ TWKS LN ILV+ DL F+L EECP  P ++A QSV++AYDRW KANDKA+++ILAS+S++L KKHE MV+AR+IM SL++
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDDNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLCVHSRRFQKGSS
        MFGQ S QI+ E+                          NVA                                                HS+R      
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDDNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLCVHSRRFQKGSS

Query:  SGTKSCGSFSGLKKTQKKKIGGKGKAPA-ADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK-ATNHVCSSLRETSSFKELEEGEMTLRVGTG
           +   S SG +K QK+K  GKGK P  A + KGK KV  K KCFHCNVD HWK NCPKYL + KEK+ ATNHVCSSL+ETSSFK+LE+ EMTL+VGTG
Subjt:  SGTKSCGSFSGLKKTQKKKIGGKGKAPA-ADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK-ATNHVCSSLRETSSFKELEEGEMTLRVGTG

Query:  DVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGY
        DV+SARAVGDAK LGHINL+RI RL KNGLLNKL+DVSLPPCESCLEGKMTKR FTGKGYRA EPLEL+HSDLCGP+NVKAR G+EYFISFIDDYSRYGY
Subjt:  DVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGY

Query:  L-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTL---------------E
        L           +FK YK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+SQLS P TP QNGVSERRNRTLLDM  ++               E
Subjt:  L-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTL---------------E

Query:  FGVVL---------------------------------RHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSK
          V +                                  HV V+NPKKLEPRSRLCQFVGYPKETRGGLF+DPQEN+V VSTN TFLEEDHMRNHKP SK
Subjt:  FGVVL---------------------------------RHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSK

Query:  LVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMD
        LVL+E TDE TRVVD+ GPSSRVDE   TS QS PSQSL MPRRSGRVV QPNRYLGL ETQVVIPDD VEDPLSYKQ M+DVDKDQW+KAMDLEMESM 
Subjt:  LVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMD

Query:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI
        FNSVWELVD PEGV+PIGCKWIYKRKRD+AGKVQT+KARLVAKG+TQRE VDYEETF PVAMLKSIRILLSIA FYDYEI
Subjt:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.4e-250 | identity: 53.5%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L   N+ +WK+ +NT+L++DDLRF+L EECPQVPA NA ++V+E Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM SLQ+
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKYIYN+RM EG+ VREHVL++MVHFNVAEMNGAVID+                    NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG

Query:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------
        Q  GEAN+   +R+F +GS+SGTKS  S SG KK +KKK G   KA  A     K     KG CFHCN +GHWKRNCPKYLAE K+ K            
Subjt:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------

Query:  --------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-------------------------------------------
                      ATNHVCSS +  SS+++LE GEMT+RVGTG VVSA AVG  +L                                           
Subjt:  --------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-------------------------------------------

Query:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF
                                                                  LGHINLNRIERL KNGLL++LE+ SLP CESCLEGKMTKR F
Subjt:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF

Query:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK
        TGKG+RA EPLELVHSDLCGP+NVKAR G+EYFI+F DDYSRYGY+           +FK YK EVENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI 
Subjt:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK

Query:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL
        SQLS P TP QNGVSERRNRTLLDM  ++          +G  ++                                       HV  +NPKKLEPRS+L
Subjt:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL

Query:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----EGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQ
        C FVGYPK TRGG FYDP++NKV VSTN TFLEEDH+R HKP SK+VLN    E T+  TRVV++    +RV     +S+++   QSL  PRRSGRV   
Subjt:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----EGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQ

Query:  PNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEV
        P RY+ L ET  VI D D+EDPL++K+ M DVDKD+WIKAM+LE+ESM FNSVW+LVDQP+GV+PIGCKWIYKRKR A GKVQT+KARLVAKG+TQ E V
Subjt:  PNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEV

Query:  DYEETFFPVAMLKSIRILLSIAAFYDYEI
        DYEETF PVAMLKSIRILLSIAA++DYEI
Subjt:  DYEETFFPVAMLKSIRILLSIAAFYDYEI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein | e value: 6.6e-251 | identity: 53.5%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L   N+ +WK+ +NT+L++DDLRF+L EECPQVPA NA ++V+E Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM SLQ+
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKYIYN+RM EG+ VREHVL++MVHFNVAEMNGAVID+                    NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG

Query:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------
        Q  GEAN+   +R+F +GS+SGTKS  S SG KK +KKK G   KA  A     K     KG CFHCN +GHWKRNCPKYLAE K+ K            
Subjt:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------

Query:  --------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-------------------------------------------
                      ATNHVCSS +  SS+++LE GEMT+RVGTG VVSA AVG  +L                                           
Subjt:  --------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-------------------------------------------

Query:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF
                                                                  LGHINLNRIERL KNGLL++LE+ SLP CESCLEGKMTKR F
Subjt:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF

Query:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK
        TGKG+RA EPLELVHSDLCGP+NVKAR G+EYFI+F DDYSRYGY+           +FK YK EVENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI 
Subjt:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK

Query:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL
        SQLS P TP QNGVSERRNRTLLDM  ++          +G  ++                                       HV  +NPKKLEPRS+L
Subjt:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL

Query:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----EGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQ
        C FVGYPK TRGG FYDP++NKV VSTN TFLEEDH+R HKP SK+VLN    E T+  TRVV++    +RV     +S+++   QSL  PRRSGRV   
Subjt:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----EGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQ

Query:  PNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEV
        P RY+ L ET  VI D D+EDPL++K+ M DVDKD+WIKAM+LE+ESM FNSVW+LVDQP+GV+PIGCKWIYKRKR A GKVQT+KARLVAKG+TQ E V
Subjt:  PNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEV

Query:  DYEETFFPVAMLKSIRILLSIAAFYDYEI
        DYEETF PVAMLKSIRILLSIAA++DYEI
Subjt:  DYEETFFPVAMLKSIRILLSIAAFYDYEI

A0A5A7UYE8 Gag/pol protein | e value: 1.9e-258 | identity: 64.1%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        MSSSIIALLK D+LT EN+ TWKS LN ILV+ DL F+L EECP  P ++A QSV++AYDRW KANDKA+++ILAS+S++L KKHE MV+AR+IM SL++
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDDNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLCVHSRRFQKGSS
        MFGQ S QI+ E+                          NVA                                                HS+R      
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDDNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLCVHSRRFQKGSS

Query:  SGTKSCGSFSGLKKTQKKKIGGKGKAPA-ADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK-ATNHVCSSLRETSSFKELEEGEMTLRVGTG
           +   S SG +K QK+K  GKGK P  A + KGK KV  K KCFHCNVD HWK NCPKYL + KEK+ ATNHVCSSL+ETSSFK+LE+ EMTL+VGTG
Subjt:  SGTKSCGSFSGLKKTQKKKIGGKGKAPA-ADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK-ATNHVCSSLRETSSFKELEEGEMTLRVGTG

Query:  DVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGY
        DV+SARAVGDAK LGHINL+RI RL KNGLLNKL+DVSLPPCESCLEGKMTKR FTGKGYRA EPLEL+HSDLCGP+NVKAR G+EYFISFIDDYSRYGY
Subjt:  DVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGY

Query:  L-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTL---------------E
        L           +FK YK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+SQLS P TP QNGVSERRNRTLLDM  ++               E
Subjt:  L-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTL---------------E

Query:  FGVVL---------------------------------RHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSK
          V +                                  HV V+NPKKLEPRSRLCQFVGYPKETRGGLF+DPQEN+V VSTN TFLEEDHMRNHKP SK
Subjt:  FGVVL---------------------------------RHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSK

Query:  LVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMD
        LVL+E TDE TRVVD+ GPSSRVDE   TS QS PSQSL MPRRSGRVV QPNRYLGL ETQVVIPDD VEDPLSYKQ M+DVDKDQW+KAMDLEMESM 
Subjt:  LVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMD

Query:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI
        FNSVWELVD PEGV+PIGCKWIYKRKRD+AGKVQT+KARLVAKG+TQRE VDYEETF PVAMLKSIRILLSIA FYDYEI
Subjt:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI

A0A5D3BUN8 Gag/pol protein | e value: 1.9e-258 | identity: 64.1%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        MSSSIIALLK D+LT EN+ TWKS LN ILV+ DL F+L EECP  P ++A QSV++AYDRW KANDKA+++ILAS+S++L KKHE MV+AR+IM SL++
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDDNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLCVHSRRFQKGSS
        MFGQ S QI+ E+                          NVA                                                HS+R      
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDDNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLCVHSRRFQKGSS

Query:  SGTKSCGSFSGLKKTQKKKIGGKGKAPA-ADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK-ATNHVCSSLRETSSFKELEEGEMTLRVGTG
           +   S SG +K QK+K  GKGK P  A + KGK KV  K KCFHCNVD HWK NCPKYL + KEK+ ATNHVCSSL+ETSSFK+LE+ EMTL+VGTG
Subjt:  SGTKSCGSFSGLKKTQKKKIGGKGKAPA-ADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK-ATNHVCSSLRETSSFKELEEGEMTLRVGTG

Query:  DVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGY
        DV+SARAVGDAK LGHINL+RI RL KNGLLNKL+DVSLPPCESCLEGKMTKR FTGKGYRA EPLEL+HSDLCGP+NVKAR G+EYFISFIDDYSRYGY
Subjt:  DVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGY

Query:  L-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTL---------------E
        L           +FK YK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+SQLS P TP QNGVSERRNRTLLDM  ++               E
Subjt:  L-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTL---------------E

Query:  FGVVL---------------------------------RHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSK
          V +                                  HV V+NPKKLEPRSRLCQFVGYPKETRGGLF+DPQEN+V VSTN TFLEEDHMRNHKP SK
Subjt:  FGVVL---------------------------------RHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSK

Query:  LVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMD
        LVL+E TDE TRVVD+ GPSSRVDE   TS QS PSQSL MPRRSGRVV QPNRYLGL ETQVVIPDD VEDPLSYKQ M+DVDKDQW+KAMDLEMESM 
Subjt:  LVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMD

Query:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI
        FNSVWELVD PEGV+PIGCKWIYKRKRD+AGKVQT+KARLVAKG+TQRE VDYEETF PVAMLKSIRILLSIA FYDYEI
Subjt:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI

A0A5D3CPJ6 Gag/pol protein | e value: 6.6e-251 | identity: 53.5%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        M+S+ + +L +D+L   N+ +WK+ +NT+L++DDLRF+L EECPQVPA NA ++V+E Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM SLQ+
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S QI+H++LKYIYN+RM EG+ VREHVL++MVHFNVAEMNGAVID+                    NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG

Query:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------
        Q  GEAN+   +R+F +GS+SGTKS  S SG KK +KKK G   KA  A     K     KG CFHCN +GHWKRNCPKYLAE K+ K            
Subjt:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------

Query:  --------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-------------------------------------------
                      ATNHVCSS +  SS+++LE GEMT+RVGTG VVSA AVG  +L                                           
Subjt:  --------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-------------------------------------------

Query:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF
                                                                  LGHINLNRIERL KNGLL++LE+ SLP CESCLEGKMTKR F
Subjt:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF

Query:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK
        TGKG+RA EPLELVHSDLCGP+NVKAR G+EYFI+F DDYSRYGY+           +FK YK EVENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI 
Subjt:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK

Query:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL
        SQLS P TP QNGVSERRNRTLLDM  ++          +G  ++                                       HV  +NPKKLEPRS+L
Subjt:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL

Query:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----EGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQ
        C FVGYPK TRGG FYDP++NKV VSTN TFLEEDH+R HKP SK+VLN    E T+  TRVV++    +RV     +S+++   QSL  PRRSGRV   
Subjt:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----EGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQ

Query:  PNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEV
        P RY+ L ET  VI D D+EDPL++K+ M DVDKD+WIKAM+LE+ESM FNSVW+LVDQP+GV+PIGCKWIYKRKR A GKVQT+KARLVAKG+TQ E V
Subjt:  PNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEV

Query:  DYEETFFPVAMLKSIRILLSIAAFYDYEI
        DYEETF PVAMLKSIRILLSIAA++DYEI
Subjt:  DYEETFFPVAMLKSIRILLSIAAFYDYEI

E2GK51 Gag/pol protein (Fragment) | e value: 4.3e-258 | identity: 55.03%
Query:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD
        M++SI+ LL S++L  +N++ WKSNLNTILVVDDLRF+LTEECPQ PA NA ++V+EAYDRW+KANDKA+VYILAS+++VL KKH+ + +A+ IM SL++
Subjt:  MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQD

Query:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S  +RHE++K+IY  RMKEG+ VREHVLD+M+HFN+AE+NG  ID+                    NA +NKIE+NLTTLLNELQ FQ+L  +KG
Subjt:  MFGQTSGQIRHESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDD--------------------NAVMNKIEYNLTTLLNELQTFQSLMKNKG

Query:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------
        + + EAN+ V  R+F +GSSS  K      G  K Q KK  GKGKAP   K K   K  DKGKCFHCN DGHWKRNCPKYLAE K +K            
Subjt:  QADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKIGGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKK------------

Query:  ----------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-----------------------------------------
                        ATNH+C S +ETSS+K+L+EGE+TL+VGTG+VVSA AVGD  L                                         
Subjt:  ----------------ATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKL-----------------------------------------

Query:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF
                                                                  LGHINLNRIERL K+G+LN+LED SLPPCESCLEGKMTKR F
Subjt:  ----------------------------------------------------------LGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLF

Query:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK
        TGKG RA  PLELVHSDLCGP+NVKAR GYEYFISFIDD+SRYG++           +FK YK EVEN +GKTIKTLRSDRGGEYMD +FQDY+IE GI+
Subjt:  TGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIK

Query:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL
        SQLS P+TP QNGVSERRNRTLLDM  ++          +G  L                                        HV V NPKKLEPRS+L
Subjt:  SQLSTPNTPHQNGVSERRNRTLLDMFTTLE---------FGVVLR---------------------------------------HVSVSNPKKLEPRSRL

Query:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRY
        C FVGYPKE+RGGLFY PQENKV VSTN TFLEEDH RNH+P SK+VL E     T   D+   S++V + AN S QS  SQ L +PRRSGRVV QPNRY
Subjt:  CQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLNEGTDEPTRVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRY

Query:  LGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEE
        LGL ETQ++IPDD VEDPL+YKQ M+DVD+DQWIKAM+LEMESM FNSVW LVD P  V+PIGCKWIYKRKRD AGKVQT+KARLVAKG+TQ+E VDYEE
Subjt:  LGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEE

Query:  TFFPVAMLKSIRILLSIAAFYDYEI
        TF PVAMLKSIRILLSIA FY+YEI
Subjt:  TFFPVAMLKSIRILLSIAAFYDYEI

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 3.1e-27 | identity: 23.62%
Query:  VGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTG-KGYRAIE-PLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYLEFKV
        + D KLL    + R    S   LLN LE +S   CE CL GK  +  F   K    I+ PL +VHSD+CGP+     +   YF+ F+D ++ Y       
Subjt:  VGDAKLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTG-KGYRAIE-PLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYLEFKV

Query:  YKVEV-----------ENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTLEFGV-----------------
        YK +V           E      +  L  D G EY+    + + ++ GI   L+ P+TP  NGVSER  RT+ +   T+  G                  
Subjt:  YKVEV-----------ENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTLEFGV-----------------

Query:  -------------------------VLRHVSV----------SNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVS-------TN----------T
                                  L+H+ V          +   K + +S    FVGY  E  G   +D    K IV+       TN          T
Subjt:  -------------------------VLRHVSV----------SNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVS-------TN----------T

Query:  TFL---EEDHMRNHKPHSKLVL-----NEGTD--------------------EPTRVVDQAGPSSRVD--------------------------------
         FL   +E   +N    S+ ++     NE  +                    +  +++    P+   +                                
Subjt:  TFL---EEDHMRNHKPHSKLVL-----NEGTD--------------------EPTRVVDQAGPSSRVD--------------------------------

Query:  -EGANTSSQSRPSQS------LGMP------------RRSGRVVCQPNRYLGLAE---TQVVIPDDDV--EDPLSYKQTMSDVDKDQWIKAMDLEMESMD
         +G+   ++SR S++      +G+             RRS R+  +P       +    +VV+    +  + P S+ +     DK  W +A++ E+ +  
Subjt:  -EGANTSSQSRPSQS------LGMP------------RRSGRVVCQPNRYLGLAE---TQVVIPDDDV--EDPLSYKQTMSDVDKDQWIKAMDLEMESMD

Query:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI
         N+ W +  +PE    +  +W++  K +  G    YKARLVA+GFTQ+ ++DYEETF PVA + S R +LS+   Y+ ++
Subjt:  FNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.8e-59 | 31.59
Query:  KLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSR-----------YGY
        K +GH++   ++ L+K  L++  +  ++ PC+ CL GK  +  F     R +  L+LV+SD+CGP+ +++  G +YF++FIDD SR             +
Subjt:  KLLGHINLNRIERLSKNGLLNKLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSR-----------YGY

Query:  LEFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLD--------------------------------
          F+ +   VE   G+ +K LRSD GGEY    F++Y   HGI+ + + P TP  NGV+ER NRT+++                                
Subjt:  LEFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLD--------------------------------

Query:  ----------MFTTLE--------FGV-VLRHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----
                  ++T  E        FG     HV      KL+ +S  C F+GY  E  G   +DP + KVI S +  F  E  +R     S+ V N    
Subjt:  ----------MFTTLE--------FGV-VLRHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLN----

Query:  ----------------EGTDEPTRVVDQAG----PSSRVDEGANTSSQSRPSQSLGMP-RRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDV
                          TDE +   +Q G       ++DEG          +    P RRS R   +  RY     T+ V+  DD  +P S K+ +S  
Subjt:  ----------------EGTDEPTRVVDQAG----PSSRVDEGANTSSQSRPSQSLGMP-RRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDV

Query:  DKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI
        +K+Q +KAM  EMES+  N  ++LV+ P+G RP+ CKW++K K+D   K+  YKARLV KGF Q++ +D++E F PV  + SIR +LS+AA  D E+
Subjt:  DKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI

Q12337 Transposon Ty2-GR1 Gag-Pol polyprotein | 1.0e-14 | 26.58
Query:  KLLGHINLNRIERLSKNGLLNKLED-------VSLPPCESCLEGKMTKRLFTGKGYR-----AIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYG
        ++LGH N   I++  K   +  L++        S   C  CL GK TK     KG R     + EP + +H+D+ GPV+   +    YFISF D+ +R+ 
Subjt:  KLLGHINLNRIERLSKNGLLNKLED-------VSLPPCESCLEGKMTKRLFTGKGYR-----AIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYG

Query:  YL-------------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLD---------------MFT
        ++              F      ++N     +  ++ DRG EY +     +    GI +  +T      +GV+ER NRTLL+                F+
Subjt:  YL-------------EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLD---------------MFT

Query:  TLEFGVVLRHVSVSNPKKLEPR
         +EF  ++R+  VS  K+   R
Subjt:  TLEFGVVLRHVSVSNPKKLEPR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 8.3e-25 | 22.38
Query:  CESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYM
        C  CL  K  K  F+     +  PLE ++SD+     + + + Y Y++ F+D ++RY +L            F  +K  +EN     I T  SD GGE++
Subjt:  CESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------EFKVYKVEVENALGKTIKTLRSDRGGEYM

Query:  DLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTLEFGVVLRHVSVS---------------------------------------------
         L   +Y  +HGI    S P+TP  NG+SER++R +++   TL     L H S+                                              
Subjt:  DLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTLEFGVVLRHVSVS---------------------------------------------

Query:  -----------NPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEE-----------DHMRNHKPHSKLVLNEGTDEPTRVVDQAGP---
                   N  KL+ +SR C F+GY       L    Q +++ +S +  F E              ++  +  S  V +  T  PTR      P   
Subjt:  -----------NPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEE-----------DHMRNHKPHSKLVLNEGTDEPTRVVDQAGP---

Query:  ----------------------SSRVDEGANTSSQSRP------------------------------------------SQSLGMPRRSGRVVCQPNRY
                              SS +D   ++S  S P                                          +QSL  P +S      P   
Subjt:  ----------------------SSRVDEGANTSSQSRP------------------------------------------SQSLGMPRRSGRVVCQPNRY

Query:  LGLAETQVVIPDDDVEDPLSYKQTMSD--------------------------------------------VDKDQWIKAMDLEMESMDFNSVWELVDQP
           + T    P   +  P    Q +++                                            +  ++W  AM  E+ +   N  W+LV  P
Subjt:  LGLAETQVVIPDDDVEDPLSYKQTMSD--------------------------------------------VDKDQWIKAMDLEMESMDFNSVWELVDQP

Query:  EG-VRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIA
           V  +GC+WI+ +K ++ G +  YKARLVAKG+ QR  +DY ETF PV    SIRI+L +A
Subjt:  EG-VRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.1e-24 | 22.95
Query:  LGHINLNRIERLSKNGLLNKLE-DVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------
        LGH +L  +  +  N  L  L     L  C  C   K  K  F+     + +PLE ++SD+     + + + Y Y++ F+D ++RY +L           
Subjt:  LGHINLNRIERLSKNGLLNKLE-DVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYL-----------

Query:  EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTLEFGVVLRHVSVS----------------
         F ++K  VEN     I TL SD GGE++ LR  DY+ +HGI    S P+TP  NG+SER++R +++M  TL     L H SV                 
Subjt:  EFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSTPNTPHQNGVSERRNRTLLDMFTTLEFGVVLRHVSVS----------------

Query:  ----------------------------------------NPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLE----------------
                                                N  KLE +S+ C F+GY       L       ++  S +  F E                
Subjt:  ----------------------------------------NPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLE----------------

Query:  ---EDHMRNHKPHSKL--------------------------------------------VLNEGTDEPTRVVDQAGPSSRV----------------DE
            D   N   H+ L                                            + +  + EPT      GP                    + 
Subjt:  ---EDHMRNHKPHSKL--------------------------------------------VLNEGTDEPTRVVDQAGPSSRV----------------DE

Query:  GANTSSQSRPSQSLGMPR---------RSGRVVCQPNRYLGLAETQVVIP--------------------------DDDVEDP---LSY----------K
          N+ S + P+Q+  +P+              + +PN     + +   +P                           D +  P    SY          +
Subjt:  GANTSSQSRPSQSLGMPR---------RSGRVVCQPNRYLGLAETQVVIP--------------------------DDDVEDP---LSY----------K

Query:  QTMSDVDKDQWIKAMDLEMESMDFNSVWELV-DQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIA
          +  +  D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K ++ G +  YKARLVAKG+ QR  +DY ETF PV    SIRI+L +A
Subjt:  QTMSDVDKDQWIKAMDLEMESMDFNSVWELV-DQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIA

Arabidopsis top hits | e value | %identity
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 5.7e-21 | 46.24
Query:  WIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI
        W  AMD E+ +M+    WE+   P   +PIGCKW+YK K ++ G ++ YKARLVAKG+TQ+E +D+ ETF PV  L S++++L+I+A Y++ +
Subjt:  WIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 1.2e-13 | 40.7
Query:  WIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIA
        W +AM  E++++  N  W LV  P     +GCKW++K K  + G +   KARLVAKGF Q E + + ET+ PV    +IR +L++A
Subjt:  WIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKWIYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIA
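
The %identity column can be related to the middle line of each alignment block. In BLAST-style output, a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below is a minimal illustration under that assumption; the function name `midline_stats` is hypothetical, and the input is only the first ten columns of the ATMG00820.1 middle line, so it does not reproduce the full-alignment value reported above.

```python
def midline_stats(midline: str) -> tuple[float, float]:
    """Return (% identical, % positive) for one BLAST-style middle line.

    Assumes the string spans the full alignment width, i.e. trailing
    spaces have not been stripped.
    """
    length = len(midline)
    identical = sum(ch.isalpha() for ch in midline)   # letters = identities
    positive = identical + midline.count("+")         # '+' = similar residues
    return 100 * identical / length, 100 * positive / length


# First ten alignment columns of the ATMG00820.1 hit (query: WIKAMDLEME)
midline = "W +AM  E++"
identity, positives = midline_stats(midline)
print(f"identity={identity:.1f}%  positives={positives:.1f}%")
```

Because whitespace is significant here, any copy of the alignment that has lost trailing or internal spaces would give a different denominator.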


Sequences
CDS sequence
ATGTCATCCTCAATAATAGCCTTACTTAAAAGCGATCGTTTAACTGATGAGAATTTTACTACGTGGAAGTCTAACTTGAATACGATTCTCGTTGTTGACGACCTACGGTT
TCTACTAACTGAAGAATGTCCTCAGGTCCCTGCTCGTAACGCTCCTCAATCTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGACAAAAAGCATGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGATATGTTTGGACAAACGTCTGGACAGATTCGA
CACGAATCCCTCAAATACATTTATAACTCTCGTATGAAGGAGGGGTCATTGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGGAGC
AGTCATTGACGACAATGCGGTGATGAACAAGATTGAGTATAACCTGACTACTCTCCTTAATGAACTACAAACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCTGATG
GAGAAGCAAATCTGTGTGTCCATTCCAGAAGGTTTCAGAAGGGTTCATCCTCTGGAACTAAGTCCTGTGGTTCATTTTCTGGGCTTAAGAAGACCCAAAAGAAGAAGATA
GGAGGGAAAGGGAAGGCACCTGCTGCTGACAAAGGCAAGGGAAAAGTCAAGGTTGTAGATAAAGGAAAGTGTTTCCACTGCAACGTGGATGGGCACTGGAAGCGAAACTG
CCCAAAATACCTTGCTGAGCTCAAAGAGAAGAAAGCCACTAATCATGTTTGCTCTTCGCTTCGTGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGGTGAGATGACGCTCA
GGGTCGGAACGGGAGACGTCGTCTCAGCTCGTGCAGTGGGAGATGCCAAGCTACTTGGTCATATTAATCTCAACCGGATTGAGAGACTTTCTAAGAATGGACTTCTAAAT
AAGTTAGAAGATGTTTCTTTACCTCCTTGCGAGTCTTGCTTGGAAGGTAAAATGACCAAGCGACTTTTTACTGGAAAAGGTTACAGAGCCATAGAGCCATTAGAACTTGT
ACATTCGGATCTTTGTGGTCCGGTGAATGTTAAAGCTCGGGAAGGGTACGAATATTTCATCTCTTTCATAGATGATTATTCGAGGTATGGTTATTTAGAGTTCAAAGTGT
ATAAGGTTGAAGTAGAGAATGCATTAGGGAAAACCATCAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTCCAGGACTATATGATAGAACATGGA
ATTAAATCTCAACTCTCAACACCTAATACACCACACCAAAATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTGGACATGTTTACAACACTTGAATTTGGGGTTGTCCT
AAGACACGTCTCGGTGTCAAACCCAAAGAAACTGGAACCTCGTTCAAGATTGTGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTGGTCTTTTCTACGACCCACAAG
AAAACAAGGTGATTGTATCGACAAACACCACGTTCTTGGAGGAAGATCACATGAGGAACCACAAACCGCATAGTAAATTAGTGCTAAATGAAGGTACAGATGAACCAACA
AGAGTTGTTGATCAAGCAGGACCTTCATCGAGAGTTGATGAAGGTGCCAACACCTCAAGTCAGTCTCGTCCTTCTCAATCGTTGGGAATGCCTCGACGTAGTGGGAGGGT
TGTTTGCCAACCTAACCGCTACTTGGGTTTAGCTGAAACTCAAGTTGTCATACCTGATGACGACGTAGAAGATCCATTGTCTTATAAACAGACGATGAGTGACGTAGACA
AGGACCAATGGATCAAAGCCATGGACCTTGAAATGGAGTCAATGGACTTCAATTCAGTATGGGAACTTGTAGACCAACCTGAAGGGGTTAGACCCATAGGGTGTAAATGG
ATCTATAAGAGAAAAAGAGATGCAGCCGGAAAGGTACAGACCTATAAAGCTAGACTGGTAGCAAAGGGTTTTACCCAAAGGGAAGAAGTTGATTATGAAGAAACTTTTTT
CCCTGTTGCTATGCTGAAGTCTATAAGGATACTCTTGTCCATAGCCGCATTTTATGATTATGAAATTTGA
mRNA sequence
ATGTCATCCTCAATAATAGCCTTACTTAAAAGCGATCGTTTAACTGATGAGAATTTTACTACGTGGAAGTCTAACTTGAATACGATTCTCGTTGTTGACGACCTACGGTT
TCTACTAACTGAAGAATGTCCTCAGGTCCCTGCTCGTAACGCTCCTCAATCTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGACAAAAAGCATGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGATATGTTTGGACAAACGTCTGGACAGATTCGA
CACGAATCCCTCAAATACATTTATAACTCTCGTATGAAGGAGGGGTCATTGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGGAGC
AGTCATTGACGACAATGCGGTGATGAACAAGATTGAGTATAACCTGACTACTCTCCTTAATGAACTACAAACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCTGATG
GAGAAGCAAATCTGTGTGTCCATTCCAGAAGGTTTCAGAAGGGTTCATCCTCTGGAACTAAGTCCTGTGGTTCATTTTCTGGGCTTAAGAAGACCCAAAAGAAGAAGATA
GGAGGGAAAGGGAAGGCACCTGCTGCTGACAAAGGCAAGGGAAAAGTCAAGGTTGTAGATAAAGGAAAGTGTTTCCACTGCAACGTGGATGGGCACTGGAAGCGAAACTG
CCCAAAATACCTTGCTGAGCTCAAAGAGAAGAAAGCCACTAATCATGTTTGCTCTTCGCTTCGTGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGGTGAGATGACGCTCA
GGGTCGGAACGGGAGACGTCGTCTCAGCTCGTGCAGTGGGAGATGCCAAGCTACTTGGTCATATTAATCTCAACCGGATTGAGAGACTTTCTAAGAATGGACTTCTAAAT
AAGTTAGAAGATGTTTCTTTACCTCCTTGCGAGTCTTGCTTGGAAGGTAAAATGACCAAGCGACTTTTTACTGGAAAAGGTTACAGAGCCATAGAGCCATTAGAACTTGT
ACATTCGGATCTTTGTGGTCCGGTGAATGTTAAAGCTCGGGAAGGGTACGAATATTTCATCTCTTTCATAGATGATTATTCGAGGTATGGTTATTTAGAGTTCAAAGTGT
ATAAGGTTGAAGTAGAGAATGCATTAGGGAAAACCATCAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTCCAGGACTATATGATAGAACATGGA
ATTAAATCTCAACTCTCAACACCTAATACACCACACCAAAATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTGGACATGTTTACAACACTTGAATTTGGGGTTGTCCT
AAGACACGTCTCGGTGTCAAACCCAAAGAAACTGGAACCTCGTTCAAGATTGTGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTGGTCTTTTCTACGACCCACAAG
AAAACAAGGTGATTGTATCGACAAACACCACGTTCTTGGAGGAAGATCACATGAGGAACCACAAACCGCATAGTAAATTAGTGCTAAATGAAGGTACAGATGAACCAACA
AGAGTTGTTGATCAAGCAGGACCTTCATCGAGAGTTGATGAAGGTGCCAACACCTCAAGTCAGTCTCGTCCTTCTCAATCGTTGGGAATGCCTCGACGTAGTGGGAGGGT
TGTTTGCCAACCTAACCGCTACTTGGGTTTAGCTGAAACTCAAGTTGTCATACCTGATGACGACGTAGAAGATCCATTGTCTTATAAACAGACGATGAGTGACGTAGACA
AGGACCAATGGATCAAAGCCATGGACCTTGAAATGGAGTCAATGGACTTCAATTCAGTATGGGAACTTGTAGACCAACCTGAAGGGGTTAGACCCATAGGGTGTAAATGG
ATCTATAAGAGAAAAAGAGATGCAGCCGGAAAGGTACAGACCTATAAAGCTAGACTGGTAGCAAAGGGTTTTACCCAAAGGGAAGAAGTTGATTATGAAGAAACTTTTTT
CCCTGTTGCTATGCTGAAGTCTATAAGGATACTCTTGTCCATAGCCGCATTTTATGATTATGAAATTTGA
Protein sequence
MSSSIIALLKSDRLTDENFTTWKSNLNTILVVDDLRFLLTEECPQVPARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLDKKHEGMVSAREIMSSLQDMFGQTSGQIR
HESLKYIYNSRMKEGSLVREHVLDLMVHFNVAEMNGAVIDDNAVMNKIEYNLTTLLNELQTFQSLMKNKGQADGEANLCVHSRRFQKGSSSGTKSCGSFSGLKKTQKKKI
GGKGKAPAADKGKGKVKVVDKGKCFHCNVDGHWKRNCPKYLAELKEKKATNHVCSSLRETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERLSKNGLLN
KLEDVSLPPCESCLEGKMTKRLFTGKGYRAIEPLELVHSDLCGPVNVKAREGYEYFISFIDDYSRYGYLEFKVYKVEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHG
IKSQLSTPNTPHQNGVSERRNRTLLDMFTTLEFGVVLRHVSVSNPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNTTFLEEDHMRNHKPHSKLVLNEGTDEPT
RVVDQAGPSSRVDEGANTSSQSRPSQSLGMPRRSGRVVCQPNRYLGLAETQVVIPDDDVEDPLSYKQTMSDVDKDQWIKAMDLEMESMDFNSVWELVDQPEGVRPIGCKW
IYKRKRDAAGKVQTYKARLVAKGFTQREEVDYEETFFPVAMLKSIRILLSIAAFYDYEI
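
The protein sequence above corresponds to a translation of the CDS with the standard genetic code. The following is a minimal sketch of that check, not the database's own pipeline; the helper name `translate` is hypothetical, and only the first 30 and last 27 in-frame nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":          # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTCATCCTCAATAATAGCCTTACTTAAA"   # first 10 codons of the CDS
cds_end = "GCCGCATTTTATGATTATGAAATTTGA"        # last 8 codons plus the TGA stop

print(translate(cds_start))  # expected to match the start of the protein
print(translate(cds_end))    # expected to match the end of the protein
```

Passing the full CDS (with line breaks removed) to the same function should give the complete protein sequence listed above.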