; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021264 (gene) of Snake gourd v1 genome

Gene IDTan0021264
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG01:99043799..99045802
RNA-Seq ExpressionTan0021264
SyntenyTan0021264
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]9.5e-17281.45Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQN DEPCVYKKI    VAFL+LYVDDILLIGNDVGYLTD+K WLA QFQMKDLG A +   IQI+++RKN+T
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK+L R+ MQ+SKKGLL FRHG+HLSKEQ PKTPQ VEDMRRIPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL+HWT VK +LKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+YMLVYGAKDLILTGYTDS+FQTDKDSRKST GSVFTLNGGAVVWRS+KQ CIADSTMEAEYVAACEA KE VWLRKF+ +LEVVPNM LP+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ
        CDNSGA ANS++PRSHKRGKHIERKYHLIREIV RGDVIVTKIAS+HNI DPFTK LTAKVFE  LE LGL+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-16980.65Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQN DEPCVYKKI    VAFL+LYVDDILLIGNDVGYLTD+K WLA QFQMKDLG   +   IQI+++RKN+T
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK+L R+ MQ+SKKGLL FRHG+HLSKEQ PKTPQ VEDMRRIPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL+HWT VK ILKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+YMLVYGAKDLILTGYT+S+FQTDKDSRKST  SVFTLNGGAVVWRS+KQ CIADSTMEAEYVAACEA KE VWL+KF+ +LEVVPNM LP+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ
        CDNSGA ANS++PRSHKRGKHIERKYHLIREIV RGDVIVTKIAS+HNI DPFTK LTAKVFE  LE LGL+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]9.5e-17281.45Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQN DEPCVYKKI    VAFL+LYVDDILLIGNDVGYLTD+K WLA QFQMKDLG A +   IQI+++RKN+T
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK+L R+ MQ+SKKGLL FRHG+HLSKEQ PKTPQ VEDMRRIPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL+HWT VK +LKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+YMLVYGAKDLILTGYTDS+FQTDKDSRKST GSVFTLNGGAVVWRS+KQ CIADSTMEAEYVAACEA KE VWLRKF+ +LEVVPNM LP+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ
        CDNSGA ANS++PRSHKRGKHIERKYHLIREIV RGDVIVTKIAS+HNI DPFTK LTAKVFE  LE LGL+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-17181.45Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQN DEPCVYKKI    VAFL+LYVDDILLIGND GYLTD+K WLA QFQMKDLG A +   IQI+++RKN+T
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK+L R+ MQ+SKKGLL FRHG+HLSKEQ PKTPQ VEDMRRIPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL+HWTTVK ILKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+YMLVYGAKDLILTGYTDS+FQTDKDSRKST GSVFTLN GAVVWRS+KQ CIADSTMEAEYVAACEA KE VWLRKF+ +LEVVPNM LP+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ
        CDNSGA ANS++PRSHKRGKHIERKYHLIREIV RGDVIVTKIAS+HNI DPFTK LTAKVFE  LE LGL+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]7.5e-16978.99Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQ  DEPCVYK+IIN SVAFL+LYVDDILLIGND+G LTDIK+WLATQFQMKDLG A F   IQI ++RKN+ 
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK++ ++ MQ+SK+GLL FRHG+ LSKEQCPKTPQ VE+MR IPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL HWT VKTILKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+Y LVYG+KDLILTGYTDS+FQTD+DSRKST GSVFTLNGGAVVWRS+KQ CIADSTMEAEYVAACEA KE VWLR F+++LEVVPNM+ P+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQVFPN
        CDNSGA ANSR+PRSHKRGKHIERKYHLIREIVHRGDVIVT+IAS HN+ DPFTK LTAKVFE  LE LGL+  P+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQVFPN

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.6e-16978.99Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQ  DEPCVYK+IIN SVAFL+LYVDDILLIGND+G LTDIK+WLATQFQMKDLG A F   IQI ++RKN+ 
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK++ ++ MQ+SK+GLL FRHG+ LSKEQCPKTPQ VE+MR IPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL HWT VKTILKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+Y LVYG+KDLILTGYTDS+FQTD+DSRKST GSVFTLNGGAVVWRS+KQ CIADSTMEAEYVAACEA KE VWLR F+++LEVVPNM+ P+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQVFPN
        CDNSGA ANSR+PRSHKRGKHIERKYHLIREIVHRGDVIVT+IAS HN+ DPFTK LTAKVFE  LE LGL+  P+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQVFPN

A0A5A7T2V9 Gag/pol protein7.3e-17080.65Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQN DEPCVYKKI    VAFL+LYVDDILLIGNDVGYLTD+K WLA QFQMKDLG   +   IQI+++RKN+T
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK+L R+ MQ+SKKGLL FRHG+HLSKEQ PKTPQ VEDMRRIPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL+HWT VK ILKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+YMLVYGAKDLILTGYT+S+FQTDKDSRKST  SVFTLNGGAVVWRS+KQ CIADSTMEAEYVAACEA KE VWL+KF+ +LEVVPNM LP+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ
        CDNSGA ANS++PRSHKRGKHIERKYHLIREIV RGDVIVTKIAS+HNI DPFTK LTAKVFE  LE LGL+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ

A0A5A7TZD0 Gag/pol protein4.6e-17281.45Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQN DEPCVYKKI    VAFL+LYVDDILLIGNDVGYLTD+K WLA QFQMKDLG A +   IQI+++RKN+T
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK+L R+ MQ+SKKGLL FRHG+HLSKEQ PKTPQ VEDMRRIPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL+HWT VK +LKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+YMLVYGAKDLILTGYTDS+FQTDKDSRKST GSVFTLNGGAVVWRS+KQ CIADSTMEAEYVAACEA KE VWLRKF+ +LEVVPNM LP+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ
        CDNSGA ANS++PRSHKRGKHIERKYHLIREIV RGDVIVTKIAS+HNI DPFTK LTAKVFE  LE LGL+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ

A0A5A7UYE8 Gag/pol protein4.6e-17281.45Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQN DEPCVYKKI    VAFL+LYVDDILLIGNDVGYLTD+K WLA QFQMKDLG A +   IQI+++RKN+T
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK+L R+ MQ+SKKGLL FRHG+HLSKEQ PKTPQ VEDMRRIPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL+HWT VK +LKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+YMLVYGAKDLILTGYTDS+FQTDKDSRKST GSVFTLNGGAVVWRS+KQ CIADSTMEAEYVAACEA KE VWLRKF+ +LEVVPNM LP+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ
        CDNSGA ANS++PRSHKRGKHIERKYHLIREIV RGDVIVTKIAS+HNI DPFTK LTAKVFE  LE LGL+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ

A0A5A7V1F5 Gag/pol protein2.3e-17181.45Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT
        N  ++  KQASRSWNIRFD AI+SYGFDQN DEPCVYKKI    VAFL+LYVDDILLIGND GYLTD+K WLA QFQMKDLG A +   IQI+++RKN+T
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVF-SWIQIVKNRKNRT

Query:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY
        LALS A YIDK+L R+ MQ+SKKGLL FRHG+HLSKEQ PKTPQ VEDMRRIPYASAVGSL YAMLCTRPDIC+AVG+VSRYQSN GL+HWTTVK ILKY
Subjt:  LALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY
        LRRTR+YMLVYGAKDLILTGYTDS+FQTDKDSRKST GSVFTLN GAVVWRS+KQ CIADSTMEAEYVAACEA KE VWLRKF+ +LEVVPNM LP+TLY
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLY

Query:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ
        CDNSGA ANS++PRSHKRGKHIERKYHLIREIV RGDVIVTKIAS+HNI DPFTK LTAKVFE  LE LGL+
Subjt:  CDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.6e-4731.76Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVY---KKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVFSWIQIVKNRKN
        N+ ++  KQA+R W   F++A++   F  +  + C+Y   K  IN ++ +++LYVDD+++   D+  + + K +L  +F+M DL N +  +I I    + 
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVY---KKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVFSWIQIVKNRKN

Query:  RTLALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHL----SKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTV
          + LS + Y+ K+LS+F M++           I+     S E C             P  S +G L Y MLCTRPD+  AV ++SRY S +  E W  +
Subjt:  RTLALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHL----SKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTV

Query:  KTILKYLRRTRNYMLVYG---AKDLILTGYTDSNFQTDKDSRKSTLGSVFTL-NGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVV
        K +L+YL+ T +  L++    A +  + GY DS++   +  RKST G +F + +   + W + +Q  +A S+ EAEY+A  EA +E +WL+  + ++ + 
Subjt:  KTILKYLRRTRNYMLVYG---AKDLILTGYTDSNFQTDKDSRKSTLGSVFTL-NGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVV

Query:  PNMTLPVTLYCDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGL
          +  P+ +Y DN G  + +  P  HKR KHI+ KYH  RE V    + +  I +++ + D FTK L A  F    + LGL
Subjt:  PNMTLPVTLYCDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGL

P0CV72 Secreted RxLR effector protein 1612.5e-2645.86Show/hide
Query:  MRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNYMLVY-GAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGA
        M+ +PY SAVG++ Y M+ TRPD+  AVG++S++ S+    HW  +K +L+YL+ T+ Y L +  A    L GY+D+++  D +SR+ST G +F LNGG 
Subjt:  MRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNYMLVY-GAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGA

Query:  VVWRSVKQECIADSTMEAEYVAACEAGKEVVWL
        V WRS KQ  +A S+ E EY+A  EA +E VWL
Subjt:  VVWRSVKQECIADSTMEAEYVAACEAGKEVVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-7742.47Show/hide
Query:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVY-KKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNA-VFSWIQIVKNRKNR
        N+ ++  KQA R W ++FD  ++S  + +   +PCVY K+   ++   L+LYVDD+L++G D G +  +K  L+  F MKDLG A     ++IV+ R +R
Subjt:  NEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVY-KKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNA-VFSWIQIVKNRKNR

Query:  TLALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILK
         L LS   YI+++L RF M+++K         + LSK+ CP T +   +M ++PY+SAVGSL YAM+CTRPDI  AVG+VSR+  N G EHW  VK IL+
Subjt:  TLALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILK

Query:  YLRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTL
        YLR T    L +G  D IL GYTD++   D D+RKS+ G +FT +GGA+ W+S  Q+C+A ST EAEY+AA E GKE++WL++F+  L +         +
Subjt:  YLRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTL

Query:  YCDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGL
        YCD+  A   S+    H R KHI+ +YH IRE+V    + V KI++  N  D  TK +    FE   E +G+
Subjt:  YCDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.1e-3730.16Show/hide
Query:  KQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVFSWIQIVKNRKNRTLALSLALY
        KQA R+W +     + + GF  +  +  ++      S+ ++++YVDDIL+ GND   L +  + L+ +F +KD    +  ++ I   R    L LS   Y
Subjt:  KQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVFSWIQIVKNRKNRTLALSLALY

Query:  IDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNY-
        I  +L+R  M  +K           LS     K     E      Y   VGSL Y +  TRPDI +AV  +S++      EH   +K IL+YL  T N+ 
Subjt:  IDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNY-

Query:  MLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLYCDNSGAR
        + +     L L  Y+D+++  DKD   ST G +  L    + W S KQ+ +  S+ EAEY +      E+ W+   +  L +   +T P  +YCDN GA 
Subjt:  MLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLYCDNSGAR

Query:  ANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQVFP
             P  H R KHI   YH IR  V  G + V  +++   + D  TK L+   F++    +G+   P
Subjt:  ANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQVFP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.0e-3628.53Show/hide
Query:  KQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVFSWIQIVKNRKNRTLALSLALY
        KQA R+W +     + + GF  +  +  ++      S+ ++++YVDDIL+ GND   L    + L+ +F +K+  +  + ++ I   R  + L LS   Y
Subjt:  KQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVFSWIQIVKNRKNRTLALSLALY

Query:  IDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNY-
           +L+R  M  +K           L+     K P   E      Y   VGSL Y +  TRPD+ +AV  +S+Y      +HW  +K +L+YL  T ++ 
Subjt:  IDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNY-

Query:  MLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLYCDNSGAR
        + +     L L  Y+D+++  D D   ST G +  L    + W S KQ+ +  S+ EAEY +      E+ W+   +  L +   ++ P  +YCDN GA 
Subjt:  MLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMTLPVTLYCDNSGAR

Query:  ANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQVFP
             P  H R KHI   YH IR  V  G + V  +++   + D  TK L+   F++    +G+   P
Subjt:  ANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQVFP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.4e-3228.49Show/hide
Query:  AACEAANEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVFSWIQIVKN
        A C     +  L KQASR W ++F   +  +GF Q+  +   + KI  +    +++YVDDI++  N+   + ++K  L + F+++DLG   + ++ +   
Subjt:  AACEAANEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNAVFSWIQIVKN

Query:  RKNRTLALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVK
        R    + +    Y   +L    +   K   +     +  S         G + +    Y   +G L Y  + TR DI FAV  +S++     L H   V 
Subjt:  RKNRTLALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVK

Query:  TILKYLRRTRNYMLVYGAK-DLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMT
         IL Y++ T    L Y ++ ++ L  ++D++FQ+ KD+R+ST G    L    + W+S KQ+ ++ S+ EAEY A   A  E++WL +F   L++   ++
Subjt:  TILKYLRRTRNYMLVYGAK-DLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVWLRKFMLNLEVVPNMT

Query:  LPVTLYCDNSGARANSRKPRSHKRGKHIERKYHLIRE
         P  L+CDN+ A   +     H+R KHIE   H +RE
Subjt:  LPVTLYCDNSGARANSRKPRSHKRGKHIERKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.1e-0436.11Show/hide
Query:  TRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNYMLVYGA-KDLILTGYTDSNFQTDKDSRKSTLG
        TRPD+ FAV  +S++ S S       V  +L Y++ T    L Y A  DL L  + DS++ +  D+R+S  G
Subjt:  TRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNYMLVYGA-KDLILTGYTDSNFQTDKDSRKSTLG

ATMG00810.1 DNA/RNA polymerases superfamily protein6.0e-1528.99Show/hide
Query:  FLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNA-VFSWIQIVKNRKNRTLALSLALYIDKMLSRFKMQDSKKGL----LHFRHGIHLSKEQCPKT
        +L+LYVDDILL G+    L  +   L++ F MKDLG    F  IQI  +     L LS   Y +++L+   M D K       L     +  +K   P  
Subjt:  FLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGNA-VFSWIQIVKNRKNRTLALSLALYIDKMLSRFKMQDSKKGL----LHFRHGIHLSKEQCPKT

Query:  PQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNY-MLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVF
                   + S VG+L Y  L TRPDI +AV +V +      L  +  +K +L+Y++ T  + + ++    L +  + DS++     +R+ST G   
Subjt:  PQGVEDMRRIPYASAVGSLTYAMLCTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNY-MLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVF

Query:  TLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVW
         L    + W + +Q  ++ S+ E EY A      E+ W
Subjt:  TLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKEVVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAGCCCGAAGGGTTCATAACCCAAGCTCAGGAGCAAAAAGTTTGCAAGCTCAATCGATCGATTTATGGGTTGAAACAAGCATCCAAATCATGGAATATAAGATT
TGATATTGCGATCAAGTCTTATGGCTTTGACCAAAACGTTGATGAGCCTTGCGTTTACAAGAGGATCATCAACGACAAAGTAGCTTTATTAGTACTTTATGTGGATGATA
TCCTACTCATTGGGAATGATAATTCCAAGAGGGGCTTGTTGCCATTCAGTCATGGAATTCATCTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGGATATG
AGACGTATTCCTTATGCCTCTGTAACCGATAAGGATTCTCGAAAATCCACATCGGGATCAGTTTTCGACCTTAACGGGGGTGCTATAGTATGGCGAAGTATCAAGCAAGG
ATGCATCGCTGACTCCATGATGGAGACTGAGTATGTCGCTGCTTGTGAAGCAGCTAACGAGGTTGTTTGGCTAAGGAAACAAGCCTCTAGGTCCTGGAATATAAGATTTG
ATGAGGCTATCAGATCTTATGGTTTTGACCAGAATGATGATGAACCTTGTGTCTACAAGAAAATCATCAATAGTTCTGTCGCTTTCCTAATTCTCTATGTGGATGATATC
CTACTCATTGGGAATGATGTAGGTTATCTTACTGACATCAAGGAATGGCTAGCAACGCAATTCCAAATGAAAGATTTGGGTAATGCAGTTTTTTCTTGGATCCAGATTGT
CAAGAATCGCAAGAATAGAACACTAGCCTTGTCTCTAGCATTATACATAGACAAAATGTTGTCAAGGTTTAAGATGCAAGATTCCAAAAAGGGCTTGTTGCATTTTAGAC
ATGGAATCCATTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGGAGTTGAGGATATGCGACGGATTCCTTACGCATCAGCTGTTGGGAGCCTAACGTACGCCATGTTG
TGTACTAGGCCTGACATCTGTTTTGCAGTTGGGATGGTCAGTAGGTATCAATCCAATTCAGGACTTGAACACTGGACAACGGTTAAAACGATCCTTAAGTATCTTCGGAG
AACAAGGAACTACATGCTTGTGTATGGGGCTAAGGATTTGATCCTTACAGGATACACGGATTCTAACTTTCAGACTGATAAAGATTCTCGAAAATCTACATTAGGGTCAG
TTTTCACTCTTAATGGAGGAGCTGTAGTATGGCGAAGCGTCAAGCAAGAATGCATCGCTGATTCCACCATGGAGGCCGAATATGTAGCAGCTTGTGAAGCAGGTAAGGAG
GTCGTTTGGTTAAGGAAATTTATGCTAAATTTGGAAGTTGTTCCAAATATGACTTTGCCCGTCACACTGTATTGCGATAACAGTGGTGCAAGGGCAAATTCGAGGAAACC
CCGAAGTCACAAGAGGGGAAAGCACATAGAGCGGAAGTATCATCTTATCCGGGAGATCGTGCACAGAGGAGACGTGATTGTCACGAAGATAGCCTCAAAGCACAACATTG
ATGATCCTTTTACAAAGGCTCTCACGGCTAAAGTATTTGAGAGTGACCTAGAAGGTCTAGGTTTACAAGTCTTCCCCAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTCAGCCCGAAGGGTTCATAACCCAAGCTCAGGAGCAAAAAGTTTGCAAGCTCAATCGATCGATTTATGGGTTGAAACAAGCATCCAAATCATGGAATATAAGATT
TGATATTGCGATCAAGTCTTATGGCTTTGACCAAAACGTTGATGAGCCTTGCGTTTACAAGAGGATCATCAACGACAAAGTAGCTTTATTAGTACTTTATGTGGATGATA
TCCTACTCATTGGGAATGATAATTCCAAGAGGGGCTTGTTGCCATTCAGTCATGGAATTCATCTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGGATATG
AGACGTATTCCTTATGCCTCTGTAACCGATAAGGATTCTCGAAAATCCACATCGGGATCAGTTTTCGACCTTAACGGGGGTGCTATAGTATGGCGAAGTATCAAGCAAGG
ATGCATCGCTGACTCCATGATGGAGACTGAGTATGTCGCTGCTTGTGAAGCAGCTAACGAGGTTGTTTGGCTAAGGAAACAAGCCTCTAGGTCCTGGAATATAAGATTTG
ATGAGGCTATCAGATCTTATGGTTTTGACCAGAATGATGATGAACCTTGTGTCTACAAGAAAATCATCAATAGTTCTGTCGCTTTCCTAATTCTCTATGTGGATGATATC
CTACTCATTGGGAATGATGTAGGTTATCTTACTGACATCAAGGAATGGCTAGCAACGCAATTCCAAATGAAAGATTTGGGTAATGCAGTTTTTTCTTGGATCCAGATTGT
CAAGAATCGCAAGAATAGAACACTAGCCTTGTCTCTAGCATTATACATAGACAAAATGTTGTCAAGGTTTAAGATGCAAGATTCCAAAAAGGGCTTGTTGCATTTTAGAC
ATGGAATCCATTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGGAGTTGAGGATATGCGACGGATTCCTTACGCATCAGCTGTTGGGAGCCTAACGTACGCCATGTTG
TGTACTAGGCCTGACATCTGTTTTGCAGTTGGGATGGTCAGTAGGTATCAATCCAATTCAGGACTTGAACACTGGACAACGGTTAAAACGATCCTTAAGTATCTTCGGAG
AACAAGGAACTACATGCTTGTGTATGGGGCTAAGGATTTGATCCTTACAGGATACACGGATTCTAACTTTCAGACTGATAAAGATTCTCGAAAATCTACATTAGGGTCAG
TTTTCACTCTTAATGGAGGAGCTGTAGTATGGCGAAGCGTCAAGCAAGAATGCATCGCTGATTCCACCATGGAGGCCGAATATGTAGCAGCTTGTGAAGCAGGTAAGGAG
GTCGTTTGGTTAAGGAAATTTATGCTAAATTTGGAAGTTGTTCCAAATATGACTTTGCCCGTCACACTGTATTGCGATAACAGTGGTGCAAGGGCAAATTCGAGGAAACC
CCGAAGTCACAAGAGGGGAAAGCACATAGAGCGGAAGTATCATCTTATCCGGGAGATCGTGCACAGAGGAGACGTGATTGTCACGAAGATAGCCTCAAAGCACAACATTG
ATGATCCTTTTACAAAGGCTCTCACGGCTAAAGTATTTGAGAGTGACCTAGAAGGTCTAGGTTTACAAGTCTTCCCCAACTAG
Protein sequenceShow/hide protein sequence
MSQPEGFITQAQEQKVCKLNRSIYGLKQASKSWNIRFDIAIKSYGFDQNVDEPCVYKRIINDKVALLVLYVDDILLIGNDNSKRGLLPFSHGIHLSKEQCPKTPQEVEDM
RRIPYASVTDKDSRKSTSGSVFDLNGGAIVWRSIKQGCIADSMMETEYVAACEAANEVVWLRKQASRSWNIRFDEAIRSYGFDQNDDEPCVYKKIINSSVAFLILYVDDI
LLIGNDVGYLTDIKEWLATQFQMKDLGNAVFSWIQIVKNRKNRTLALSLALYIDKMLSRFKMQDSKKGLLHFRHGIHLSKEQCPKTPQGVEDMRRIPYASAVGSLTYAML
CTRPDICFAVGMVSRYQSNSGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSNFQTDKDSRKSTLGSVFTLNGGAVVWRSVKQECIADSTMEAEYVAACEAGKE
VVWLRKFMLNLEVVPNMTLPVTLYCDNSGARANSRKPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASKHNIDDPFTKALTAKVFESDLEGLGLQVFPN