; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015574 (gene) of Snake gourd v1 genome

Gene IDTan0015574
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionX8 domain-containing protein
Genome locationLG05:74993122..74997278
RNA-Seq ExpressionTan0015574
SyntenyTan0015574
Gene Ontology termsGO:0006810 - transport (biological process)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6584413.1 hypothetical protein SDJN03_20345, partial [Cucurbita argyrosperma subsp. sororia]2.2e-21672.54Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        MP+KS DVP+ WLCGNCTLDEAKSPDDSG  VQPK+LRHAK  KVKFLPTEEV KLSSGGMKGPSK N A  PQRT KSRK FESS+PRP FQA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS +T+PP +C V KQAL TCLP     PVQTLKKA  K+ DI A  SSVS HGLPVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KLED QKQ+   LM QEI  YRD LPSLHASWKGGFQF+DT M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG
        DIALYFFP+ NIERSR N+S LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ++ N L+A+CLLFGVFRAIK +QS           VPML+YGSA 
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG

Query:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSLSKV
        SSV   S+VPLLEFTP GHG+HDE NAV + + IT GNT     ++KD+DSTIQRLLLEFGSQKP E DV+ALTT AQIK+QEPAPI+A GS+SLS SKV
Subjt:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSLSKV

Query:  KTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        K EP+ VIK E   D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD PKRVADKYLQIFNAG+KKERR
Subjt:  KTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

XP_022923683.1 uncharacterized protein LOC111431323 isoform X1 [Cucurbita moschata]5.7e-21772.68Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        MP+KS DVP+ WLCGNCTLDEAKSPDDSG  VQPKMLRHAK  KVKFLPTEEV KLSSGGMKGPSK N A  PQRT KSRK FESS+PRP FQA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS AT+PP +C + KQAL TCLP     PVQTLKKA  K+ DI A  SSVS HGLPVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KLED QKQI   LM QEI  YRD LPSLHASWKGGFQF+D  M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG
        DIALYFFP+ NIERSR+NSS LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ +   +N L+A+CLLFGVFRAIK +QS           VPML+YG
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG

Query:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL
        SA SSVE  S+VPLLEFTP GHG+HDE NAV +   IT GNT     ++KD+DSTIQRLLLEFGSQK  E DV+ALTT AQIK+QEPAPI+A GS+SLS 
Subjt:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL

Query:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        SKVK EP+ VIKEE S D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD P+RVADKYLQIFNAG+KKERR
Subjt:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

XP_022923685.1 uncharacterized protein LOC111431323 isoform X2 [Cucurbita moschata]1.2e-21773.06Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        MP+KS DVP+ WLCGNCTLDEAKSPDDSG  VQPKMLRHAK  KVKFLPTEEV KLSSGGMKGPSK N A  PQRT KSRK FESS+PRP FQA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS AT+PP +C + KQAL TCLP     PVQTLKKA  K+ DI A  SSVS HGLPVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KLED QKQI   LM QEI  YRD LPSLHASWKGGFQF+D  M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG
        DIALYFFP+ NIERSR+NSS LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ++ N L+A+CLLFGVFRAIK +QS           VPML+YGSA 
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG

Query:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSLSKV
        SSVE  S+VPLLEFTP GHG+HDE NAV +   IT GNT     ++KD+DSTIQRLLLEFGSQK  E DV+ALTT AQIK+QEPAPI+A GS+SLS SKV
Subjt:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSLSKV

Query:  KTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        K EP+ VIKEE S D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD P+RVADKYLQIFNAG+KKERR
Subjt:  KTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

XP_022923686.1 uncharacterized protein LOC111431323 isoform X3 [Cucurbita moschata]5.7e-21772.68Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        MP+KS DVP+ WLCGNCTLDEAKSPDDSG  VQPKMLRHAK  KVKFLPTEEV KLSSGGMKGPSK N A  PQRT KSRK FESS+PRP FQA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS AT+PP +C + KQAL TCLP     PVQTLKKA  K+ DI A  SSVS HGLPVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KLED QKQI   LM QEI  YRD LPSLHASWKGGFQF+D  M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG
        DIALYFFP+ NIERSR+NSS LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ +   +N L+A+CLLFGVFRAIK +QS           VPML+YG
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG

Query:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL
        SA SSVE  S+VPLLEFTP GHG+HDE NAV +   IT GNT     ++KD+DSTIQRLLLEFGSQK  E DV+ALTT AQIK+QEPAPI+A GS+SLS 
Subjt:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL

Query:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        SKVK EP+ VIKEE S D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD P+RVADKYLQIFNAG+KKERR
Subjt:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

XP_023520146.1 uncharacterized protein LOC111783448 isoform X2 [Cucurbita pepo subsp. pepo]2.7e-21472.06Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        MP+KS DVP+ WLCGNCTLDEAKSPDDSG  VQPKM RHAK  KVKFLPTEEV KLSSGGMKGPSK N A  PQRT KSRK FESS+PRP +QA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS AT+PP +C V K +LATCLP     PVQTLKKA  K+ DI A  SSVS H LPVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KLED QKQ+   LM QEI  YRD LPSLHASWKGGFQF+D  M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG
        DIALYFFP+ NIERSR+N+S LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ++ N L+A+CLLFGVFRA K +QS           VPML+YGSA 
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG

Query:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPI-SAIGSHSLSLSK
        SSVE  S+VPLLEFTP GHG+HDE NAV + + IT GNT     ++KD+DSTIQRLLLEFGSQKP E DV+ALTT AQIK+QEPAPI +A GS+SLS SK
Subjt:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPI-SAIGSHSLSLSK

Query:  VKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        VK EP+ VIKEE S D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD P+RVADKYLQIFNAG+KKERR
Subjt:  VKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

TrEMBL top hitse value%identityAlignment
A0A6J1E6T7 uncharacterized protein LOC111431323 isoform X32.8e-21772.68Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        MP+KS DVP+ WLCGNCTLDEAKSPDDSG  VQPKMLRHAK  KVKFLPTEEV KLSSGGMKGPSK N A  PQRT KSRK FESS+PRP FQA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS AT+PP +C + KQAL TCLP     PVQTLKKA  K+ DI A  SSVS HGLPVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KLED QKQI   LM QEI  YRD LPSLHASWKGGFQF+D  M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG
        DIALYFFP+ NIERSR+NSS LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ +   +N L+A+CLLFGVFRAIK +QS           VPML+YG
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG

Query:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL
        SA SSVE  S+VPLLEFTP GHG+HDE NAV +   IT GNT     ++KD+DSTIQRLLLEFGSQK  E DV+ALTT AQIK+QEPAPI+A GS+SLS 
Subjt:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL

Query:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        SKVK EP+ VIKEE S D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD P+RVADKYLQIFNAG+KKERR
Subjt:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

A0A6J1EAB4 uncharacterized protein LOC111431323 isoform X12.8e-21772.68Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        MP+KS DVP+ WLCGNCTLDEAKSPDDSG  VQPKMLRHAK  KVKFLPTEEV KLSSGGMKGPSK N A  PQRT KSRK FESS+PRP FQA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS AT+PP +C + KQAL TCLP     PVQTLKKA  K+ DI A  SSVS HGLPVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KLED QKQI   LM QEI  YRD LPSLHASWKGGFQF+D  M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG
        DIALYFFP+ NIERSR+NSS LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ +   +N L+A+CLLFGVFRAIK +QS           VPML+YG
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG

Query:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL
        SA SSVE  S+VPLLEFTP GHG+HDE NAV +   IT GNT     ++KD+DSTIQRLLLEFGSQK  E DV+ALTT AQIK+QEPAPI+A GS+SLS 
Subjt:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL

Query:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        SKVK EP+ VIKEE S D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD P+RVADKYLQIFNAG+KKERR
Subjt:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

A0A6J1ECK6 uncharacterized protein LOC111431323 isoform X25.6e-21873.06Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        MP+KS DVP+ WLCGNCTLDEAKSPDDSG  VQPKMLRHAK  KVKFLPTEEV KLSSGGMKGPSK N A  PQRT KSRK FESS+PRP FQA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS AT+PP +C + KQAL TCLP     PVQTLKKA  K+ DI A  SSVS HGLPVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KLED QKQI   LM QEI  YRD LPSLHASWKGGFQF+D  M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG
        DIALYFFP+ NIERSR+NSS LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ++ N L+A+CLLFGVFRAIK +QS           VPML+YGSA 
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG

Query:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSLSKV
        SSVE  S+VPLLEFTP GHG+HDE NAV +   IT GNT     ++KD+DSTIQRLLLEFGSQK  E DV+ALTT AQIK+QEPAPI+A GS+SLS SKV
Subjt:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSLSKV

Query:  KTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        K EP+ VIKEE S D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD P+RVADKYLQIFNAG+KKERR
Subjt:  KTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

A0A6J1KLX3 uncharacterized protein LOC111495374 isoform X31.1e-21371.45Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        M +KS +VP+ WLCGNCTLDEAKSPDDSG  VQPKM RHAK  KVKFLPTEEV KLSSG MKGPSK N A  PQRT KSRK FESS+PRP FQA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS AT+PP +C V KQALATCLP     PVQTLKKA  K+ DI A  SSVS HG PVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KL+D QKQ+   LM QEI  YRD LPSLHASWKGGFQF+DT M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG
        DIALYFFP+ N ERSR+N+S LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ +   +N L+A+CLLFGVFRAIK ++S           VPML+YG
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYI---VNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYG

Query:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL
        SA SSVE  S+VPLLEFTP GHG+HDE NAV + + IT GNT     ++KD+DSTI+RLLLEFGSQKP E DV+ALTT AQIK+QEPAPI+A G +SLS 
Subjt:  SAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSL

Query:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        SKVK EP+ VIKEE S D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD PKRVADKYLQIFNAG+KKERR
Subjt:  SKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

A0A6J1KPN9 uncharacterized protein LOC111495374 isoform X22.2e-21471.83Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE
        M +KS +VP+ WLCGNCTLDEAKSPDDSG  VQPKM RHAK  KVKFLPTEEV KLSSG MKGPSK N A  PQRT KSRK FESS+PRP FQA KESQE
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQE

Query:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS
        RS AT+PP +C V KQALATCLP     PVQTLKKA  K+ DI A  SSVS HG PVT TGKEVPSPST+LED QKQ K+  +            KEVPS
Subjt:  RSPATMPPKVCDVNKQALATCLP-----PVQTLKKA--KIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLM------------KEVPS

Query:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA
           KL+D QKQ+   LM QEI  YRD LPSLHASWKGGFQF+DT M GEFYDGFLAKPPCVVHGRAYELSRKIP ILQVKLLSRSDIWD+LF D+ PDLA
Subjt:  TSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLA

Query:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG
        DIALYFFP+ N ERSR+N+S LFELMER+DLL+ SLIDGAELVVFT RQLDL+SQ++ N L+A+CLLFGVFRAIK ++S           VPML+YGSA 
Subjt:  DIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAG

Query:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSLSKV
        SSVE  S+VPLLEFTP GHG+HDE NAV + + IT GNT     ++KD+DSTI+RLLLEFGSQKP E DV+ALTT AQIK+QEPAPI+A G +SLS SKV
Subjt:  SSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTA----SSKDIDSTIQRLLLEFGSQKPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSLSKV

Query:  KTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR
        K EP+ VIKEE S D+KCLE E+ SRMAPTFSIDGSQ+RTGL DQD PKRVADKYLQIFNAG+KKERR
Subjt:  KTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43770.2 RING/FYVE/PHD zinc finger superfamily protein2.3e-0627.05Show/hide
Query:  DGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLF-QDDRPDLADIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQL
        DG +A    +   + +E +  +   L  ++L R ++W   F ++  P    +AL+FFPS+      +    L + M++ D  M  +++ AEL++FTS  L
Subjt:  DGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLF-QDDRPDLADIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQL

Query:  DLNSQYIVNRLNAECLLFGVFR
          +S       N++  L+GVF+
Subjt:  DLNSQYIVNRLNAECLLFGVFR

AT3G02890.1 RING/FYVE/PHD zinc finger superfamily protein1.0e-0926.97Show/hide
Query:  LPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLADIALYFFPSANIERSRRNSSCLFELME
        +P     W+G  +   +      + G  A    +   +  E+ ++ P  + +  + R   W   FQD       +AL+FF + +IE   +N   L + M 
Subjt:  LPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLADIALYFFPSANIERSRRNSSCLFELME

Query:  RKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQS
        +KDL +   ++G EL++F S QL  + Q    R N    L+GVFR  K++ S
Subjt:  RKDLLMISLIDGAELVVFTSRQLDLNSQYIVNRLNAECLLFGVFRAIKDNQS

AT5G61090.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.1e-1524.58Show/hide
Query:  LAPQRTSKSRKAFESSMPRPLFQAPKESQERSPATMPPKVCDVNKQALATCLPPVQTLKKAKIIDIYA---CTSSVSTHGLPVTNTGKEVPSPSTR---L
        L P R + +  +  S+ P  +  +P      S A  P   C  ++ +      P   +++  +  + A    T+ V     P     K+V  PS R   +
Subjt:  LAPQRTSKSRKAFESSMPRPLFQAPKESQERSPATMPPKVCDVNKQALATCLPPVQTLKKAKIIDIYA---CTSSVSTHGLPVTNTGKEVPSPSTR---L

Query:  EDTQKQMKDGLMKEVPSTSNKLEDKQKQIGD-----VLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQ
            KQ      K       ++ DK K  G       +  +E+     YLP+ + +W G  + +D+    EF   F +KP   +  +A   S+ +P +L+
Subjt:  EDTQKQMKDGLMKEVPSTSNKLEDKQKQIGD-----VLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQ

Query:  VKLLSRSDIWDDLFQDDRPDLADIALYFFP-SANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVN-RLNAECLLFGVFRAIKD
        V+LL    I +D+     P L ++ +Y FP     ER     + LF+ M  + ++  + I+G EL++F+S+ LD  SQ+++N +   E  L+G F   K+
Subjt:  VKLLSRSDIWDDLFQDDRPDLADIALYFFP-SANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVN-RLNAECLLFGVFRAIKD

Query:  N
        +
Subjt:  N

AT5G61120.1 BEST Arabidopsis thaliana protein match is: Polynucleotidyl transferase, ribonuclease H-like superfamily protein (TAIR:AT5G61090.1)1.9e-2123.77Show/hide
Query:  MPSKSHDVPEFWLCGNCTLDEAKSPDDS-GLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQ
        M   S +    ++C +C++   ++   S  ++  P+ +R+    KV+   +  V K     +   S+  + ++P+   K   A   S  +P F+ P+   
Subjt:  MPSKSHDVPEFWLCGNCTLDEAKSPDDS-GLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQ

Query:  ERSPATMPPKVCDVNKQALATCLPPVQTLKKAKIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLMKEVPSTSNKLEDKQKQIGD----
         R P  +                 P     +A+ ++         +  LP        P     L    +Q++ G+M++        E  + ++GD    
Subjt:  ERSPATMPPKVCDVNKQALATCLPPVQTLKKAKIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLMKEVPSTSNKLEDKQKQIGD----

Query:  --------VLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLADIALYF
                 ++++++     Y P+LH  WKG  + VD+    EF   FLA+P   V G+AY LS+ IP +L+VKL+   ++   LF + +P L+D+ +Y 
Subjt:  --------VLMAQEIQTYRDYLPSLHASWKGGFQFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLADIALYF

Query:  FP-SANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVN-RLNAECLLFGVFRAIK---------DNQSPFH
        FP   N +R       +FE M  ++ +M   I+G  L++F+S+ LD +SQ I+  +      L+G+F   K          NQ+P H
Subjt:  FP-SANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQLDLNSQYIVN-RLNAECLLFGVFRAIK---------DNQSPFH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAGTAAAAGTCATGATGTCCCCGAATTCTGGCTTTGTGGTAATTGTACATTGGATGAGGCCAAAAGTCCTGATGATTCAGGGCTGCAGGTTCAACCTAAAATGCT
AAGGCATGCTAAAGCTTGTAAAGTGAAGTTCTTGCCTACTGAAGAAGTAATAAAGCTATCATCAGGAGGGATGAAGGGACCTTCCAAGTTCAACATAGCTCTTGCACCAC
AAAGAACATCCAAGTCCCGTAAAGCCTTTGAGAGTTCTATGCCCCGGCCGCTTTTCCAAGCACCAAAAGAATCCCAAGAACGAAGTCCAGCTACGATGCCTCCAAAGGTA
TGTGATGTAAACAAACAAGCATTAGCTACTTGTTTACCTCCAGTACAAACTTTAAAAAAGGCGAAGATTATAGACATATATGCTTGCACTTCTTCTGTGTCAACGCATGG
TTTACCTGTCACAAATACAGGAAAAGAAGTTCCTTCACCTTCCACTAGGTTGGAAGATACACAAAAACAAATGAAGGATGGTTTGATGAAAGAAGTTCCTTCAACTTCCA
ATAAGTTGGAGGATAAGCAAAAACAAATTGGGGATGTTTTGATGGCACAGGAAATACAAACATACCGTGACTATCTCCCATCATTACATGCCTCTTGGAAGGGAGGCTTC
CAATTTGTTGATACACATATGGCTGGTGAATTCTATGATGGTTTCCTGGCAAAGCCTCCATGTGTAGTACATGGTAGAGCGTATGAATTGTCACGAAAGATTCCTACAAT
TCTTCAAGTGAAGCTGCTTAGTCGTTCTGATATTTGGGATGATCTATTTCAGGATGATCGTCCTGATCTTGCTGACATTGCCTTGTACTTCTTTCCCTCCGCCAATATTG
AAAGGTCCAGAAGGAACAGCTCTTGCCTGTTTGAACTTATGGAGAGGAAAGATTTATTGATGATAAGTCTTATTGACGGTGCAGAGTTAGTCGTATTTACATCTAGACAG
CTGGATCTCAACTCTCAGTATATTGTGAATAGGTTAAATGCTGAATGCCTCCTTTTCGGAGTCTTCCGTGCTATAAAAGACAATCAGTCTCCTTTTCATAATCTTCGAGA
ATGGACTGCAGCAGTTCCTATGTTAGATTATGGTTCTGCAGGTTCTTCAGTAGAATGTGCTTCCAGAGTTCCCCTGCTGGAATTCACGCCCAACGGACATGGAGAGCACG
ATGAAGGCAATGCTGTTAACAAGGAAATGGTCATCACAAGTGGAAACACTGCTTCATCAAAGGATATAGACTCTACCATTCAGCGATTACTATTAGAATTTGGATCACAA
AAACCCGGGGAATATGATGTCAGTGCATTAACTACGAATGCTCAAATAAAAAATCAGGAACCTGCTCCAATCTCAGCAATTGGCTCCCATTCTCTTTCCCTATCAAAAGT
AAAGACCGAACCTTTATCAGTTATTAAAGAGGAAGGAAGCGATGACAAAAAATGCTTGGAGCCGGAGAATTTCTCGAGAATGGCGCCGACATTTAGTATCGATGGTTCTC
AAAGCAGGACTGGTTTAGTTGACCAAGATGCTCCCAAGAGAGTTGCAGACAAGTATCTCCAAATCTTCAATGCAGGGGTTAAAAAGGAACGTCGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCCAGTAAAAGTCATGATGTCCCCGAATTCTGGCTTTGTGGTAATTGTACATTGGATGAGGCCAAAAGTCCTGATGATTCAGGGCTGCAGGTTCAACCTAAAATGCT
AAGGCATGCTAAAGCTTGTAAAGTGAAGTTCTTGCCTACTGAAGAAGTAATAAAGCTATCATCAGGAGGGATGAAGGGACCTTCCAAGTTCAACATAGCTCTTGCACCAC
AAAGAACATCCAAGTCCCGTAAAGCCTTTGAGAGTTCTATGCCCCGGCCGCTTTTCCAAGCACCAAAAGAATCCCAAGAACGAAGTCCAGCTACGATGCCTCCAAAGGTA
TGTGATGTAAACAAACAAGCATTAGCTACTTGTTTACCTCCAGTACAAACTTTAAAAAAGGCGAAGATTATAGACATATATGCTTGCACTTCTTCTGTGTCAACGCATGG
TTTACCTGTCACAAATACAGGAAAAGAAGTTCCTTCACCTTCCACTAGGTTGGAAGATACACAAAAACAAATGAAGGATGGTTTGATGAAAGAAGTTCCTTCAACTTCCA
ATAAGTTGGAGGATAAGCAAAAACAAATTGGGGATGTTTTGATGGCACAGGAAATACAAACATACCGTGACTATCTCCCATCATTACATGCCTCTTGGAAGGGAGGCTTC
CAATTTGTTGATACACATATGGCTGGTGAATTCTATGATGGTTTCCTGGCAAAGCCTCCATGTGTAGTACATGGTAGAGCGTATGAATTGTCACGAAAGATTCCTACAAT
TCTTCAAGTGAAGCTGCTTAGTCGTTCTGATATTTGGGATGATCTATTTCAGGATGATCGTCCTGATCTTGCTGACATTGCCTTGTACTTCTTTCCCTCCGCCAATATTG
AAAGGTCCAGAAGGAACAGCTCTTGCCTGTTTGAACTTATGGAGAGGAAAGATTTATTGATGATAAGTCTTATTGACGGTGCAGAGTTAGTCGTATTTACATCTAGACAG
CTGGATCTCAACTCTCAGTATATTGTGAATAGGTTAAATGCTGAATGCCTCCTTTTCGGAGTCTTCCGTGCTATAAAAGACAATCAGTCTCCTTTTCATAATCTTCGAGA
ATGGACTGCAGCAGTTCCTATGTTAGATTATGGTTCTGCAGGTTCTTCAGTAGAATGTGCTTCCAGAGTTCCCCTGCTGGAATTCACGCCCAACGGACATGGAGAGCACG
ATGAAGGCAATGCTGTTAACAAGGAAATGGTCATCACAAGTGGAAACACTGCTTCATCAAAGGATATAGACTCTACCATTCAGCGATTACTATTAGAATTTGGATCACAA
AAACCCGGGGAATATGATGTCAGTGCATTAACTACGAATGCTCAAATAAAAAATCAGGAACCTGCTCCAATCTCAGCAATTGGCTCCCATTCTCTTTCCCTATCAAAAGT
AAAGACCGAACCTTTATCAGTTATTAAAGAGGAAGGAAGCGATGACAAAAAATGCTTGGAGCCGGAGAATTTCTCGAGAATGGCGCCGACATTTAGTATCGATGGTTCTC
AAAGCAGGACTGGTTTAGTTGACCAAGATGCTCCCAAGAGAGTTGCAGACAAGTATCTCCAAATCTTCAATGCAGGGGTTAAAAAGGAACGTCGCTAG
Protein sequenceShow/hide protein sequence
MPSKSHDVPEFWLCGNCTLDEAKSPDDSGLQVQPKMLRHAKACKVKFLPTEEVIKLSSGGMKGPSKFNIALAPQRTSKSRKAFESSMPRPLFQAPKESQERSPATMPPKV
CDVNKQALATCLPPVQTLKKAKIIDIYACTSSVSTHGLPVTNTGKEVPSPSTRLEDTQKQMKDGLMKEVPSTSNKLEDKQKQIGDVLMAQEIQTYRDYLPSLHASWKGGF
QFVDTHMAGEFYDGFLAKPPCVVHGRAYELSRKIPTILQVKLLSRSDIWDDLFQDDRPDLADIALYFFPSANIERSRRNSSCLFELMERKDLLMISLIDGAELVVFTSRQ
LDLNSQYIVNRLNAECLLFGVFRAIKDNQSPFHNLREWTAAVPMLDYGSAGSSVECASRVPLLEFTPNGHGEHDEGNAVNKEMVITSGNTASSKDIDSTIQRLLLEFGSQ
KPGEYDVSALTTNAQIKNQEPAPISAIGSHSLSLSKVKTEPLSVIKEEGSDDKKCLEPENFSRMAPTFSIDGSQSRTGLVDQDAPKRVADKYLQIFNAGVKKERR