CuGenDBv2

Lag0010455 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010455
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: chr9:47381875..47384350
RNA-Seq Expression: Lag0010455
Synteny: Lag0010455
Gene Ontology terms: NA
InterPro domains: IPR004314 - Neprosin; IPR025521 - Neprosin activation peptide
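The genome location field uses a `chrom:start..end` layout. A minimal sketch of splitting it into parts and computing the span covered by the gene follows. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive at both ends, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Number of bases covered by the interval."""
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("chr9:47381875..47384350")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the span covers the whole gene model, so it can be longer than the CDS listed under Sequences if the gene contains introns or untranslated regions.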


Homology
GenBank top hits (e value, % identity, alignment)
KAG6605833.1 hypothetical protein SDJN03_03150, partial [Cucurbita argyrosperma subsp. sororia] | e value: 8.3e-240 | % identity: 95.25
Query:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD
        M PAQIGR   AGRA VLVF LW LICLS AARLSPSMQ+LEVQKHLRRLNKPPLKTI+S DGDIIDCVHISNQPAFDHPFLKDHKI TRPT+HPEGLFD
Subjt:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD

Query:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
        ENK+SEKPKERTNPINQLWH NGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
Subjt:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ

Query:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE
        Q NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFR+PQYDISILIWKDPSE
Subjt:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE

Query:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
        GHWWMQFGNDYVLGYWPSFLFSYLADSASM+EWGGEVVNSE DGLHTSTQMGSGHFPEEGFGK+SYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
Subjt:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG

Query:  SNGDWGHYFYYGGPGRNQNCP
        SNGDWGHYFYYGGPGRNQNCP
Subjt:  SNGDWGHYFYYGGPGRNQNCP

XP_022153819.1 uncharacterized protein LOC111021244 [Momordica charantia] | e value: 5.7e-241 | % identity: 95
Query:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE
        MGPA  GR  AGRALVLVFCLWGLI LS AARLSPSM  LEVQKHLRR NKPPLKTIQS DGDIIDCVHISNQPAFDHPFLKDHKIQTRPT+HPEGLFDE
Subjt:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE

Query:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NK+SEKPKERTNPI QLWH NGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHR++PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG
         NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFR+PQYDISILIWKDP+EG
Subjt:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSE DGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

XP_022958627.1 uncharacterized protein LOC111459796 [Cucurbita moschata] | e value: 1.4e-239 | % identity: 95.01
Query:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD
        M PAQIGR   AGRA VLVF LW LICLS AARLSPSMQ+LEVQKHLRR+NKPPLKTI+S DGDIIDCVHISNQPAFDHPFLKDHKI TRPT+HPEGLFD
Subjt:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD

Query:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
        ENK+SEKPKERTNPINQLWH NGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
Subjt:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ

Query:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE
        Q NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFR+PQYDISILIWKDPSE
Subjt:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE

Query:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
        GHWWMQFGNDYVLGYWPSFLFSYLADSASM+EWGGEVVNSE DGLHTSTQMGSGHFPEEGFGK+SYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
Subjt:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG

Query:  SNGDWGHYFYYGGPGRNQNCP
        SNGDWGHYFYYGGPGRNQNCP
Subjt:  SNGDWGHYFYYGGPGRNQNCP

XP_023533510.1 uncharacterized protein LOC111795366 [Cucurbita pepo subsp. pepo] | e value: 3.1e-239 | % identity: 95.01
Query:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD
        M PAQIGR   AGRA VLVF LW LICLS AARLSPSMQ+LEVQKHLRRLNKPPLKTI+S DGDIIDCVHISNQPAFDHPFLKDHKI TRPT+HPEGLFD
Subjt:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD

Query:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
        E+K+SEKPKERTNPINQLWH NGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
Subjt:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ

Query:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE
        Q NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFR+PQYDISILIWKDPSE
Subjt:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE

Query:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
        GHWWMQFGNDYVLGYWPSFLFSYLADSASM+EWGGEVVNSE DGLHTSTQMGSGHFPEEGFGK+SYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
Subjt:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG

Query:  SNGDWGHYFYYGGPGRNQNCP
        SNGDWGHYFYYGGPGRNQNCP
Subjt:  SNGDWGHYFYYGGPGRNQNCP

XP_038901449.1 uncharacterized protein LOC120088312 [Benincasa hispida] | e value: 2.4e-239 | % identity: 94.29
Query:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE
        M P+QIGR  AGR L+ +F LW LICLS +ARLSPSMQ LEVQKHLRRLNKPPLKTIQS DGDIIDCVHISNQPAFDHPFLKDHKI TRPT+HPEGLFDE
Subjt:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE

Query:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NK+SEKPKERTNPINQLWH NGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHR+IPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG
         NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVS FR+PQYDISILIWKDP+EG
Subjt:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHT TQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWZ7 Uncharacterized protein | e value: 4.1e-237 | % identity: 93.57
Query:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE
        MGPAQI R  AGRAL+L+F LW LI LS A+RLSPSMQ LEVQKHLRRLNKPPLKTIQS DGDIIDCVHISNQPAFDHPFLKDHKIQTRPT+HPEGLFDE
Subjt:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE

Query:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NK+SEKPKE +NPINQLWH NGRCPENTIP+RRTKEDDVLRASS KRYGKKRHR+IPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG
         NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQV+S+IAMGASISPVS FRN QYDISILIWKDP+EG
Subjt:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHT TQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

A0A5D3DDU5 Uncharacterized protein | e value: 1.4e-237 | % identity: 94.05
Query:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE
        MGPAQI R  AGRAL+L+F LW +I LS AARLSPSMQ LEVQKHLRRLNKPPLKTIQS DGDIIDCVHISNQPAFDHPFLKDHKIQTRPT+HPEGLFDE
Subjt:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE

Query:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NK+SEKPKE TNPINQLWH NGRCPENTIPIRRTKEDDVLRASS KRYGKKRHR+IPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG
         NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNS+IAMGASISPVS FR+ QYDISILIWKDP+EG
Subjt:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHT TQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

A0A6J1DK76 uncharacterized protein LOC111021244 | e value: 2.8e-241 | % identity: 95
Query:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE
        MGPA  GR  AGRALVLVFCLWGLI LS AARLSPSM  LEVQKHLRR NKPPLKTIQS DGDIIDCVHISNQPAFDHPFLKDHKIQTRPT+HPEGLFDE
Subjt:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE

Query:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        NK+SEKPKERTNPI QLWH NGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHR++PQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
Subjt:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG
         NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFR+PQYDISILIWKDP+EG
Subjt:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
        HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSE DGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTE+SNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        NGDWGHYFYYGGPGRNQNCP
Subjt:  NGDWGHYFYYGGPGRNQNCP

A0A6J1H2C7 uncharacterized protein LOC111459796 | e value: 6.8e-240 | % identity: 95.01
Query:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD
        M PAQIGR   AGRA VLVF LW LICLS AARLSPSMQ+LEVQKHLRR+NKPPLKTI+S DGDIIDCVHISNQPAFDHPFLKDHKI TRPT+HPEGLFD
Subjt:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD

Query:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
        ENK+SEKPKERTNPINQLWH NGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
Subjt:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ

Query:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE
        Q NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFR+PQYDISILIWKDPSE
Subjt:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE

Query:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
        GHWWMQFGNDYVLGYWPSFLFSYLADSASM+EWGGEVVNSE DGLHTSTQMGSGHFPEEGFGK+SYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
Subjt:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG

Query:  SNGDWGHYFYYGGPGRNQNCP
        SNGDWGHYFYYGGPGRNQNCP
Subjt:  SNGDWGHYFYYGGPGRNQNCP

A0A6J1K8Y4 uncharacterized protein LOC111491194 | e value: 2.6e-239 | % identity: 94.77
Query:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD
        M PAQIGR   AGRA VLVF LW LICLS AARLSPS Q+LEVQKHLRRLNKPPLKTI+SSDGDIIDCVHISNQPAFDHPF+KDHKI TRPT+HPEGLFD
Subjt:  MGPAQIGR-AMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFD

Query:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
        ENK+SEKPKERTNPINQLWH NGRCPENTIPIRRTK+DDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ
Subjt:  ENKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQ

Query:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE
        Q NEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFR+PQYDISILIWKDPSE
Subjt:  QSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSE

Query:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
        GHWWMQFGNDYVLGYWPSFLFSYLADSASM+EWGGEVVNSE DGLHTSTQMGSGHFPEEGFGK+SYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG
Subjt:  GHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTG

Query:  SNGDWGHYFYYGGPGRNQNCP
        SNGDWGHYFYYGGPGRNQNCP
Subjt:  SNGDWGHYFYYGGPGRNQNCP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G55360.1 Protein of Unknown Function (DUF239) | e value: 4.0e-208 | % identity: 80.24
Query:  GPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDEN
        G   +  A   R  ++  CLWG   LS AAR   S QK EV+KHL RLNKP +K+IQSSDGD+IDCV IS QPAFDHPFLKDHKIQ +P +HPEGLFD+N
Subjt:  GPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDEN

Query:  KMS-EKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        K+S  K  E+   I QLWH  G+C E TIP+RRTKEDDVLRASS KRYGKK+ RS+P P+SA+PDLINQSGHQHAIAYVEGDK+YGAKATINVWEPKIQQ
Subjt:  KMS-EKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG
         NEFSLSQ+W+LGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQ+NS+IAMGASISPVS +RN QYDISILIWKDP EG
Subjt:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
        HWWMQFGN YVLGYWPSFLFSYL +SASMIEWGGEVVNS++DG HTSTQMGSG FPEEGF KASYFRNIQVVD SNNLKAPKG+GTFTEQSNCYDVQTGS
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNCP
        N DWGHYFYYGGPG+NQ CP
Subjt:  NGDWGHYFYYGGPGRNQNCP

AT2G44210.1 Protein of Unknown Function (DUF239) | e value: 1.4e-160 | % identity: 65.62
Query:  LEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDENKMSEKPK-ERTNPINQLWHENGRCPENTIPIRRTKEDD
        L+++ HL+RLNKP LK+I+S DGD+IDCV I++QPAF HP L +H +Q  P+ +PE +F E+K+S K K +++N I+QLWH NG+CP+NTIPIRRT+  D
Subjt:  LEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDENKMSEKPK-ERTNPINQLWHENGRCPENTIPIRRTKEDD

Query:  VLRASSAKRYGKKRHRSIPQPRSAD-PDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDN
        + RASS + YG K  +SIP+P+S++ P+++ Q+GHQHAI YVE   FYGAKA INVW+P ++  NEFSL+Q+W+LGG+F  DLNSIEAGWQVSP LYGDN
Subjt:  VLRASSAKRYGKKRHRSIPQPRSAD-PDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDN

Query:  NTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVV
         TRLFTYWTSDAYQ TGCYNLLCSGF+Q+N EIAMG SISP+S + N QYDI+ILIWKDP EGHWW+QFG  Y++GYWP+ LFSYL++SASMIEWGGEVV
Subjt:  NTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVV

Query:  NSEA-DGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGSNGDWGHYFYYGGPGRNQNCP
        NS++ +G HT+TQMGSG F EEG+GKASYF+N+QVVD SN L+ P+ +  FT+Q NCY+V++G+ G WG YFYYGGPGRN NCP
Subjt:  NSEA-DGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGSNGDWGHYFYYGGPGRNQNCP

AT3G13510.1 Protein of Unknown Function (DUF239) | e value: 2.0e-207 | % identity: 82.09
Query:  CLWGLICLSC-AARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDENKMSEKPKERTNPINQLW
        CLW ++ LSC AA    S QK EV+KHL RLNKPP+KTIQS DGDIIDC+ IS QPAFDHPFLKDHKIQ RP++HPEGLFD+NK+S +PK +   I QLW
Subjt:  CLWGLICLSC-AARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDENKMSEKPKERTNPINQLW

Query:  HENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQSNEFSLSQLWILGGSFGQ
        H  G+C E TIP+RRT+EDDVLRASS KRYGKK+HRS+P P+SA+PDLINQ+GHQHAIAYVEGDK+YGAKAT+NVWEPKIQ +NEFSLSQ+W+LGGSFGQ
Subjt:  HENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQSNEFSLSQLWILGGSFGQ

Query:  DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEGHWWMQFGNDYVLGYWPSF
        DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQ+NS+IAMGASISPVS +RN QYDISILIWKDP EGHWWMQFGN YVLGYWPSF
Subjt:  DLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEGHWWMQFGNDYVLGYWPSF

Query:  LFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGSNGDWGHYFYYGGPGRNQN
        LFSYL +SASMIEWGGEVVNS+++G HT TQMGSGHFPEEGF KASYFRNIQVVD SNNLKAPKG+GTFTE+SNCYDVQTGSN DWGHYFYYGGPG+N+N
Subjt:  LFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGSNGDWGHYFYYGGPGRNQN

Query:  CP
        CP
Subjt:  CP

AT5G56530.1 Protein of Unknown Function (DUF239) | e value: 4.0e-208 | % identity: 79.95
Query:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE
        M  A   +    R  ++ FC WGL+ L+CA RLS S Q  EV KHL RLNKP +K+IQS DGDIIDCVHIS QPAFDHPFLKDHKIQ  P++ PE LF E
Subjt:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE

Query:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        +K+SEKPKE  NPI QLWH+NG C E TIP+RRTK++DVLRASS KRYGKK+H S+P PRSADPDLINQSGHQHAIAYVEG KFYGAKATINVWEPK+Q 
Subjt:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG
        SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQ+NS+IAMGASISPVS F NPQYDISI IWKDP EG
Subjt:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
        HWWMQFG+ YVLGYWPSFLFSYLADSAS++EWGGEVVN E DG HT+TQMGSG FP+EGF KASYFRNIQVVDSSNNLK PKG+ TFTE+SNCYDV+ G 
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNC
        N DWGHYFYYGGPGRN NC
Subjt:  NGDWGHYFYYGGPGRNQNC

AT5G56530.2 Protein of Unknown Function (DUF239) | e value: 4.0e-208 | % identity: 79.95
Query:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE
        M  A   +    R  ++ FC WGL+ L+CA RLS S Q  EV KHL RLNKP +K+IQS DGDIIDCVHIS QPAFDHPFLKDHKIQ  P++ PE LF E
Subjt:  MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDE

Query:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ
        +K+SEKPKE  NPI QLWH+NG C E TIP+RRTK++DVLRASS KRYGKK+H S+P PRSADPDLINQSGHQHAIAYVEG KFYGAKATINVWEPK+Q 
Subjt:  NKMSEKPKERTNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQ

Query:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG
        SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQ+NS+IAMGASISPVS F NPQYDISI IWKDP EG
Subjt:  SNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEG

Query:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS
        HWWMQFG+ YVLGYWPSFLFSYLADSAS++EWGGEVVN E DG HT+TQMGSG FP+EGF KASYFRNIQVVDSSNNLK PKG+ TFTE+SNCYDV+ G 
Subjt:  HWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGS

Query:  NGDWGHYFYYGGPGRNQNC
        N DWGHYFYYGGPGRN NC
Subjt:  NGDWGHYFYYGGPGRNQNC
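The alignments above follow the usual BLAST-style layout, in which the middle line repeats a residue letter where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. A minimal sketch of estimating percent identity from that middle line is given below. The function name `percent_identity` is hypothetical, and the sketch assumes identity is defined as identical columns divided by alignment length (gap columns included), so small differences from the values reported in the hit tables are possible.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline entry is a residue letter."""
    aligned = len(query)  # alignment length, including any '-' gap columns
    # Trailing blanks of the midline are often stripped; pad it back out.
    padded = midline.ljust(aligned)[:aligned]
    # Letters mark identical residues; '+' and ' ' are not identities.
    identical = sum(ch.isalpha() for ch in padded)
    return 100.0 * identical / aligned


# First eight columns of the first GenBank hit, used only as a toy input.
print(percent_identity("MGPAQIGR", "M PAQIGR"))
```

For a full hit, the query blocks and the midline blocks would each be concatenated (without the `Query:` prefix and leading indentation) before calling the function.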


Sequences
CDS sequence
ATGGGTCCTGCTCAGATTGGCAGAGCAATGGCTGGGAGAGCTCTGGTACTGGTTTTCTGTTTATGGGGTCTGATCTGTCTGTCCTGCGCCGCTCGATTATCCCCTTCCAT
GCAGAAGCTTGAAGTTCAGAAGCACCTCAGGCGCTTAAACAAGCCCCCATTGAAAACAATTCAGAGTTCAGATGGGGATATCATCGACTGTGTCCACATTTCTAATCAAC
CTGCTTTTGATCATCCTTTCCTCAAAGATCACAAAATTCAGACGAGACCTACTTTCCACCCAGAAGGGCTATTTGATGAGAACAAGATGTCTGAAAAACCTAAAGAAAGA
ACAAACCCCATCAATCAGCTGTGGCATGAAAATGGAAGGTGCCCAGAAAACACAATTCCTATTAGGAGAACCAAGGAAGATGATGTTCTGAGAGCAAGCTCTGCCAAAAG
ATATGGAAAGAAAAGGCACAGAAGCATTCCTCAACCCAGATCTGCAGATCCTGATCTCATCAATCAAAGTGGTCATCAGCACGCAATAGCTTATGTGGAAGGAGATAAGT
TTTATGGAGCAAAGGCAACTATCAATGTATGGGAGCCCAAAATACAGCAGTCTAATGAGTTTAGCTTGTCACAGTTATGGATACTAGGAGGTTCTTTTGGTCAAGATCTC
AATAGTATTGAAGCTGGCTGGCAGGTCAGCCCAGACCTGTATGGCGATAACAACACAAGACTCTTCACTTACTGGACTAGCGATGCATATCAAGCCACAGGTTGTTATAA
CCTCCTCTGCTCGGGCTTTATTCAAGTTAACAGTGAAATAGCGATGGGGGCAAGTATCTCACCCGTGTCTGCATTTCGCAACCCCCAGTATGATATCAGTATACTTATCT
GGAAGGATCCAAGTGAGGGACACTGGTGGATGCAGTTTGGCAATGACTATGTGTTGGGATATTGGCCTTCTTTCTTATTCTCGTACCTGGCTGATAGTGCCTCCATGATA
GAGTGGGGAGGGGAGGTTGTGAATTCAGAGGCTGATGGACTGCACACCTCAACCCAGATGGGCAGTGGTCATTTTCCTGAAGAGGGGTTTGGGAAGGCAAGTTATTTCCG
GAACATTCAAGTTGTTGACAGTTCCAACAATCTCAAAGCTCCCAAAGGGATTGGTACTTTTACAGAGCAATCCAACTGCTATGATGTCCAAACTGGCAGCAATGGGGATT
GGGGCCATTACTTTTACTATGGAGGCCCTGGCAGAAACCAAAATTGCCCTTGA
mRNA sequence
ATGGGTCCTGCTCAGATTGGCAGAGCAATGGCTGGGAGAGCTCTGGTACTGGTTTTCTGTTTATGGGGTCTGATCTGTCTGTCCTGCGCCGCTCGATTATCCCCTTCCAT
GCAGAAGCTTGAAGTTCAGAAGCACCTCAGGCGCTTAAACAAGCCCCCATTGAAAACAATTCAGAGTTCAGATGGGGATATCATCGACTGTGTCCACATTTCTAATCAAC
CTGCTTTTGATCATCCTTTCCTCAAAGATCACAAAATTCAGACGAGACCTACTTTCCACCCAGAAGGGCTATTTGATGAGAACAAGATGTCTGAAAAACCTAAAGAAAGA
ACAAACCCCATCAATCAGCTGTGGCATGAAAATGGAAGGTGCCCAGAAAACACAATTCCTATTAGGAGAACCAAGGAAGATGATGTTCTGAGAGCAAGCTCTGCCAAAAG
ATATGGAAAGAAAAGGCACAGAAGCATTCCTCAACCCAGATCTGCAGATCCTGATCTCATCAATCAAAGTGGTCATCAGCACGCAATAGCTTATGTGGAAGGAGATAAGT
TTTATGGAGCAAAGGCAACTATCAATGTATGGGAGCCCAAAATACAGCAGTCTAATGAGTTTAGCTTGTCACAGTTATGGATACTAGGAGGTTCTTTTGGTCAAGATCTC
AATAGTATTGAAGCTGGCTGGCAGGTCAGCCCAGACCTGTATGGCGATAACAACACAAGACTCTTCACTTACTGGACTAGCGATGCATATCAAGCCACAGGTTGTTATAA
CCTCCTCTGCTCGGGCTTTATTCAAGTTAACAGTGAAATAGCGATGGGGGCAAGTATCTCACCCGTGTCTGCATTTCGCAACCCCCAGTATGATATCAGTATACTTATCT
GGAAGGATCCAAGTGAGGGACACTGGTGGATGCAGTTTGGCAATGACTATGTGTTGGGATATTGGCCTTCTTTCTTATTCTCGTACCTGGCTGATAGTGCCTCCATGATA
GAGTGGGGAGGGGAGGTTGTGAATTCAGAGGCTGATGGACTGCACACCTCAACCCAGATGGGCAGTGGTCATTTTCCTGAAGAGGGGTTTGGGAAGGCAAGTTATTTCCG
GAACATTCAAGTTGTTGACAGTTCCAACAATCTCAAAGCTCCCAAAGGGATTGGTACTTTTACAGAGCAATCCAACTGCTATGATGTCCAAACTGGCAGCAATGGGGATT
GGGGCCATTACTTTTACTATGGAGGCCCTGGCAGAAACCAAAATTGCCCTTGA
Protein sequence
MGPAQIGRAMAGRALVLVFCLWGLICLSCAARLSPSMQKLEVQKHLRRLNKPPLKTIQSSDGDIIDCVHISNQPAFDHPFLKDHKIQTRPTFHPEGLFDENKMSEKPKER
TNPINQLWHENGRCPENTIPIRRTKEDDVLRASSAKRYGKKRHRSIPQPRSADPDLINQSGHQHAIAYVEGDKFYGAKATINVWEPKIQQSNEFSLSQLWILGGSFGQDL
NSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNPQYDISILIWKDPSEGHWWMQFGNDYVLGYWPSFLFSYLADSASMI
EWGGEVVNSEADGLHTSTQMGSGHFPEEGFGKASYFRNIQVVDSSNNLKAPKGIGTFTEQSNCYDVQTGSNGDWGHYFYYGGPGRNQNCP
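Because the record lists both a CDS and a protein sequence, a downloaded copy can be sanity-checked by translating the CDS and comparing the result with the protein. The sketch below assumes the standard nuclear genetic code and an unambiguous A/C/G/T sequence; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on non-ACGT codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGGTCCTGCTCAGATTGGC"))
```

Passing the full CDS (line breaks are tolerated) and comparing the return value with the protein sequence, joined into a single string, gives a quick consistency check.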