CuGenDBv2

Sgr015373 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015373
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00003469:766784..815281
RNA-Seq Expression: Sgr015373
Synteny: Sgr015373
Gene Ontology terms: NA
InterPro domains: NA
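
The genome location above follows a `contig:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and derives the span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration, and the 1-based inclusive coordinate convention is an assumption that this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


# contig name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'tig00003469:766784..815281'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["contig"], start, end)


if __name__ == "__main__":
    loc = parse_location("tig00003469:766784..815281")
    print(loc.contig, loc.start, loc.end, loc.length)
```

If the database instead uses 0-based or half-open coordinates, only the `length` property would need to change.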


Homology
GenBank top hits (e value, % identity, alignment)
XP_022158445.1 uncharacterized protein LOC111024932 isoform X1 [Momordica charantia]; e value: 0.0e+00; % identity: 91.76
Query:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKY-KVDESLDRRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAA PEQ NSHSSNNHS+SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKI+ELEKY KVDESL RRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKY-KVDESLDRRFHSTDQWNE

Query:  NENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRF
        N+   HGSNGGHQSDNSVDNERH FKNNIS VDSHGTLVVH+ VEQKDEVSMRID E R+ + KSD IVNALPGVQP VDNAG SQFSSPSTTSFSASRF
Subjt:  NENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRF

Query:  PVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD
        PVDGEYDP+IKLSGHGLM KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD
Subjt:  PVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD

Query:  IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT
        IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLT
Subjt:  IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT

Query:  TSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET
         STKNGLELVPQPSYWNGKIPVSSSDAQ TADWDLSSHHQ+GLGV VA  LEPDDLGRYSLHAS    SEATNKQVTFREPVSNSEMDDPDVVHQ +R+ 
Subjt:  TSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET

Query:  ITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR
        +TNWSS +SPP ATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR
Subjt:  ITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR

Query:  VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSP
        VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEH+KITC+ EMQNHIEKTLYSGHASYKVS+SA YLDIWEPATLS+KREGYSIKCSGPS D ITEKFS 
Subjt:  VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSP

Query:  NTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        NT VSIPFGHP EF+ITGSNNVEHHLRAENNSADIS FRDTIVL LRLFI+R
Subjt:  NTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

XP_022158446.1 uncharacterized protein LOC111024932 isoform X2 [Momordica charantia]; e value: 0.0e+00; % identity: 91.88
Query:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNEN
        MENGFDGRSLAEKFSGLAVTAA PEQ NSHSSNNHS+SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKI+ELEKYKVDESL RRFHSTDQWNEN
Subjt:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNEN

Query:  ENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFP
        +   HGSNGGHQSDNSVDNERH FKNNIS VDSHGTLVVH+ VEQKDEVSMRID E R+ + KSD IVNALPGVQP VDNAG SQFSSPSTTSFSASRFP
Subjt:  ENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFP

Query:  VDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI
        VDGEYDP+IKLSGHGLM KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI
Subjt:  VDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI

Query:  IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT
        IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLT 
Subjt:  IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT

Query:  STKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETI
        STKNGLELVPQPSYWNGKIPVSSSDAQ TADWDLSSHHQ+GLGV VA  LEPDDLGRYSLHAS    SEATNKQVTFREPVSNSEMDDPDVVHQ +R+ +
Subjt:  STKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETI

Query:  TNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV
        TNWSS +SPP ATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV
Subjt:  TNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV

Query:  TADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPN
        TADDVDTYLAIEVQPLDNRRRKGELVKVFANEH+KITC+ EMQNHIEKTLYSGHASYKVS+SA YLDIWEPATLS+KREGYSIKCSGPS D ITEKFS N
Subjt:  TADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPN

Query:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        T VSIPFGHP EF+ITGSNNVEHHLRAENNSADIS FRDTIVL LRLFI+R
Subjt:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

XP_022158447.1 uncharacterized protein LOC111024932 isoform X3 [Momordica charantia]; e value: 0.0e+00; % identity: 91.76
Query:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKY-KVDESLDRRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAA PEQ NSHSSNNHS+SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKI+ELEKY KVDESL RRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKY-KVDESLDRRFHSTDQWNE

Query:  NENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRF
        N+   HGSNGGHQSDNSVDNERH FKNNIS VDSHGTLVVH+ VEQKDEVSMRID E R+ + KSD IVNALPGVQP VDNAG SQFSSPSTTSFSASRF
Subjt:  NENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRF

Query:  PVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD
        PVDGEYDP+IKLSGHGLM KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD
Subjt:  PVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD

Query:  IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT
        IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLT
Subjt:  IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT

Query:  TSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET
         STKNGLELVPQPSYWNGKIPVSSSDAQ TADWDLSSHHQ+GLGV VA  LEPDDLGRYSLHAS    SEATNKQVTFREPVSNSEMDDPDVVHQ +R+ 
Subjt:  TSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET

Query:  ITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR
        +TNWSS +SPP ATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR
Subjt:  ITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR

Query:  VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSP
        VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEH+KITC+ EMQNHIEKTLYSGHASYKVS+SA YLDIWEPATLS+KREGYSIKCSGPS D ITEKFS 
Subjt:  VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSP

Query:  NTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        NT VSIPFGHP EF+ITGSNNVEHHLRAENNSADIS FRDTIVL LRLFI+R
Subjt:  NTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

XP_038900808.1 uncharacterized protein LOC120087880 isoform X1 [Benincasa hispida]; e value: 0.0e+00; % identity: 90.68
Query:  MENGFDGRSLAEKFSGLAVTAAP-EQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNEN
        MENGFDGRSLAEKFS L V+A P EQSNSHSSNNH ++NDSNLFQVLKAVEAAEATIKQQVEENNRLR ELQKKI+ELEKYKVDE L +RFHST+QW  +
Subjt:  MENGFDGRSLAEKFSGLAVTAAP-EQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNEN

Query:  ENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFP
        ENDHHGSNGGHQSDNSVDNER  FKNNISIVDS G LV+HQ VEQKDEVSMR+D E RFE+ KSDR+VNALPGVQ  VDNAGCSQFSSPSTTSFSASRF 
Subjt:  ENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFP

Query:  VDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI
        +D EYDPRIKLSGHG+MPKAE NNPNSLWKQDLVVKVQEHEDEIVQLRKHLA+YSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI
Subjt:  VDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI

Query:  IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT
        IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT
Subjt:  IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT

Query:  STKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETI
        STKNGLELVPQPSYWNGK+PVSSSDAQ TADWDLS+HHQ+GLGVGVA KLEPDDLGRYS HAS    SE TNKQVTFREPVSNSE+DD DVVHQ ERETI
Subjt:  STKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETI

Query:  TNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV
        TNWSSGQSPP A  DEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQ+LQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV
Subjt:  TNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV

Query:  TADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPN
        TADDVDTYLAIEVQPLDNRRRKGELVKVFAN+H+KITCD EMQN IEKTLYSGHASYKVSMSA YLDIWE ATLS+KREGYSIKCSG S DVITEKFSPN
Subjt:  TADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPN

Query:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        TIVSIPFG+PSEF ITGSNNVEHHLR +NNSADIS  RDTIVLTLRLFILR
Subjt:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

XP_038900809.1 uncharacterized protein LOC120087880 isoform X2 [Benincasa hispida]; e value: 0.0e+00; % identity: 90.55
Query:  MENGFDGRSLAEKFSGLAVTAAP-EQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNEN
        MENGFDGRSLAEKFS L V+A P EQSNSHSSNNH ++NDSNLFQVLKAVEAAEATIKQQVEENNRLR ELQKKI+ELEKY VDE L +RFHST+QW  +
Subjt:  MENGFDGRSLAEKFSGLAVTAAP-EQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNEN

Query:  ENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFP
        ENDHHGSNGGHQSDNSVDNER  FKNNISIVDS G LV+HQ VEQKDEVSMR+D E RFE+ KSDR+VNALPGVQ  VDNAGCSQFSSPSTTSFSASRF 
Subjt:  ENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFP

Query:  VDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI
        +D EYDPRIKLSGHG+MPKAE NNPNSLWKQDLVVKVQEHEDEIVQLRKHLA+YSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI
Subjt:  VDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI

Query:  IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT
        IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT
Subjt:  IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT

Query:  STKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETI
        STKNGLELVPQPSYWNGK+PVSSSDAQ TADWDLS+HHQ+GLGVGVA KLEPDDLGRYS HAS    SE TNKQVTFREPVSNSE+DD DVVHQ ERETI
Subjt:  STKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETI

Query:  TNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV
        TNWSSGQSPP A  DEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQ+LQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV
Subjt:  TNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV

Query:  TADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPN
        TADDVDTYLAIEVQPLDNRRRKGELVKVFAN+H+KITCD EMQN IEKTLYSGHASYKVSMSA YLDIWE ATLS+KREGYSIKCSG S DVITEKFSPN
Subjt:  TADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPN

Query:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        TIVSIPFG+PSEF ITGSNNVEHHLR +NNSADIS  RDTIVLTLRLFILR
Subjt:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CDV6 uncharacterized protein LOC103499606 isoform X1; e value: 0.0e+00; % identity: 89.71
Query:  GFDGRSLAEKFSGLAVTAAP-EQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENEND
        GFDGRSLAEKFS L V+A P EQSNSH   NH +++DSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKI+ELEKYKV E L +RFHST+QW  NE+D
Subjt:  GFDGRSLAEKFSGLAVTAAP-EQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENEND

Query:  HHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFPVDG
        HHGSNGGHQSDNSVDNER  FKN+IS+VDSHGTLV+HQ VEQKDEVSMR+DTE RFE+SKSDR+VNALPGVQP VDNAGCSQFSSPSTTSFSASRF +D 
Subjt:  HHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFPVDG

Query:  EYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE
        EYDPRIKLSGHG+MPKAE NNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE
Subjt:  EYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE

Query:  NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTTSTK
        NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT STK
Subjt:  NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTTSTK

Query:  NGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETITNW
        NGLELVPQPSYWNGK+PVSSSDAQ TADWDLS+HHQ+GLGVGV   LEPDDLGRYS HAS    SE TNKQVTFREPVSNSE+DD DVVHQ ERE ITNW
Subjt:  NGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETITNW

Query:  SSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTAD
        SSGQSPP AT DEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQ+LQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTAD
Subjt:  SSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTAD

Query:  DVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPNTIV
        DVDTYLAIEVQPLDNRRRKGELVKVFAN+H+KITCD EMQN IEKTLYSGHASYKVSMSA YL IWE ATLS+KREGYSIKCSG S DVITEKFS NTIV
Subjt:  DVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPNTIV

Query:  SIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        SIPFG+PS+F ITGSNNVEH LR +NNSADIS  RDTIVLTLRLFILR
Subjt:  SIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

A0A1S4E315 uncharacterized protein LOC103499606 isoform X2; e value: 0.0e+00; % identity: 89.57
Query:  GFDGRSLAEKFSGLAVTAAP-EQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENEND
        GFDGRSLAEKFS L V+A P EQSNSH   NH +++DSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKI+ELEKY V E L +RFHST+QW  NE+D
Subjt:  GFDGRSLAEKFSGLAVTAAP-EQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENEND

Query:  HHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFPVDG
        HHGSNGGHQSDNSVDNER  FKN+IS+VDSHGTLV+HQ VEQKDEVSMR+DTE RFE+SKSDR+VNALPGVQP VDNAGCSQFSSPSTTSFSASRF +D 
Subjt:  HHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFPVDG

Query:  EYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE
        EYDPRIKLSGHG+MPKAE NNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE
Subjt:  EYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE

Query:  NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTTSTK
        NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT STK
Subjt:  NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTTSTK

Query:  NGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETITNW
        NGLELVPQPSYWNGK+PVSSSDAQ TADWDLS+HHQ+GLGVGV   LEPDDLGRYS HAS    SE TNKQVTFREPVSNSE+DD DVVHQ ERE ITNW
Subjt:  NGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETITNW

Query:  SSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTAD
        SSGQSPP AT DEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQ+LQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTAD
Subjt:  SSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTAD

Query:  DVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPNTIV
        DVDTYLAIEVQPLDNRRRKGELVKVFAN+H+KITCD EMQN IEKTLYSGHASYKVSMSA YL IWE ATLS+KREGYSIKCSG S DVITEKFS NTIV
Subjt:  DVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPNTIV

Query:  SIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        SIPFG+PS+F ITGSNNVEH LR +NNSADIS  RDTIVLTLRLFILR
Subjt:  SIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

A0A6J1DVU9 uncharacterized protein LOC111024932 isoform X1; e value: 0.0e+00; % identity: 91.76
Query:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKY-KVDESLDRRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAA PEQ NSHSSNNHS+SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKI+ELEKY KVDESL RRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKY-KVDESLDRRFHSTDQWNE

Query:  NENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRF
        N+   HGSNGGHQSDNSVDNERH FKNNIS VDSHGTLVVH+ VEQKDEVSMRID E R+ + KSD IVNALPGVQP VDNAG SQFSSPSTTSFSASRF
Subjt:  NENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRF

Query:  PVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD
        PVDGEYDP+IKLSGHGLM KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD
Subjt:  PVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD

Query:  IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT
        IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLT
Subjt:  IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT

Query:  TSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET
         STKNGLELVPQPSYWNGKIPVSSSDAQ TADWDLSSHHQ+GLGV VA  LEPDDLGRYSLHAS    SEATNKQVTFREPVSNSEMDDPDVVHQ +R+ 
Subjt:  TSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET

Query:  ITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR
        +TNWSS +SPP ATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR
Subjt:  ITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR

Query:  VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSP
        VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEH+KITC+ EMQNHIEKTLYSGHASYKVS+SA YLDIWEPATLS+KREGYSIKCSGPS D ITEKFS 
Subjt:  VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSP

Query:  NTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        NT VSIPFGHP EF+ITGSNNVEHHLRAENNSADIS FRDTIVL LRLFI+R
Subjt:  NTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

A0A6J1DX87 uncharacterized protein LOC111024932 isoform X3; e value: 0.0e+00; % identity: 91.76
Query:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKY-KVDESLDRRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAA PEQ NSHSSNNHS+SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKI+ELEKY KVDESL RRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKY-KVDESLDRRFHSTDQWNE

Query:  NENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRF
        N+   HGSNGGHQSDNSVDNERH FKNNIS VDSHGTLVVH+ VEQKDEVSMRID E R+ + KSD IVNALPGVQP VDNAG SQFSSPSTTSFSASRF
Subjt:  NENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRF

Query:  PVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD
        PVDGEYDP+IKLSGHGLM KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD
Subjt:  PVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQD

Query:  IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT
        IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLT
Subjt:  IIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLT

Query:  TSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET
         STKNGLELVPQPSYWNGKIPVSSSDAQ TADWDLSSHHQ+GLGV VA  LEPDDLGRYSLHAS    SEATNKQVTFREPVSNSEMDDPDVVHQ +R+ 
Subjt:  TSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET

Query:  ITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR
        +TNWSS +SPP ATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR
Subjt:  ITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYR

Query:  VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSP
        VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEH+KITC+ EMQNHIEKTLYSGHASYKVS+SA YLDIWEPATLS+KREGYSIKCSGPS D ITEKFS 
Subjt:  VTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSP

Query:  NTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        NT VSIPFGHP EF+ITGSNNVEHHLRAENNSADIS FRDTIVL LRLFI+R
Subjt:  NTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

A0A6J1E0X2 uncharacterized protein LOC111024932 isoform X2; e value: 0.0e+00; % identity: 91.88
Query:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNEN
        MENGFDGRSLAEKFSGLAVTAA PEQ NSHSSNNHS+SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKI+ELEKYKVDESL RRFHSTDQWNEN
Subjt:  MENGFDGRSLAEKFSGLAVTAA-PEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNEN

Query:  ENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFP
        +   HGSNGGHQSDNSVDNERH FKNNIS VDSHGTLVVH+ VEQKDEVSMRID E R+ + KSD IVNALPGVQP VDNAG SQFSSPSTTSFSASRFP
Subjt:  ENDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFP

Query:  VDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI
        VDGEYDP+IKLSGHGLM KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI
Subjt:  VDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDI

Query:  IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT
        IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLT 
Subjt:  IEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTT

Query:  STKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETI
        STKNGLELVPQPSYWNGKIPVSSSDAQ TADWDLSSHHQ+GLGV VA  LEPDDLGRYSLHAS    SEATNKQVTFREPVSNSEMDDPDVVHQ +R+ +
Subjt:  STKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETI

Query:  TNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV
        TNWSS +SPP ATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV
Subjt:  TNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRV

Query:  TADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPN
        TADDVDTYLAIEVQPLDNRRRKGELVKVFANEH+KITC+ EMQNHIEKTLYSGHASYKVS+SA YLDIWEPATLS+KREGYSIKCSGPS D ITEKFS N
Subjt:  TADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPN

Query:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        T VSIPFGHP EF+ITGSNNVEHHLRAENNSADIS FRDTIVL LRLFI+R
Subjt:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G08440.1 unknown protein; e value: 3.0e-190; % identity: 54.33
Query:  MENGFDGRSLAEKFSGLAVTAAPEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENE
        M+NG + R LAE+FSG+ +    E S SH ++     NDS LFQV+KAVEAAEATIKQQVEENN L+ ELQ++  EL KYK  ESL +   ++D  N + 
Subjt:  MENGFDGRSLAEKFSGLAVTAAPEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENE

Query:  NDHHGSNGGHQSDNSVD-NERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIV-NALPGVQPPVDNAGCSQF-SSPSTTSFSASR
            GS+  HQS   +   +R   K N S     G LVVHQ V    E         R E+  S+ I+ N +  V+  V   G SQ  SSPST S S  R
Subjt:  NDHHGSNGGHQSDNSVD-NERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIV-NALPGVQPPVDNAGCSQF-SSPSTTSFSASR

Query:  FPVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQ
          ++G++D  I  S H LMP  EVNN  + WKQ+L+ KVQE + EI++LRK+LADYS KE QIRNEKYVLEKRIA+MR AFDQQQQDLVDAASKALSYRQ
Subjt:  FPVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQ

Query:  DIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATL
        +IIEENIRLTYALQ A+QER+ FVS LLPLL+EYSL P + D+QSI+S+VK+LF+HLQEKL +TETKLKE++YQL PW+SD +HS+ +P SP+  +G  L
Subjt:  DIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATL

Query:  TTSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYS-------LHASSTHYSEATNKQVTFREPVSNSEMDDPDV
          ST                              D   HHQ   G   A+    D     S         A +   S   N +V FREP+SN+ MDD   
Subjt:  TTSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYS-------LHASSTHYSEATNKQVTFREPVSNSEMDDPDV

Query:  VHQAERETITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSE--DDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNY
          QA+  T       ++     +D+PS S+ PIL PVLEEPS SFSE  DDDPLP I  LQISGE FPG+ELQ  G+SINGTT CNFEWVRHLEDGSVNY
Subjt:  VHQAERETITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSE--DDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNY

Query:  IEGAKQPNYRVTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPS
        I+GAK+P+Y VTADDVD YLAIEV PLD++ RKGELV+VFANE+ KITC  EMQ+HIEK+LY+GHA +KVS S  YLDIWE ATLS+K+EGYSIK   P+
Subjt:  IEGAKQPNYRVTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPS

Query:  ND-VITEKFSPNTIVSIPFGHPSEFIITGSNNVEHHLR-AENNSADISGFRDTIVLTLRLFI
        ND VITEKFS +T + IPF  P++F+I G++  EH  R  +N++ D+S  RDTIVLTLRLF+
Subjt:  ND-VITEKFSPNTIVSIPFGHPSEFIITGSNNVEHHLR-AENNSADISGFRDTIVLTLRLFI

AT5G08440.2 unknown protein; e value: 1.0e-177; % identity: 50.37
Query:  MENGFDGRSLAEKFSGLAVTAAPEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENE
        M+NG + R LAE+FSG+ +    E S SH ++     NDS LFQV+KAVEAAEATIKQQVEENN L+ ELQ++  EL KYK  ESL +   ++D  N + 
Subjt:  MENGFDGRSLAEKFSGLAVTAAPEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENE

Query:  NDHHGSNGGHQSDNSVD-NERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIV-NALPGVQPPVDNAGCSQF-SSPSTTSFSASR
            GS+  HQS   +   +R   K N S     G LVVHQ V    E         R E+  S+ I+ N +  V+  V   G SQ  SSPST S S  R
Subjt:  NDHHGSNGGHQSDNSVD-NERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIV-NALPGVQPPVDNAGCSQF-SSPSTTSFSASR

Query:  FPVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQ
          ++G++D  I  S H LMP  EVNN  + WKQ+L+ KVQE + EI++LRK+LADYS KE QIRNEKYVLEKRIA+MR AFDQQQQDLVDAASKALSYRQ
Subjt:  FPVDGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQ

Query:  DIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLT------------------------------------
        +IIEENIRLTYALQ A+QER+ FVS LLPLL+EYSL P + D+QSI+S+VKI       KL                                       
Subjt:  DIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLT------------------------------------

Query:  ----------ETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTTSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEP
                   TKLKE++YQL PW+SD +HS+ +P SP+  +G  L  ST                              D   HHQ   G   A+    
Subjt:  ----------ETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTTSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEP

Query:  DDLGRYS-------LHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSE--DDDPL
        D     S         A +   S   N +V FREP+SN+ MDD     QA+  T       ++     +D+PS S+ PIL PVLEEPS SFSE  DDDPL
Subjt:  DDLGRYS-------LHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSE--DDDPL

Query:  PAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQ
        P I  LQISGE FPG+ELQ  G+SINGTT CNFEWVRHLEDGSVNYI+GAK+P+Y VTADDVD YLAIEV PLD++ RKGELV+VFANE+ KITC  EMQ
Subjt:  PAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQ

Query:  NHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSND-VITEKFSPNTIVSIPFGHPSEFIITGSNNVEHHLR-AENNSADISGFRDTI
        +HIEK+LY+GHA +KVS S  YLDIWE ATLS+K+EGYSIK   P+ND VITEKFS +T + IPF  P++F+I G++  EH  R  +N++ D+S  RDTI
Subjt:  NHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSND-VITEKFSPNTIVSIPFGHPSEFIITGSNNVEHHLR-AENNSADISGFRDTI

Query:  VLTLRLFI
        VLTLRLF+
Subjt:  VLTLRLFI

AT5G23490.1 unknown protein; e value: 1.0e-198; % identity: 55.75
Query:  MENGFDGRSLAEKFSGLAVTAAPEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENE
        MENG + R LAE+FSGL      E S+    N      + NLFQV+KAVEAAE TIK+QVEEN+RL+ ELQ+   EL KYK DESL +    T    ++ 
Subjt:  MENGFDGRSLAEKFSGLAVTAAPEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENE

Query:  NDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFPV
        N    S   HQ    VD +  + K   S  DS G LVVH  V    E         RFE+   + I N    V+  +D  G SQF S    S S  R  +
Subjt:  NDHHGSNGGHQSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFPV

Query:  DGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
        +GE+D     S HG MP  EVN+  + WKQDL+ KVQE E EI QLR++L D S+KEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDA+SKALSYRQ+II
Subjt:  DGEYDPRIKLSGHGLMPKAEVNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII

Query:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTTS
        EENIRLTYALQ  QQER+TFVS LLPLL+EYSLQP V DAQSI+SNVK+LFKHLQEKLLLTETKLKES+YQL PW+SD +HS+ +P +P  S G  LT S
Subjt:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTTS

Query:  TKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYS--LHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET
        TK+ +                 S      DW+L    Q   G         DD   +S   ++ S  +        +  E  ++ ++D+    H    E 
Subjt:  TKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMGLGVGVATKLEPDDLGRYS--LHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERET

Query:  I--TNWSSGQSPP-AATLDEPSSSHSPILPPVLEEPSPSFSE--DDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAK
        I  T     Q+P   +  D+PSSS+SP+L PV EEPS SFSE  DDDPLP IE LQISGE +PG ELQACGYSINGTTSCNFEWV HLEDGSVNYI+GAK
Subjt:  I--TNWSSGQSPP-AATLDEPSSSHSPILPPVLEEPSPSFSE--DDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAK

Query:  QPNYRVTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVIT
        QPNY VTADDVD YLAIEVQPLD+R RKGELVKVFAN+++KI C  +MQ++IEKTL++GHASYKVS++  ++DIWE ATLS+KREGYSIKC   S+  I 
Subjt:  QPNYRVTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDVIT

Query:  EKFSPNTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR
        EKFS +T V+IPFG P+E +I GS+  EH LRA+N S D+ G RD IVLTLRLFI R
Subjt:  EKFSPNTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR

AT5G23510.1 unknown protein (e value: 8.5e-76; %identity: 61.04)
Query:  PPAATLDEPSSSHSPILPP----VLEEPSPSF---SEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVT
        P  +  + P+ + +   PP    V+ +    F     DD PLPA+E LQISGE +PG ELQACGYSINGTTSCNFEWV HLEDGSVNYI+GAK+PNY VT
Subjt:  PPAATLDEPSSSHSPILPP----VLEEPSPSF---SEDDDPLPAIEALQISGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVT

Query:  ADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDV-ITEKFSPN
        ADDV   LAIEVQPLD+R RKGELVKVFAN+++KI C  EMQ++I+KTL++GHASYKVS++  ++ IWE ATLS++REGY+IKC   +ND+ ITEKFS +
Subjt:  ADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSMSARYLDIWEPATLSVKREGYSIKCSGPSNDV-ITEKFSPN

Query:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFI
        T V IPF  P+E +I GS+  EH LR +N   DIS  RD IVLTLR FI
Subjt:  TIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFI

AT5G23510.2 unknown protein (e value: 2.0e-77; %identity: 56.55)
Query:  STHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSF---SEDDDPLPAIEALQISGEAFPGQE
        STH+    N  +T  +  +  ++D   + H A  E     S  +SP     DE      PI   V+ +    F     DD PLPA+E LQISGE +PG E
Subjt:  STHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSF---SEDDDPLPAIEALQISGEAFPGQE

Query:  LQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVS
        LQACGYSINGTTSCNFEWV HLEDGSVNYI+GAK+PNY VTADDV   LAIEVQPLD+R RKGELVKVFAN+++KI C  EMQ++I+KTL++GHASYKVS
Subjt:  LQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVS

Query:  MSARYLDIWEPATLSVKREGYSIKCSGPSNDV-ITEKFSPNTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFI
        ++  ++ IWE ATLS++REGY+IKC   +ND+ ITEKFS +T V IPF  P+E +I GS+  EH LR +N   DIS  RD IVLTLR FI
Subjt:  MSARYLDIWEPATLSVKREGYSIKCSGPSNDV-ITEKFSPNTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFI


Sequences
CDS sequence
ATGGAAAATGGTTTTGACGGGAGATCATTGGCTGAAAAGTTCTCCGGATTGGCCGTCACAGCTGCTCCAGAGCAATCTAATTCCCACTCATCCAACAATCACAGTAGCAG
CAACGACAGCAACTTGTTTCAGGTCTTGAAAGCTGTTGAAGCAGCCGAGGCTACCATCAAGCAACAGGTGGAGGAAAATAATCGACTGAGGATTGAACTTCAGAAAAAGA
TTCGGGAACTGGAGAAATATAAGGTTGATGAATCTTTGGATCGAAGGTTTCATTCCACAGACCAATGGAATGAGAATGAAAATGACCACCATGGGTCTAATGGGGGTCAT
CAATCAGATAATTCAGTCGATAATGAAAGGCATATGTTTAAGAATAACATTTCTATAGTTGATTCACATGGAACGCTAGTTGTCCATCAAGCTGTTGAGCAAAAAGATGA
AGTTTCCATGCGAATTGATACAGAACCTCGCTTTGAGAATAGCAAATCCGACAGGATAGTGAATGCTCTTCCTGGTGTTCAGCCTCCAGTTGATAATGCTGGTTGCTCAC
AGTTCTCTTCACCATCTACAACATCCTTCTCGGCTAGCAGGTTTCCAGTAGATGGAGAATATGATCCACGGATTAAGTTGTCTGGACATGGCCTGATGCCAAAGGCTGAA
GTAAATAATCCCAACAGTCTCTGGAAGCAGGATCTTGTTGTTAAAGTCCAGGAACATGAAGATGAAATTGTGCAGCTACGCAAGCATCTTGCTGATTATTCTATCAAGGA
AGCACAAATTCGAAATGAAAAATATGTTCTGGAAAAACGTATTGCCTATATGCGTTTGGCCTTTGATCAACAACAACAAGACCTTGTTGATGCTGCTTCTAAAGCTCTCT
CATATCGACAAGACATAATTGAGGAAAATATACGTCTTACATATGCATTGCAGGAAGCACAGCAAGAGAGAACCACCTTTGTATCATCTTTGCTGCCTCTTCTTGCGGAA
TATTCACTACAGCCTCCTGTTCCTGATGCTCAGTCCATCATCAGCAATGTCAAGATTCTATTTAAGCACTTGCAGGAGAAGCTCCTTCTGACCGAGACAAAATTGAAGGA
GTCACAGTATCAATTAACACCTTGGCGCTCTGATGCAAGCCATTCGAGTTTTGCACCGCAGTCACCTTTCCACTCCATTGGTGCAACCTTAACCACTTCAACTAAAAATG
GGCTCGAACTGGTTCCTCAACCTTCATACTGGAACGGGAAGATCCCAGTTTCTTCTTCTGATGCTCAGGCGACAGCTGATTGGGATCTATCAAGTCATCATCAGATGGGT
TTAGGTGTTGGTGTTGCAACAAAGTTGGAACCAGATGATTTGGGGAGGTATTCACTTCATGCAAGCAGCACACATTACAGTGAAGCAACAAACAAGCAGGTGACATTTCG
TGAACCTGTAAGCAATAGTGAGATGGATGACCCGGATGTGGTCCACCAAGCAGAGAGAGAAACTATCACCAACTGGAGTTCTGGGCAATCTCCTCCTGCTGCCACTCTCG
ATGAGCCAAGCTCCTCTCATTCTCCAATTCTGCCTCCAGTCCTCGAGGAACCTTCACCTTCATTTTCTGAAGATGACGATCCATTACCTGCTATTGAGGCCCTCCAAATA
TCTGGTGAAGCGTTTCCAGGACAAGAGCTCCAAGCATGTGGATACTCAATTAATGGAACAACTAGCTGTAATTTTGAGTGGGTGCGGCATCTGGAAGATGGATCTGTTAA
TTATATTGAAGGGGCGAAGCAACCAAACTATCGTGTTACTGCAGATGATGTTGACACCTATCTAGCTATTGAAGTCCAGCCTTTGGATAACAGAAGGCGCAAGGGAGAGC
TTGTAAAGGTATTTGCCAATGAGCACCAAAAGATTACTTGTGATCTTGAAATGCAGAACCACATAGAGAAGACTCTTTACAGTGGTCATGCTTCATATAAAGTATCCATG
TCGGCTAGATATCTTGATATATGGGAACCGGCTACACTATCTGTCAAAAGGGAAGGATACAGTATAAAATGTAGTGGGCCCAGTAATGATGTCATCACTGAAAAGTTTTC
ACCAAATACAATTGTTTCAATTCCATTTGGACATCCTTCTGAGTTTATAATAACTGGTTCTAACAATGTTGAGCATCATTTGCGAGCAGAAAACAATTCAGCAGATATTA
GCGGTTTCAGAGATACCATTGTGCTAACCTTGAGACTATTCATTCTAAGG
mRNA sequence
ATGGAAAATGGTTTTGACGGGAGATCATTGGCTGAAAAGTTCTCCGGATTGGCCGTCACAGCTGCTCCAGAGCAATCTAATTCCCACTCATCCAACAATCACAGTAGCAG
CAACGACAGCAACTTGTTTCAGGTCTTGAAAGCTGTTGAAGCAGCCGAGGCTACCATCAAGCAACAGGTGGAGGAAAATAATCGACTGAGGATTGAACTTCAGAAAAAGA
TTCGGGAACTGGAGAAATATAAGGTTGATGAATCTTTGGATCGAAGGTTTCATTCCACAGACCAATGGAATGAGAATGAAAATGACCACCATGGGTCTAATGGGGGTCAT
CAATCAGATAATTCAGTCGATAATGAAAGGCATATGTTTAAGAATAACATTTCTATAGTTGATTCACATGGAACGCTAGTTGTCCATCAAGCTGTTGAGCAAAAAGATGA
AGTTTCCATGCGAATTGATACAGAACCTCGCTTTGAGAATAGCAAATCCGACAGGATAGTGAATGCTCTTCCTGGTGTTCAGCCTCCAGTTGATAATGCTGGTTGCTCAC
AGTTCTCTTCACCATCTACAACATCCTTCTCGGCTAGCAGGTTTCCAGTAGATGGAGAATATGATCCACGGATTAAGTTGTCTGGACATGGCCTGATGCCAAAGGCTGAA
GTAAATAATCCCAACAGTCTCTGGAAGCAGGATCTTGTTGTTAAAGTCCAGGAACATGAAGATGAAATTGTGCAGCTACGCAAGCATCTTGCTGATTATTCTATCAAGGA
AGCACAAATTCGAAATGAAAAATATGTTCTGGAAAAACGTATTGCCTATATGCGTTTGGCCTTTGATCAACAACAACAAGACCTTGTTGATGCTGCTTCTAAAGCTCTCT
CATATCGACAAGACATAATTGAGGAAAATATACGTCTTACATATGCATTGCAGGAAGCACAGCAAGAGAGAACCACCTTTGTATCATCTTTGCTGCCTCTTCTTGCGGAA
TATTCACTACAGCCTCCTGTTCCTGATGCTCAGTCCATCATCAGCAATGTCAAGATTCTATTTAAGCACTTGCAGGAGAAGCTCCTTCTGACCGAGACAAAATTGAAGGA
GTCACAGTATCAATTAACACCTTGGCGCTCTGATGCAAGCCATTCGAGTTTTGCACCGCAGTCACCTTTCCACTCCATTGGTGCAACCTTAACCACTTCAACTAAAAATG
GGCTCGAACTGGTTCCTCAACCTTCATACTGGAACGGGAAGATCCCAGTTTCTTCTTCTGATGCTCAGGCGACAGCTGATTGGGATCTATCAAGTCATCATCAGATGGGT
TTAGGTGTTGGTGTTGCAACAAAGTTGGAACCAGATGATTTGGGGAGGTATTCACTTCATGCAAGCAGCACACATTACAGTGAAGCAACAAACAAGCAGGTGACATTTCG
TGAACCTGTAAGCAATAGTGAGATGGATGACCCGGATGTGGTCCACCAAGCAGAGAGAGAAACTATCACCAACTGGAGTTCTGGGCAATCTCCTCCTGCTGCCACTCTCG
ATGAGCCAAGCTCCTCTCATTCTCCAATTCTGCCTCCAGTCCTCGAGGAACCTTCACCTTCATTTTCTGAAGATGACGATCCATTACCTGCTATTGAGGCCCTCCAAATA
TCTGGTGAAGCGTTTCCAGGACAAGAGCTCCAAGCATGTGGATACTCAATTAATGGAACAACTAGCTGTAATTTTGAGTGGGTGCGGCATCTGGAAGATGGATCTGTTAA
TTATATTGAAGGGGCGAAGCAACCAAACTATCGTGTTACTGCAGATGATGTTGACACCTATCTAGCTATTGAAGTCCAGCCTTTGGATAACAGAAGGCGCAAGGGAGAGC
TTGTAAAGGTATTTGCCAATGAGCACCAAAAGATTACTTGTGATCTTGAAATGCAGAACCACATAGAGAAGACTCTTTACAGTGGTCATGCTTCATATAAAGTATCCATG
TCGGCTAGATATCTTGATATATGGGAACCGGCTACACTATCTGTCAAAAGGGAAGGATACAGTATAAAATGTAGTGGGCCCAGTAATGATGTCATCACTGAAAAGTTTTC
ACCAAATACAATTGTTTCAATTCCATTTGGACATCCTTCTGAGTTTATAATAACTGGTTCTAACAATGTTGAGCATCATTTGCGAGCAGAAAACAATTCAGCAGATATTA
GCGGTTTCAGAGATACCATTGTGCTAACCTTGAGACTATTCATTCTAAGG
Protein sequence
MENGFDGRSLAEKFSGLAVTAAPEQSNSHSSNNHSSSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIRELEKYKVDESLDRRFHSTDQWNENENDHHGSNGGH
QSDNSVDNERHMFKNNISIVDSHGTLVVHQAVEQKDEVSMRIDTEPRFENSKSDRIVNALPGVQPPVDNAGCSQFSSPSTTSFSASRFPVDGEYDPRIKLSGHGLMPKAE
VNNPNSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENIRLTYALQEAQQERTTFVSSLLPLLAE
YSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAPQSPFHSIGATLTTSTKNGLELVPQPSYWNGKIPVSSSDAQATADWDLSSHHQMG
LGVGVATKLEPDDLGRYSLHASSTHYSEATNKQVTFREPVSNSEMDDPDVVHQAERETITNWSSGQSPPAATLDEPSSSHSPILPPVLEEPSPSFSEDDDPLPAIEALQI
SGEAFPGQELQACGYSINGTTSCNFEWVRHLEDGSVNYIEGAKQPNYRVTADDVDTYLAIEVQPLDNRRRKGELVKVFANEHQKITCDLEMQNHIEKTLYSGHASYKVSM
SARYLDIWEPATLSVKREGYSIKCSGPSNDVITEKFSPNTIVSIPFGHPSEFIITGSNNVEHHLRAENNSADISGFRDTIVLTLRLFILR