CuGenDBv2

Lag0012993 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0012993
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: SHR-BD domain-containing protein
Genome location: chr1:46443736..46455189
RNA-Seq Expression: Lag0012993
Synteny: Lag0012993
Gene Ontology terms:
    GO:0006623 - protein targeting to vacuole (biological process)
    GO:0045053 - protein retention in Golgi apparatus (biological process)
    GO:0016021 - integral component of membrane (cellular component)
    GO:0019898 - extrinsic component of membrane (cellular component)
InterPro domains:
    IPR026847 - Vacuolar protein sorting-associated protein 13
    IPR031645 - Vacuolar protein sorting-associated protein 13, C-terminal
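
The genome location listed above packs a chromosome name and a coordinate pair into one string. A minimal Python sketch of how it might be split and the span length computed, assuming the coordinates are 1-based and inclusive (the page does not state its convention); the helper name parse_location is hypothetical:

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span_length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
    return chrom, start, end, end - start + 1

print(parse_location("chr1:46443736..46455189"))
# -> ('chr1', 46443736, 46455189, 11454)
```

Under that assumption the gene spans 11454 bp on chr1, which is far longer than the CDS listed further down.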


Homology
GenBank top hits (e value, % identity, alignment)
XP_022977391.1 uncharacterized protein LOC111477741 isoform X1 [Cucurbita maxima] | e value: 2.3e-274 | % identity: 89.96
Query:  DGYGDEPS-HKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        D YG EPS  +PLVAARL DI+LDSVFTEQQKYNQ+TIQSL+LEEKRVGATFAAMLRRHQLD+SDSNDC LKIVCVLNSTSFHVKQVKYFS+VLQPIDLN
Subjt:  DGYGDEPS-HKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPF R SL+ SK+ SQQYYFDHFEIHPIKI ANFLPEE YSSYSSTQETLRTLLHSVVKIP MKNVVVELNGVLVTHALITMRELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYA+RAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF GFPGTFKFIKK I VKRGSGTKRYFGDLGKTLRTAGSNV+FAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGF+GMVSGFHQGILKI+MEPSLLGSALMQGGP+RKIKLDRSPG DELYIEGYLQA LDT+YRQEYLRVRV+DNQV LKNLPPNTPL NEIVE VKG
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSK LLKGD AM TRPF HL+R+SEWKIGPTVRTLCEHLFVSFAIRMLRKGV QIVVRIP+NKES SD QE+NL+LVPTGK +KGKFIWT+GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SG+LAYIDGRLCRNIPHPIARRIVSGFLLTLLD+NDKE
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

XP_023543847.1 uncharacterized protein LOC111803593 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 3.0e-274 | % identity: 90.15
Query:  DGYGDEPS-HKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        D YG EPS  +PLVAARL DI+LDSVFTEQQKYNQITIQSL+LEEKRVGATFAAMLRRHQLD+ DSNDC LKIVCVLNSTSFHVKQVKYFS+VLQPIDLN
Subjt:  DGYGDEPS-HKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPF R SL+ SK+ SQQYYFDHFEIHPIKI ANFLPEE YSSYSSTQETLRTLLHSVVKIP MKNVVVELNGVLVTHALITMRELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYA+RAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF GFPGTFKFIKK I VKRGSGTKRYFGDLGKTLRTAGSNV+FAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGF+GMVSGFHQGILKI+MEPSLLGSALMQGGP+RKIKLDRSPG DELYIEGYLQA LDT+YRQEYLRVRV+DNQV LKNLPPNTPL NEIVE VKG
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSK LLKGD AM TRPF HL+R+SEWKIGPTVRTLCEHLFVSFAIRMLRKGV QIVVRIPRNKES SD QE+NL+LVPTGK +KGKFIWT+GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SG+LAYIDGRLCRNIPHPIARRIVSGFLLTLLD+NDKE
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

XP_038880802.1 uncharacterized protein LOC120072502 isoform X1 [Benincasa hispida] | e value: 5.2e-279 | % identity: 91.45
Query:  DGYGDEPSH-KPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        DGY  EPS   PLVAARL DI+LDSVF EQQKYNQIT+QSLKLEEKRVGATFAAMLRRH+LD+SD NDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
Subjt:  DGYGDEPSH-KPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPFWR SLN SK+ SQQYYFDHFEIHPIKI ANF PEE YSSYSSTQETLRTLLHSVVKIP MKNVVVELNGVLVTHALITM ELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF GFPGTFKFIK+ ID KRGSGTKRYFGDLGKTLRTAGSNV+FAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGFNGMVSGFHQGILKI+MEPSLLGS LMQGGP+R IKLD+SPGVDELYIEGYLQAMLDT+YRQEYLRVRV+DNQVILKNLPPNTPL NEIVERVKG
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSK LLKGDS MS+RPFHHLRRESEWKIGPTV+TLCEHLFVSFAIRMLRKGVTQIVVRIPRNKES SD+QETNLALVPTGKE KGKFIWT GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SGI+AYIDGRLCRNIP+PI RRIVSGFLLTLLDNNDKE
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

XP_038880803.1 uncharacterized protein LOC120072502 isoform X2 [Benincasa hispida] | e value: 5.2e-279 | % identity: 91.45
Query:  DGYGDEPSH-KPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        DGY  EPS   PLVAARL DI+LDSVF EQQKYNQIT+QSLKLEEKRVGATFAAMLRRH+LD+SD NDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
Subjt:  DGYGDEPSH-KPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPFWR SLN SK+ SQQYYFDHFEIHPIKI ANF PEE YSSYSSTQETLRTLLHSVVKIP MKNVVVELNGVLVTHALITM ELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF GFPGTFKFIK+ ID KRGSGTKRYFGDLGKTLRTAGSNV+FAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGFNGMVSGFHQGILKI+MEPSLLGS LMQGGP+R IKLD+SPGVDELYIEGYLQAMLDT+YRQEYLRVRV+DNQVILKNLPPNTPL NEIVERVKG
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSK LLKGDS MS+RPFHHLRRESEWKIGPTV+TLCEHLFVSFAIRMLRKGVTQIVVRIPRNKES SD+QETNLALVPTGKE KGKFIWT GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SGI+AYIDGRLCRNIP+PI RRIVSGFLLTLLDNNDKE
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

XP_038880804.1 uncharacterized protein LOC120072502 isoform X3 [Benincasa hispida] | e value: 5.2e-279 | % identity: 91.45
Query:  DGYGDEPSH-KPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        DGY  EPS   PLVAARL DI+LDSVF EQQKYNQIT+QSLKLEEKRVGATFAAMLRRH+LD+SD NDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
Subjt:  DGYGDEPSH-KPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPFWR SLN SK+ SQQYYFDHFEIHPIKI ANF PEE YSSYSSTQETLRTLLHSVVKIP MKNVVVELNGVLVTHALITM ELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF GFPGTFKFIK+ ID KRGSGTKRYFGDLGKTLRTAGSNV+FAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGFNGMVSGFHQGILKI+MEPSLLGS LMQGGP+R IKLD+SPGVDELYIEGYLQAMLDT+YRQEYLRVRV+DNQVILKNLPPNTPL NEIVERVKG
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSK LLKGDS MS+RPFHHLRRESEWKIGPTV+TLCEHLFVSFAIRMLRKGVTQIVVRIPRNKES SD+QETNLALVPTGKE KGKFIWT GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SGI+AYIDGRLCRNIP+PI RRIVSGFLLTLLDNNDKE
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BTB2 uncharacterized protein LOC111005570 isoform X3 | e value: 2.4e-269 | % identity: 88.48
Query:  DGYGDEP-SHKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        D  G EP ++ PLVAARL DI L SVFTEQQKYNQITIQSL+LEEKRVGA FAAM+RRHQ+D+SDSNDCVLK+VC+LNSTS HVKQVKY SVVLQPIDLN
Subjt:  DGYGDEP-SHKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPFWR SL+ SK+ SQQYYFDHFEIHPIKIIANFLPEE YSSYSSTQETLRTLLHSVVKIPPMKNV VELNGVLVTHALITMRELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYAMRA+YIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF G PGTFKFIKK IDVKR SGTKRYFGDLGKTL+TAGSNVIFAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGFNGMVSGFHQGILKI+M+PSLLGS LM+GGP+RKIKLDRSPGVDELY+EGYLQAMLDTMY+QEYLRVRV+DNQVILKNLPPNT L NEIVE VK 
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSKALLKGD A +TRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGV+QIVVRIP+NKES SD + T LALVP GKEQ GKFIW++GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SGILAYIDGRLCRNIPHPI RRIVSGFLLTLLDNN  E
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

A0A6J1BTH4 uncharacterized protein LOC111005570 isoform X2 | e value: 2.4e-269 | % identity: 88.48
Query:  DGYGDEP-SHKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        D  G EP ++ PLVAARL DI L SVFTEQQKYNQITIQSL+LEEKRVGA FAAM+RRHQ+D+SDSNDCVLK+VC+LNSTS HVKQVKY SVVLQPIDLN
Subjt:  DGYGDEP-SHKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPFWR SL+ SK+ SQQYYFDHFEIHPIKIIANFLPEE YSSYSSTQETLRTLLHSVVKIPPMKNV VELNGVLVTHALITMRELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYAMRA+YIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF G PGTFKFIKK IDVKR SGTKRYFGDLGKTL+TAGSNVIFAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGFNGMVSGFHQGILKI+M+PSLLGS LM+GGP+RKIKLDRSPGVDELY+EGYLQAMLDTMY+QEYLRVRV+DNQVILKNLPPNT L NEIVE VK 
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSKALLKGD A +TRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGV+QIVVRIP+NKES SD + T LALVP GKEQ GKFIW++GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SGILAYIDGRLCRNIPHPI RRIVSGFLLTLLDNN  E
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

A0A6J1BU46 uncharacterized protein LOC111005570 isoform X1 | e value: 2.4e-269 | % identity: 88.48
Query:  DGYGDEP-SHKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        D  G EP ++ PLVAARL DI L SVFTEQQKYNQITIQSL+LEEKRVGA FAAM+RRHQ+D+SDSNDCVLK+VC+LNSTS HVKQVKY SVVLQPIDLN
Subjt:  DGYGDEP-SHKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPFWR SL+ SK+ SQQYYFDHFEIHPIKIIANFLPEE YSSYSSTQETLRTLLHSVVKIPPMKNV VELNGVLVTHALITMRELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYAMRA+YIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF G PGTFKFIKK IDVKR SGTKRYFGDLGKTL+TAGSNVIFAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGFNGMVSGFHQGILKI+M+PSLLGS LM+GGP+RKIKLDRSPGVDELY+EGYLQAMLDTMY+QEYLRVRV+DNQVILKNLPPNT L NEIVE VK 
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSKALLKGD A +TRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGV+QIVVRIP+NKES SD + T LALVP GKEQ GKFIW++GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SGILAYIDGRLCRNIPHPI RRIVSGFLLTLLDNN  E
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

A0A6J1GDG3 uncharacterized protein LOC111453187 isoform X1 | e value: 2.1e-273 | % identity: 89.78
Query:  DGYGDEPS-HKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        D YG EPS  +PLVAARL DI+LDSVFTEQQKYNQITIQSL+LEEKR GATFAAMLRRHQLD+SDSNDC LKIVCVLNSTSFHVKQVKYFS+VLQPIDLN
Subjt:  DGYGDEPS-HKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPF R SL+ SK+ SQQYYFDHFEIHPIKI ANFLPEE YSSYSSTQETLRTLLHSVVKIP MKNVVVELNGVLVTHALITMRELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYA+RAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF GFPGTFKFIKK I VKRGSGTKRYFGDLGKTLRTAGSNV+FAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGF+GMVSGFHQGILKI+MEPSLLGSALMQGGP+RKIKLDRSPG DELYIEGYLQA LDT+YRQEYLRVRV+DNQV LKNLPPNTPL NEIVE VKG
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSK LLKGD AM TRPF HL+R+SEWKIGPTVRTLCEHLFVSFAIRMLRKGV QIVVRIPRNKES  D  E+NL+LVPTGK +KGKFIWT+GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SG+LAYIDGRLCRNIPHPIARRIVSGFLLTLLD+NDKE
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

A0A6J1IJS2 uncharacterized protein LOC111477741 isoform X1 | e value: 1.1e-274 | % identity: 89.96
Query:  DGYGDEPS-HKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN
        D YG EPS  +PLVAARL DI+LDSVFTEQQKYNQ+TIQSL+LEEKRVGATFAAMLRRHQLD+SDSNDC LKIVCVLNSTSFHVKQVKYFS+VLQPIDLN
Subjt:  DGYGDEPS-HKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLN

Query:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ
        LDEETLMRIAPF R SL+ SK+ SQQYYFDHFEIHPIKI ANFLPEE YSSYSSTQETLRTLLHSVVKIP MKNVVVELNGVLVTHALITMRELFLRCAQ
Subjt:  LDEETLMRIAPFWR-SLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQ

Query:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG
        HYSWYA+RAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF GFPGTFKFIKK I VKRGSGTKRYFGDLGKTLRTAGSNV+FAAITEISDSVLKG
Subjt:  HYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKG

Query:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG
        AEASGF+GMVSGFHQGILKI+MEPSLLGSALMQGGP+RKIKLDRSPG DELYIEGYLQA LDT+YRQEYLRVRV+DNQV LKNLPPNTPL NEIVE VKG
Subjt:  AEASGFNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKG

Query:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL
        FLVSK LLKGD AM TRPF HL+R+SEWKIGPTVRTLCEHLFVSFAIRMLRKGV QIVVRIP+NKES SD QE+NL+LVPTGK +KGKFIWT+GIGKF+L
Subjt:  FLVSKALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVL

Query:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
        SG+LAYIDGRLCRNIPHPIARRIVSGFLLTLLD+NDKE
Subjt:  SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE

SwissProt top hits (e value, % identity, alignment)
O42926 Vacuolar protein sorting-associated protein 13b | e value: 2.9e-06 | % identity: 25.64
Query:  VLNSTSFHVKQVKYFSVVLQPIDLNLDEETLMRIAP-FWRSLNVSK---SESQQYYFDHFEIHPIK---IIANFLPEELY-----------SSYSSTQET
        VL  +SF++  +KY S++LQ + L +++  ++ +    + S +VSK   S S+  + D FEI  +      +N   E L+           +SY S Q  
Subjt:  VLNSTSFHVKQVKYFSVVLQPIDLNLDEETLMRIAP-FWRSLNVSK---SESQQYYFDHFEIHPIK---IIANFLPEELY-----------SSYSSTQET

Query:  LRTLLHSV-----VKIPPMKNV---VVELNGVLVTHALITMRELFLRCAQHYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF
        +++   ++     + I  + N+    V+LN +L+ +A  T+ E+  R A HY       IY   G +    + + +F++++S   D+F++P +GF
Subjt:  LRTLLHSV-----VKIPPMKNV---VVELNGVLVTHALITMRELFLRCAQHYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGF

Q55FG3 Putative vacuolar protein sorting-associated protein 13C | e value: 1.9e-05 | % identity: 21.55
Query:  KYFSVVLQPIDLNLDEETLMRIAPF-----------------------WRSLNVSKSESQQYYFDHFEIHPIKIIANFL----PEELYSSYSSTQETLRT
        +YFS+++Q  D+NLDE +++    F                         + N S  E+   YF+   I+P+K+  +F+    P+E  +   +   +L  
Subjt:  KYFSVVLQPIDLNLDEETLMRIAPF-----------------------WRSLNVSKSESQQYYFDHFEIHPIKIIANFL----PEELYSSYSSTQETLRT

Query:  LLHSVVKIPPMKNV---VVELNGVLVTHALITMRELFLRCAQHYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKL
        LL       P  N+    ++ NG +  H  ++ R++    + H+S+  M   +   GS     + I + + L S   D F +P+ G    P  F      
Subjt:  LLHSVVKIPPMKNV---VVELNGVLVTHALITMRELFLRCAQHYSWYAMRAIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKL

Query:  IDVKRGSGT---KRYFGDLGKTLRTAGSNVIFAAITEISDSVLKGAEAS--------------GFNGMVSGFHQGILKISMEP
          + +G+ +      FG    T +  G+         + DS +K  + S              GF     G  +GI  I  EP
Subjt:  IDVKRGSGT---KRYFGDLGKTLRTAGSNVIFAAITEISDSVLKGAEAS--------------GFNGMVSGFHQGILKISMEP

Arabidopsis top hits (e value, % identity, alignment)
AT3G50380.1 Protein of unknown function (DUF1162) | e value: 1.6e-201 | % identity: 64.75
Query:  SHKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLNLDEETLMR
        S+ P++ ARLE++ L S+FT+QQK+NQ+ I++L ++ K  GA FAAMLR+HQ   SD+N C+ K V +L S+   V QVK+ S+VLQP++LNLDEETLMR
Subjt:  SHKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLNLDEETLMR

Query:  IAPFWRSLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQHYSWYAMRA
        +  FWRS   + ++S QYYFDHFEIHPIKI ANF+P   YSSY+S QETLR+LLHSVVK+P +KN+VVELNGVLVTHALIT+REL LRC +HYSWYAMRA
Subjt:  IAPFWRSLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQHYSWYAMRA

Query:  IYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFP----GTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKGAEASG
        IYIAKGS LLPP F S+FDD SSSSLD FFDPSRG    P    GTFK + KLID K  SGT+RYFGDLGKTLRTAGSNV+F A+TEISDSVL+GAE  G
Subjt:  IYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFP----GTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKGAEASG

Query:  FNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKGFLVSK
         +G+VSGFH GILK++MEPS++G+ALM+GGP+R IKLDR+PG+DELYIEGYLQAMLDTMYRQEYLRV+V+D+QV LKNLPP+  L +E+++RVK FL S+
Subjt:  FNGMVSGFHQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKGFLVSK

Query:  ALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRI-PRNKESESDSQE--TNLALVPT---GKEQKGKFIWTVGIGKFV
         LLKGD + S+RP   L  + EWKIGPTV TLCEHLFVSFAIR+L++  T+ +  + P+ +E+E+++ +  +N A+VP     K++K KF+W  GIG FV
Subjt:  ALLKGDSAMSTRPFHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRI-PRNKESESDSQE--TNLALVPT---GKEQKGKFIWTVGIGKFV

Query:  LSGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
         SGI+AYIDGRLCR IP+PIARRIVSGFLL+ LD + ++
Subjt:  LSGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE
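
The % identity values in the hit tables are reported per alignment. As a rough illustration of how identities can be counted from a Query line and the match line beneath it, the sketch below assumes the usual BLAST-style convention (a letter on the match line marks an identical residue; '+' or a blank marks a non-identical one). It scores only the short closing segment of the first GenBank hit, so its result is not expected to equal the 89.96 reported for that hit as a whole. The function name segment_identity is hypothetical.

```python
def segment_identity(query, match_line):
    """Return (identical, aligned) column counts for one alignment segment."""
    if len(query) != len(match_line):
        raise ValueError("query and match line must have the same length")
    # A column is identical when the match line repeats the query residue.
    identical = sum(
        1 for q, m in zip(query, match_line) if m.isalpha() and m == q
    )
    return identical, len(query)

query = "SGILAYIDGRLCRNIPHPIARRIVSGFLLTLLDNNDKE"
match = "SG+LAYIDGRLCRNIPHPIARRIVSGFLLTLLD+NDKE"
identical, aligned = segment_identity(query, match)
print(identical, aligned, round(100 * identical / aligned, 2))
# -> 36 38 94.74
```

Summing the identical and aligned counts over every segment of a hit, and only then dividing, would give a whole-alignment figure comparable to the table column.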


Sequences
CDS sequence
ATGTACGAAATTGATGGATATGGGGATGAGCCGTCTCATAAGCCACTTGTAGCTGCTAGACTCGAGGACATTAATTTGGACTCTGTGTTCACAGAGCAGCAGAAA
TATAACCAGATTACTATTCAGTCATTGAAATTGGAGGAGAAGAGAGTAGGAGCTACTTTTGCTGCAATGCTTCGTAGGCATCAACTTGATTGGAGTGACTCAAAT
GATTGTGTTCTCAAAATTGTCTGCGTCCTGAACTCAACCAGCTTTCATGTCAAGCAAGTGAAATACTTTTCTGTTGTTCTGCAGCCAATTGATTTGAACCTTGAT
GAGGAGACCTTAATGAGGATTGCTCCATTTTGGAGATCTCTCAACGTCTCAAAATCTGAGAGCCAGCAGTACTATTTTGACCATTTTGAGATCCACCCAATAAAG
ATTATTGCAAATTTCCTTCCTGAGGAATTATATTCAAGTTATAGTTCAACACAGGAGACCTTGAGGACCTTACTACATAGTGTAGTAAAGATCCCTCCAATGAAG
AATGTGGTTGTCGAGCTCAATGGTGTTCTAGTTACTCATGCTTTGATCACAATGCGTGAATTATTTCTTAGATGTGCTCAACATTATTCATGGTATGCCATGAGG
GCTATCTATATTGCAAAAGGAAGTTCATTGCTCCCTCCAGATTTTATTTCCATCTTTGATGATTTGTCTTCATCTTCATTAGATGTCTTCTTCGATCCCTCACGT
GGGTTTGCGGGTTTTCCAGGGACTTTCAAGTTTATCAAGAAATTAATTGATGTGAAAAGAGGGTCAGGTACCAAACGCTATTTTGGAGATTTAGGGAAAACTTTG
AGGACAGCCGGGTCGAATGTTATTTTTGCTGCTATAACTGAGATTTCTGACTCTGTTCTCAAGGGAGCAGAAGCAAGTGGTTTTAATGGAATGGTGAGTGGATTT
CACCAAGGAATACTAAAGATATCCATGGAGCCATCTTTACTTGGGAGCGCTTTGATGCAAGGTGGCCCAGAGAGAAAAATCAAACTTGATCGAAGCCCAGGGGTT
GATGAGTTATATATTGAAGGCTACCTGCAAGCTATGCTGGATACAATGTACAGACAAGAATATCTTAGAGTTAGAGTCGTTGACAATCAGGTTATCCTTAAAAAT
CTTCCCCCAAACACACCTCTGTCAAATGAGATTGTGGAACGTGTGAAAGGATTTCTTGTTAGCAAAGCATTGTTAAAGGGAGATTCAGCAATGAGTACTCGCCCT
TTTCACCATCTTCGACGTGAAAGTGAATGGAAGATTGGACCTACAGTGCGAACACTCTGTGAGCACCTCTTTGTAAGTTTTGCAATCCGTATGCTGAGAAAAGGT
GTCACACAGATTGTGGTCAGGATTCCTCGGAACAAAGAATCAGAATCTGATAGTCAAGAAACCAATCTTGCGTTAGTTCCAACAGGCAAGGAACAGAAAGGTAAG
TTCATCTGGACTGTGGGAATTGGCAAGTTTGTGCTGTCTGGTATCTTAGCATATATTGATGGTCGGTTGTGCCGTAATATCCCTCATCCTATTGCACGGCGGATT
GTAAGTGGCTTCTTGTTGACTCTCCTTGACAATAATGATAAGGAATAA
mRNA sequence
ATGTACGAAATTGATGGATATGGGGATGAGCCGTCTCATAAGCCACTTGTAGCTGCTAGACTCGAGGACATTAATTTGGACTCTGTGTTCACAGAGCAGCAGAAA
TATAACCAGATTACTATTCAGTCATTGAAATTGGAGGAGAAGAGAGTAGGAGCTACTTTTGCTGCAATGCTTCGTAGGCATCAACTTGATTGGAGTGACTCAAAT
GATTGTGTTCTCAAAATTGTCTGCGTCCTGAACTCAACCAGCTTTCATGTCAAGCAAGTGAAATACTTTTCTGTTGTTCTGCAGCCAATTGATTTGAACCTTGAT
GAGGAGACCTTAATGAGGATTGCTCCATTTTGGAGATCTCTCAACGTCTCAAAATCTGAGAGCCAGCAGTACTATTTTGACCATTTTGAGATCCACCCAATAAAG
ATTATTGCAAATTTCCTTCCTGAGGAATTATATTCAAGTTATAGTTCAACACAGGAGACCTTGAGGACCTTACTACATAGTGTAGTAAAGATCCCTCCAATGAAG
AATGTGGTTGTCGAGCTCAATGGTGTTCTAGTTACTCATGCTTTGATCACAATGCGTGAATTATTTCTTAGATGTGCTCAACATTATTCATGGTATGCCATGAGG
GCTATCTATATTGCAAAAGGAAGTTCATTGCTCCCTCCAGATTTTATTTCCATCTTTGATGATTTGTCTTCATCTTCATTAGATGTCTTCTTCGATCCCTCACGT
GGGTTTGCGGGTTTTCCAGGGACTTTCAAGTTTATCAAGAAATTAATTGATGTGAAAAGAGGGTCAGGTACCAAACGCTATTTTGGAGATTTAGGGAAAACTTTG
AGGACAGCCGGGTCGAATGTTATTTTTGCTGCTATAACTGAGATTTCTGACTCTGTTCTCAAGGGAGCAGAAGCAAGTGGTTTTAATGGAATGGTGAGTGGATTT
CACCAAGGAATACTAAAGATATCCATGGAGCCATCTTTACTTGGGAGCGCTTTGATGCAAGGTGGCCCAGAGAGAAAAATCAAACTTGATCGAAGCCCAGGGGTT
GATGAGTTATATATTGAAGGCTACCTGCAAGCTATGCTGGATACAATGTACAGACAAGAATATCTTAGAGTTAGAGTCGTTGACAATCAGGTTATCCTTAAAAAT
CTTCCCCCAAACACACCTCTGTCAAATGAGATTGTGGAACGTGTGAAAGGATTTCTTGTTAGCAAAGCATTGTTAAAGGGAGATTCAGCAATGAGTACTCGCCCT
TTTCACCATCTTCGACGTGAAAGTGAATGGAAGATTGGACCTACAGTGCGAACACTCTGTGAGCACCTCTTTGTAAGTTTTGCAATCCGTATGCTGAGAAAAGGT
GTCACACAGATTGTGGTCAGGATTCCTCGGAACAAAGAATCAGAATCTGATAGTCAAGAAACCAATCTTGCGTTAGTTCCAACAGGCAAGGAACAGAAAGGTAAG
TTCATCTGGACTGTGGGAATTGGCAAGTTTGTGCTGTCTGGTATCTTAGCATATATTGATGGTCGGTTGTGCCGTAATATCCCTCATCCTATTGCACGGCGGATT
GTAAGTGGCTTCTTGTTGACTCTCCTTGACAATAATGATAAGGAATAA
Protein sequence
MYEIDGYGDEPSHKPLVAARLEDINLDSVFTEQQKYNQITIQSLKLEEKRVGATFAAMLRRHQLDWSDSNDCVLKIVCVLNSTSFHVKQVKYFSVVLQPIDLNLD
EETLMRIAPFWRSLNVSKSESQQYYFDHFEIHPIKIIANFLPEELYSSYSSTQETLRTLLHSVVKIPPMKNVVVELNGVLVTHALITMRELFLRCAQHYSWYAMR
AIYIAKGSSLLPPDFISIFDDLSSSSLDVFFDPSRGFAGFPGTFKFIKKLIDVKRGSGTKRYFGDLGKTLRTAGSNVIFAAITEISDSVLKGAEASGFNGMVSGF
HQGILKISMEPSLLGSALMQGGPERKIKLDRSPGVDELYIEGYLQAMLDTMYRQEYLRVRVVDNQVILKNLPPNTPLSNEIVERVKGFLVSKALLKGDSAMSTRP
FHHLRRESEWKIGPTVRTLCEHLFVSFAIRMLRKGVTQIVVRIPRNKESESDSQETNLALVPTGKEQKGKFIWTVGIGKFVLSGILAYIDGRLCRNIPHPIARRI
VSGFLLTLLDNNDKE
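
The protein sequence above should correspond to a codon-by-codon translation of the CDS. A minimal sketch, assuming the standard genetic code and using a hypothetical translate helper, checks this on the first eight and last four codons of the CDS rather than on the full sequence:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGTACGAAATTGATGGATATGGG"))  # start of the CDS -> MYEIDGYG
print(translate("GATAAGGAATAA"))              # end of the CDS   -> DKE*
```

The two fragments agree with the start (MYEIDGYG) and end (DKE) of the listed protein, with the final TAA read as the stop codon.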