; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g30400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g30400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr3:21654934..21657236
RNA-Seq ExpressionMoc03g30400
SyntenyMoc03g30400
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137033.1 uncharacterized protein LOC111008593 [Momordica charantia]1.8e-244100Show/hide
Query:  MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS
        MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS
Subjt:  MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS

Query:  TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS
        TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS
Subjt:  TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS

Query:  GSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVG
        GSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVG
Subjt:  GSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVG

Query:  YWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGP
        YWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGP
Subjt:  YWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGP

Query:  GRNVKCP
        GRNVKCP
Subjt:  GRNVKCP

XP_022923706.1 uncharacterized protein LOC111431334 [Cucurbita moschata]1.7e-22189.54Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +SSSCFVV LLVFTS +SVF TS+    PPKN+T FHP+KEL +LKHIRAYLRKINKP+TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL+PPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAESFQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG
        WVISGSFGNDLNTIEAGWQVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISSY GKQFD+G+MVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY
        LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG N+ WGTYFY
Subjt:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGRNVKCP
Subjt:  YGGPGRNVKCP

XP_023000690.1 uncharacterized protein LOC111495054 [Cucurbita maxima]3.7e-22189.54Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +SSSCFVV LLVFTS +SVF TS+    PPKNQT FHP+KEL +LKHIRAYLRKINKP TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL PPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAE+FQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG
        WVISGSFGNDLNTIEAGWQVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSYNGKQFD+G+MVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY
        LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG N+VWGTYFY
Subjt:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGRNVKCP
Subjt:  YGGPGRNVKCP

XP_023519342.1 uncharacterized protein LOC111782769 [Cucurbita pepo subsp. pepo]3.2e-22089.29Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +SSSCFVV LLVFTS +SVF TS+    PPKNQT FHP+KEL +LKHIRAYLRKINK  TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL+PPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAESFQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG
        WVISGSFGNDLNTIEAGWQVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISSY GKQFD+G+MVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY
        LLVGYWPAFLFSHLRSHGSMVQFGGEIVNS+ SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG N+ WGTYFY
Subjt:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGRNVKCP
Subjt:  YGGPGRNVKCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]4.3e-22592.46Show/hide
Query:  SSSCFVVFLLVF-TSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        S SCFVVFLLVF TS +SVFS+S    +PPKNQT FHPAKEL+KLKHIR YLRKINKP  KTIRSSDGDVIDCV+SHLQPAFDHPELKGHTPLEPPERPR
Subjt:  SSSCFVVFLLVF-TSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAE+FQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG
        WVISGSFGNDLNTIEAGWQVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISSY+GKQFDIGLMVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY
        LLVGYWPAFLFSHLRSH SMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQ NNNVWGTYFY
Subjt:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGRNVKCP
Subjt:  YGGPGRNVKCP

TrEMBL top hitse value%identityAlignment
A0A6J1C958 uncharacterized protein LOC1110085938.9e-245100Show/hide
Query:  MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS
        MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS
Subjt:  MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS

Query:  TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS
        TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS
Subjt:  TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS

Query:  GSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVG
        GSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVG
Subjt:  GSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVG

Query:  YWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGP
        YWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGP
Subjt:  YWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGP

Query:  GRNVKCP
        GRNVKCP
Subjt:  GRNVKCP

A0A6J1E7H5 uncharacterized protein LOC1114313348.1e-22289.54Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +SSSCFVV LLVFTS +SVF TS+    PPKN+T FHP+KEL +LKHIRAYLRKINKP+TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL+PPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAESFQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG
        WVISGSFGNDLNTIEAGWQVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISSY GKQFD+G+MVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY
        LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG N+ WGTYFY
Subjt:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGRNVKCP
Subjt:  YGGPGRNVKCP

A0A6J1GRA4 uncharacterized protein LOC1114568101.7e-21988.81Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +S SCFVV LLVFTSL SVFSTS    MP KNQT FHP KEL+KLKHIRAYLRKINKP  KTI+SSDGDVIDCV+SHLQPAFDHPELKGH+PLEPPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
         N S E VA++ QLWS SGEFCPEGTIPIRRT E DI RA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG
        W+ISGSFGNDLNTIEAGWQVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTN++IAIGAAISP+SSY GKQFDIGLMVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY
         LVGYWPAFLFSHLRSH SMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG+NNVWGTYFY
Subjt:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGR V+CP
Subjt:  YGGPGRNVKCP

A0A6J1JZN8 uncharacterized protein LOC1114902772.0e-22089.05Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +S SCFVVFLLVFTSL SVFSTS    MP KNQT FHP KEL+KLK+IRAYLRKINKP  KTI+SSDGDVIDCV+SHLQPAFDHPELKGH+PLEPPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
         N S E VA++ QLWS SGEFCPEGTIPIRRT E DI RA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG
        W+ISGSFGNDLNTIEAGWQVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY GKQFDIGLMVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY
        +LVGYWPAFLFSHLRSH SMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG+NNVWGTYFY
Subjt:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGR V+CP
Subjt:  YGGPGRNVKCP

A0A6J1KJ26 uncharacterized protein LOC1114950541.8e-22189.54Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +SSSCFVV LLVFTS +SVF TS+    PPKNQT FHP+KEL +LKHIRAYLRKINKP TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL PPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAE+FQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG
        WVISGSFGNDLNTIEAGWQVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSYNGKQFD+G+MVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY
        LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG N+VWGTYFY
Subjt:  LLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGRNVKCP
Subjt:  YGGPGRNVKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)9.9e-18071.18Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNST
        +S+  F+  LL+ +S +SV S ++ P+NQT  P  EL KLK I  +LRKINKPS KTI S DGD+IDCV+ H QPAFDHP L+G  PL+PPERPRG+N  
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNST

Query:  EGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISG
            +SFQLW   GE CPEGT+PIRRT+E DILRA+SV  FG+K +R  RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLSQIW+ISG
Subjt:  EGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISG

Query:  SFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGY
        SFGNDLNTIEAGWQVSPELYGD+ PRFFTYWT DAYQATGCYNLLCSGFVQTN++IAIGAAISP SSY G QFDI L++WKDPKHG+WWLE+GSG+LVGY
Subjt:  SFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGY

Query:  WPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPG
        WP+FLF+HL+ H SMVQ+GGEIVNS   G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL +LADH +CYDI+ G+N  WG+YFYYGGPG
Subjt:  WPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPG

Query:  RNVKCP
        +N KCP
Subjt:  RNVKCP

AT1G23340.1 Protein of Unknown Function (DUF239)1.9e-17871.6Show/hide
Query:  ASSSC-FVVFLLVFTSLTSVFSTSMPPKNQT-----FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERP
        +SSSC F  F+L    L S+FS+   P N T       P +E+QK+K IR  L+KINKP+ KTI SSDGD IDCV SH QPAFDHP L+G  P++PPE P
Subjt:  ASSSC-FVVFLLVFTSLTSVFSTSMPPKNQT-----FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERP

Query:  RGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ
         G +      E+FQLWS  GE CPEGTIPIRRT E D+LRA+SVRRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEFSLSQ
Subjt:  RGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ

Query:  IWVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGS
        IW+I+GSF  DLNTIEAGWQ+SPELYGD+NPRFFTYWT+DAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY G QFDI L++WKDPKHGHWWL++GS
Subjt:  IWVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGS

Query:  GLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYF
        G LVGYWP  LF+HLR HG+MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR G N VWG +F
Subjt:  GLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYF

Query:  YYGGPGRNVKCP
        YYGGPG+N KCP
Subjt:  YYGGPGRNVKCP

AT1G23340.2 Protein of Unknown Function (DUF239)1.9e-17871.6Show/hide
Query:  ASSSC-FVVFLLVFTSLTSVFSTSMPPKNQT-----FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERP
        +SSSC F  F+L    L S+FS+   P N T       P +E+QK+K IR  L+KINKP+ KTI SSDGD IDCV SH QPAFDHP L+G  P++PPE P
Subjt:  ASSSC-FVVFLLVFTSLTSVFSTSMPPKNQT-----FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERP

Query:  RGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ
         G +      E+FQLWS  GE CPEGTIPIRRT E D+LRA+SVRRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEFSLSQ
Subjt:  RGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ

Query:  IWVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGS
        IW+I+GSF  DLNTIEAGWQ+SPELYGD+NPRFFTYWT+DAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY G QFDI L++WKDPKHGHWWL++GS
Subjt:  IWVISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGS

Query:  GLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYF
        G LVGYWP  LF+HLR HG+MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR G N VWG +F
Subjt:  GLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYF

Query:  YYGGPGRNVKCP
        YYGGPG+N KCP
Subjt:  YYGGPGRNVKCP

AT1G70550.2 Protein of Unknown Function (DUF239)3.5e-17771.15Show/hide
Query:  SSCFVVFLLVFTSLTSVFSTSMPPKN-----QTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGN
        SS F+  +L+   ++S FS++    N     QT  P +ELQKL  IR  L KINKP+ KTI+SSDGD IDCV +H QPAFDHP L+G  PL+PPE P+G 
Subjt:  SSCFVVFLLVFTSLTSVFSTSMPPKN-----QTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGN

Query:  NSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWV
        +  +G  E+ QLWS SGE CPEGTIPIRRT E D+LRASSV+RFGRK IRRV+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIWV
Subjt:  NSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWV

Query:  ISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLL
        I+GSF +DLNTIEAGWQ+SPELYGD+ PRFFTYWT+DAY+ TGCYNLLCSGFVQTN +IAIGAAISP SSY G QFDI L++WKDPKHGHWWL++GSG L
Subjt:  ISGSFGNDLNTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLL

Query:  VGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYG
        VGYWPAFLF+HL+ HGSMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR G N VWG YFYYG
Subjt:  VGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYG

Query:  GPGRNVKCP
        GPG+N +CP
Subjt:  GPGRNVKCP

AT5G50150.1 Protein of Unknown Function (DUF239)5.3e-18977.5Show/hide
Query:  VFLLVFTSLTSVFSTSMPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAES
        + LL+   L  +  +++  KNQT F P +E+QKL+ + AYL KINKPS KTI S DGDVI+CV SHLQPAFDHP+L+G  PL+ P RP   N T      
Subjt:  VFLLVFTSLTSVFSTSMPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAES

Query:  FQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDL
         QLWS SGE CP G+IPIR+T + D+LRA+SVRRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAPRVTD YEFSLSQIW+ISGSFG+DL
Subjt:  FQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDL

Query:  NTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLF
        NTIEAGWQVSPELYGD+ PRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISP SSYNG+QFDIGLM+WKDPKHGHWWLE G+GLLVGYWPAFLF
Subjt:  NTIEAGWQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLF

Query:  SHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        SHLRSH SMVQFGGE+VNSRSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLH+LADH  CYDIRQG NNVWGTYFYYGGPGRN +CP
Subjt:  SHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCTTGTTTTGTTGTTTTTCTTCTGGTTTTTACTTCCCTCACCTCCGTTTTCTCAACTTCCATGCCACCCAAGAACCAAACTTTCCACCCGGCCAAAGA
GCTGCAGAAACTAAAGCACATCAGAGCTTATTTACGCAAAATCAACAAGCCTTCAACCAAGACAATTCGGAGCTCAGATGGTGATGTCATAGACTGTGTGATTTCCCATC
TCCAGCCTGCTTTTGACCATCCTGAACTCAAAGGACACACCCCATTGGAGCCGCCGGAGAGGCCAAGAGGGAACAACTCGACGGAAGGTGTGGCAGAGAGCTTCCAATTA
TGGTCAGATTCTGGCGAATTCTGCCCGGAGGGAACTATTCCGATAAGAAGAACCAGAGAGACCGACATTCTTAGAGCAAGCTCTGTTCGCAGATTTGGAAGAAAACCCAT
TAGACGTGTGAGGAGAGATTCATCAGGCAATGGCCACGAGCATGCCGTGGTGTTTGTAAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCACCAC
GTGTAACCGACCAATACGAATTCAGCTTATCTCAAATATGGGTCATTTCAGGCTCCTTTGGCAATGATTTAAACACCATTGAAGCTGGATGGCAGGTTAGCCCTGAGCTG
TATGGCGACAGCAATCCTAGGTTCTTCACGTATTGGACGACTGATGCTTATCAAGCCACTGGCTGTTACAATCTACTTTGCTCTGGCTTTGTTCAAACTAACAACAAGAT
CGCCATTGGAGCAGCAATCTCCCCCATCTCCTCTTATAATGGCAAACAATTCGATATTGGTTTAATGGTTTGGAAGGACCCGAAGCACGGGCACTGGTGGTTGGAATACG
GGTCAGGTCTGCTAGTCGGATACTGGCCGGCGTTTCTGTTCAGCCATTTAAGGAGCCATGGGAGCATGGTGCAGTTTGGAGGGGAGATAGTGAACAGCAGATCATCAGGG
TTCCACACAGCCACTCAAATGGGGAGTGGCCATTTTGCAGAAGAAGGCTTTGGAAAAGCTTCATATTTCAGGAACCTCCAAGTGGTTGATTGGGACAATAATTTGCTTCC
TCTAACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATAAGACAAGGCAACAATAATGTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGGAGGAATG
TAAAATGCCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTTCTTGTTTTGTTGTTTTTCTTCTGGTTTTTACTTCCCTCACCTCCGTTTTCTCAACTTCCATGCCACCCAAGAACCAAACTTTCCACCCGGCCAAAGA
GCTGCAGAAACTAAAGCACATCAGAGCTTATTTACGCAAAATCAACAAGCCTTCAACCAAGACAATTCGGAGCTCAGATGGTGATGTCATAGACTGTGTGATTTCCCATC
TCCAGCCTGCTTTTGACCATCCTGAACTCAAAGGACACACCCCATTGGAGCCGCCGGAGAGGCCAAGAGGGAACAACTCGACGGAAGGTGTGGCAGAGAGCTTCCAATTA
TGGTCAGATTCTGGCGAATTCTGCCCGGAGGGAACTATTCCGATAAGAAGAACCAGAGAGACCGACATTCTTAGAGCAAGCTCTGTTCGCAGATTTGGAAGAAAACCCAT
TAGACGTGTGAGGAGAGATTCATCAGGCAATGGCCACGAGCATGCCGTGGTGTTTGTAAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCACCAC
GTGTAACCGACCAATACGAATTCAGCTTATCTCAAATATGGGTCATTTCAGGCTCCTTTGGCAATGATTTAAACACCATTGAAGCTGGATGGCAGGTTAGCCCTGAGCTG
TATGGCGACAGCAATCCTAGGTTCTTCACGTATTGGACGACTGATGCTTATCAAGCCACTGGCTGTTACAATCTACTTTGCTCTGGCTTTGTTCAAACTAACAACAAGAT
CGCCATTGGAGCAGCAATCTCCCCCATCTCCTCTTATAATGGCAAACAATTCGATATTGGTTTAATGGTTTGGAAGGACCCGAAGCACGGGCACTGGTGGTTGGAATACG
GGTCAGGTCTGCTAGTCGGATACTGGCCGGCGTTTCTGTTCAGCCATTTAAGGAGCCATGGGAGCATGGTGCAGTTTGGAGGGGAGATAGTGAACAGCAGATCATCAGGG
TTCCACACAGCCACTCAAATGGGGAGTGGCCATTTTGCAGAAGAAGGCTTTGGAAAAGCTTCATATTTCAGGAACCTCCAAGTGGTTGATTGGGACAATAATTTGCTTCC
TCTAACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATAAGACAAGGCAACAATAATGTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGGAGGAATG
TAAAATGCCCATGA
Protein sequenceShow/hide protein sequence
MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNSTEGVAESFQL
WSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPEL
YGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSG
FHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP