; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020007 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020007
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationscaffold22:1136753..1139146
RNA-Seq ExpressionMS020007
SyntenyMS020007
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137033.1 uncharacterized protein LOC111008593 [Momordica charantia]4.7e-24194.43Show/hide
Query:  MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS
        MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS
Subjt:  MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS

Query:  TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS
        TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS
Subjt:  TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS

Query:  GSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDI
        GSFGNDLNTIEAGW                        QVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDI
Subjt:  GSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDI

Query:  GLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLAD
        GLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLAD
Subjt:  GLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLAD

Query:  HSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        HSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
Subjt:  HSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

XP_022923706.1 uncharacterized protein LOC111431334 [Cucurbita moschata]5.6e-21884.6Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +SSSCFVV LLVFTS +SVF TS+    PPKN+T FHP+KEL +LKHIRAYLRKINKP+TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL+PPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAESFQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGK
        WVISGSFGNDLNTIEAGW                        QVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISSY GK
Subjt:  WVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGK

Query:  QFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH
        QFD+G+MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLH
Subjt:  QFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH

Query:  LLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        +LADHSDCYDIRQG N+ WGTYFYYGGPGRNVKCP
Subjt:  LLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

XP_023000690.1 uncharacterized protein LOC111495054 [Cucurbita maxima]1.3e-21784.6Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +SSSCFVV LLVFTS +SVF TS+    PPKNQT FHP+KEL +LKHIRAYLRKINKP TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL PPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAE+FQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGK
        WVISGSFGNDLNTIEAGW                        QVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSYNGK
Subjt:  WVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGK

Query:  QFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH
        QFD+G+MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH
Subjt:  QFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH

Query:  LLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        +LADHSDCYDIRQG N+VWGTYFYYGGPGRNVKCP
Subjt:  LLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

XP_023519342.1 uncharacterized protein LOC111782769 [Cucurbita pepo subsp. pepo]3.6e-21783.83Show/hide
Query:  SNQMASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPP
        S+  +SSSCFVV LLVFTS +SVF TS+    PPKNQT FHP+KEL +LKHIRAYLRKINK  TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL+PP
Subjt:  SNQMASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPP

Query:  ERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFS
        ERPRGNNS E VAESFQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFS
Subjt:  ERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFS

Query:  LSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISS
        LSQIWVISGSFGNDLNTIEAGW                        QVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISS
Subjt:  LSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISS

Query:  YNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPL
        Y GKQFD+G+MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNS+ SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPL
Subjt:  YNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPL

Query:  TNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        TNLH+LADHSDCYDIRQG N+ WGTYFYYGGPGRNVKCP
Subjt:  TNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]4.9e-22286.59Show/hide
Query:  SNQMASSSCFVVFLLVF-TSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEP
        S+   S SCFVVFLLVF TS +SVFS+S    +PPKNQT FHPAKEL+KLKHIR YLRKINKP  KTIRSSDGDVIDCV+SHLQPAFDHPELKGHTPLEP
Subjt:  SNQMASSSCFVVFLLVF-TSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEP

Query:  PERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PERPRGNNS E VAE+FQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
Subjt:  PERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPIS
        SLSQIWVISGSFGNDLNTIEAGW                        QVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPIS
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPIS

Query:  SYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLP
        SY+GKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSH SMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLP
Subjt:  SYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLP

Query:  LTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        LTNLHLLADHSDCYDIRQ NNNVWGTYFYYGGPGRNVKCP
Subjt:  LTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

TrEMBL top hitse value%identityAlignment
A0A6J1C958 uncharacterized protein LOC1110085932.3e-24194.43Show/hide
Query:  MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS
        MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS
Subjt:  MASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNS

Query:  TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS
        TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS
Subjt:  TEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVIS

Query:  GSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDI
        GSFGNDLNTIEAGW                        QVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDI
Subjt:  GSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDI

Query:  GLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLAD
        GLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLAD
Subjt:  GLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLAD

Query:  HSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        HSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
Subjt:  HSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

A0A6J1E7H5 uncharacterized protein LOC1114313342.7e-21884.6Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +SSSCFVV LLVFTS +SVF TS+    PPKN+T FHP+KEL +LKHIRAYLRKINKP+TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL+PPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAESFQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGK
        WVISGSFGNDLNTIEAGW                        QVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISPISSY GK
Subjt:  WVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGK

Query:  QFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH
        QFD+G+MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLH
Subjt:  QFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH

Query:  LLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        +LADHSDCYDIRQG N+ WGTYFYYGGPGRNVKCP
Subjt:  LLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

A0A6J1GRA4 uncharacterized protein LOC1114568101.9e-21683.37Show/hide
Query:  SNQMASSSCFVVFLLVFTSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPP
        S+  +S SCFVV LLVFTSL SVFSTS    MP KNQT FHP KEL+KLKHIRAYLRKINKP  KTI+SSDGDVIDCV+SHLQPAFDHPELKGH+PLEPP
Subjt:  SNQMASSSCFVVFLLVFTSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPP

Query:  ERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFS
        ERPR N S E VA++ QLWS SGEFCPEGTIPIRRT E DI RA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFS
Subjt:  ERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFS

Query:  LSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISS
        LSQIW+ISGSFGNDLNTIEAGW                        QVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTN++IAIGAAISP+SS
Subjt:  LSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISS

Query:  YNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPL
        Y GKQFDIGLMVWKDPKHGHWWLEYGSG LVGYWPAFLFSHLRSH SMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPL
Subjt:  YNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPL

Query:  TNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        TNLH+LADHSDCYDIRQG+NNVWGTYFYYGGPGR V+CP
Subjt:  TNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

A0A6J1JZN8 uncharacterized protein LOC1114902772.3e-21783.6Show/hide
Query:  SNQMASSSCFVVFLLVFTSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPP
        S+  +S SCFVVFLLVFTSL SVFSTS    MP KNQT FHP KEL+KLK+IRAYLRKINKP  KTI+SSDGDVIDCV+SHLQPAFDHPELKGH+PLEPP
Subjt:  SNQMASSSCFVVFLLVFTSLTSVFSTS----MPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPP

Query:  ERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFS
        ERPR N S E VA++ QLWS SGEFCPEGTIPIRRT E DI RA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFS
Subjt:  ERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFS

Query:  LSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISS
        LSQIW+ISGSFGNDLNTIEAGW                        QVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SS
Subjt:  LSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISS

Query:  YNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPL
        Y GKQFDIGLMVWKDPKHGHWWLEYGSG+LVGYWPAFLFSHLRSH SMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPL
Subjt:  YNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPL

Query:  TNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        TNLH+LADHSDCYDIRQG+NNVWGTYFYYGGPGR V+CP
Subjt:  TNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

A0A6J1KJ26 uncharacterized protein LOC1114950546.1e-21884.6Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR
        +SSSCFVV LLVFTS +SVF TS+    PPKNQT FHP+KEL +LKHIRAYLRKINKP TKTI+SSDGDVIDCV+SHLQPAFDHP LKGHTPL PPERPR
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSM----PPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPR

Query:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GNNS E VAE+FQLWS SG+FCPEGTIPIRRT E DI RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGK
        WVISGSFGNDLNTIEAGW                        QVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSYNGK
Subjt:  WVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGK

Query:  QFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH
        QFD+G+MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH
Subjt:  QFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH

Query:  LLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        +LADHSDCYDIRQG N+VWGTYFYYGGPGRNVKCP
Subjt:  LLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)2.5e-17667.21Show/hide
Query:  ASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNST
        +S+  F+  LL+ +S +SV S ++ P+NQT  P  EL KLK I  +LRKINKPS KTI S DGD+IDCV+ H QPAFDHP L+G  PL+PPERPRG+N  
Subjt:  ASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERPRGNNST

Query:  EGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISG
            +SFQLW   GE CPEGT+PIRRT+E DILRA+SV  FG+K +R  RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLSQIW+ISG
Subjt:  EGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISG

Query:  SFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIG
        SFGNDLNTIEAGW                        QVSPELYGD+ PRFFTYWT DAYQATGCYNLLCSGFVQTN++IAIGAAISP SSY G QFDI 
Subjt:  SFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIG

Query:  LMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH
        L++WKDPKHG+WWLE+GSG+LVGYWP+FLF+HL+ H SMVQ+GGEIVNS   G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL +LADH
Subjt:  LMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH

Query:  SDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
         +CYDI+ G+N  WG+YFYYGGPG+N KCP
Subjt:  SDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

AT1G23340.1 Protein of Unknown Function (DUF239)6.3e-17567.66Show/hide
Query:  ASSSC-FVVFLLVFTSLTSVFSTSMPPKNQT-----FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERP
        +SSSC F  F+L    L S+FS+   P N T       P +E+QK+K IR  L+KINKP+ KTI SSDGD IDCV SH QPAFDHP L+G  P++PPE P
Subjt:  ASSSC-FVVFLLVFTSLTSVFSTSMPPKNQT-----FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERP

Query:  RGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ
         G +      E+FQLWS  GE CPEGTIPIRRT E D+LRA+SVRRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEFSLSQ
Subjt:  RGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ

Query:  IWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNG
        IW+I+GSF  DLNTIEAGW                        Q+SPELYGD+NPRFFTYWT+DAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY G
Subjt:  IWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNG

Query:  KQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL
         QFDI L++WKDPKHGHWWL++GSG LVGYWP  LF+HLR HG+MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL
Subjt:  KQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL

Query:  HLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
         +LADH +CYDIR G N VWG +FYYGGPG+N KCP
Subjt:  HLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

AT1G23340.2 Protein of Unknown Function (DUF239)6.3e-17567.66Show/hide
Query:  ASSSC-FVVFLLVFTSLTSVFSTSMPPKNQT-----FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERP
        +SSSC F  F+L    L S+FS+   P N T       P +E+QK+K IR  L+KINKP+ KTI SSDGD IDCV SH QPAFDHP L+G  P++PPE P
Subjt:  ASSSC-FVVFLLVFTSLTSVFSTSMPPKNQT-----FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLEPPERP

Query:  RGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ
         G +      E+FQLWS  GE CPEGTIPIRRT E D+LRA+SVRRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEFSLSQ
Subjt:  RGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ

Query:  IWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNG
        IW+I+GSF  DLNTIEAGW                        Q+SPELYGD+NPRFFTYWT+DAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY G
Subjt:  IWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNG

Query:  KQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL
         QFDI L++WKDPKHGHWWL++GSG LVGYWP  LF+HLR HG+MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL
Subjt:  KQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL

Query:  HLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
         +LADH +CYDIR G N VWG +FYYGGPG+N KCP
Subjt:  HLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

AT1G70550.1 Protein of Unknown Function (DUF239)5.3e-17466Show/hide
Query:  HSNQMAS-------SSCFVVFLLVFTSLTSVFSTSMPPKN-----QTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELK
        HS+Q  S       SS F+  +L+   ++S FS++    N     QT  P +ELQKL  IR  L KINKP+ KTI+SSDGD IDCV +H QPAFDHP L+
Subjt:  HSNQMAS-------SSCFVVFLLVFTSLTSVFSTSMPPKN-----QTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELK

Query:  GHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPR
        G  PL+PPE P+G +  +G  E+ QLWS SGE CPEGTIPIRRT E D+LRASSV+RFGRK IRRV+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PR
Subjt:  GHTPLEPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPR

Query:  VTDQYEFSLSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIG
        VT QYEFSLSQIWVI+GSF +DLNTIEAGW                        Q+SPELYGD+ PRFFTYWT+DAY+ TGCYNLLCSGFVQTN +IAIG
Subjt:  VTDQYEFSLSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIG

Query:  AAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVD
        AAISP SSY G QFDI L++WKDPKHGHWWL++GSG LVGYWPAFLF+HL+ HGSMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VD
Subjt:  AAISPISSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVD

Query:  WDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        WDN L+P +NL +LADH +CYDIR G N VWG YFYYGGPG+N +CP
Subjt:  WDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP

AT5G50150.1 Protein of Unknown Function (DUF239)1.8e-18570.75Show/hide
Query:  TTSPKHSNQMASSSCFVVFLLVFTSLTSVFSTSMPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLE
        ++S   +    +S    + LL+   L  +  +++  KNQT F P +E+QKL+ + AYL KINKPS KTI S DGDVI+CV SHLQPAFDHP+L+G  PL+
Subjt:  TTSPKHSNQMASSSCFVVFLLVFTSLTSVFSTSMPPKNQT-FHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPLE

Query:  PPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
         P RP   N T       QLWS SGE CP G+IPIR+T + D+LRA+SVRRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAPRVTD YE
Subjt:  PPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPI
        FSLSQIW+ISGSFG+DLNTIEAGW                        QVSPELYGD+ PRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISP 
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPI

Query:  SSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLL
        SSYNG+QFDIGLM+WKDPKHGHWWLE G+GLLVGYWPAFLFSHLRSH SMVQFGGE+VNSRSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLL
Subjt:  SSYNGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLL

Query:  PLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP
        PL NLH+LADH  CYDIRQG NNVWGTYFYYGGPGRN +CP
Subjt:  PLTNLHLLADHSDCYDIRQGNNNVWGTYFYYGGPGRNVKCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTTACAAATCACCATATCATTACAGTAAAACAACAAACGACTTCGCCGAAACATTCCAACCAAATGGCTTCTTCTTCTTGTTTTGTTGTTTTTCTTCTGGTTTTTACTTC
CCTCACCTCCGTTTTCTCAACTTCCATGCCACCCAAGAACCAAACTTTCCACCCGGCCAAAGAGCTGCAGAAACTAAAGCACATCAGAGCTTATTTACGCAAAATCAACA
AGCCTTCAACCAAGACAATTCGGAGCTCAGATGGTGATGTCATAGACTGTGTGATTTCCCATCTCCAGCCTGCTTTTGACCATCCTGAACTCAAAGGACACACCCCATTG
GAGCCGCCGGAGAGGCCAAGAGGGAACAACTCGACGGAAGGTGTGGCAGAGAGCTTCCAATTATGGTCAGATTCTGGCGAATTCTGCCCGGAGGGAACTATTCCGATAAG
AAGAACCAGAGAGACCGACATTCTTAGAGCAAGCTCTGTTCGCAGATTTGGAAGAAAACCCATTAGACGTGTGAGGAGAGATTCATCAGGCAATGGCCACGAGCATGCCG
TGGTGTTTGTAAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCACCACGTGTAACCGACCAATACGAATTCAGCTTATCTCAAATATGGGTCATT
TCAGGCTCCTTTGGCAATGATTTAAACACCATTGAAGCTGGATGGCAGGAACATATCGAGGAATTTAGACACTTGAAATTCTTGTCCTTCAATCTTCATTTTTGTTTCAT
GTTTGTTCAGGTTAGCCCTGAGCTGTATGGCGACAGCAATCCTAGGTTCTTCACGTATTGGACGACTGATGCTTATCAAGCCACTGGCTGTTACAATCTACTTTGCTCTG
GCTTTGTTCAAACTAACAACAAGATCGCCATTGGAGCAGCAATCTCCCCCATCTCCTCTTATAATGGCAAACAATTCGATATTGGTTTAATGGTTTGGAAGGACCCGAAG
CACGGGCACTGGTGGTTGGAATACGGGTCGGGTCTGCTAGTCGGGTACTGGCCGGCGTTTCTGTTCAGCCATTTAAGGAGCCATGGGAGCATGGTGCAGTTTGGAGGGGA
GATAGTGAACAGCAGATCATCAGGGTTCCACACAGCCACTCAAATGGGGAGTGGCCATTTTGCAGAAGAAGGCTTTGGAAAAGCTTCATATTTCAGGAACCTCCAAGTGG
TTGATTGGGACAATAATTTGCTTCCTCTAACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATAAGACAAGGCAACAATAATGTTTGGGGCACTTATTTT
TACTATGGAGGTCCTGGGAGGAATGTAAAATGCCCA
mRNA sequenceShow/hide mRNA sequence
CTTACAAATCACCATATCATTACAGTAAAACAACAAACGACTTCGCCGAAACATTCCAACCAAATGGCTTCTTCTTCTTGTTTTGTTGTTTTTCTTCTGGTTTTTACTTC
CCTCACCTCCGTTTTCTCAACTTCCATGCCACCCAAGAACCAAACTTTCCACCCGGCCAAAGAGCTGCAGAAACTAAAGCACATCAGAGCTTATTTACGCAAAATCAACA
AGCCTTCAACCAAGACAATTCGGAGCTCAGATGGTGATGTCATAGACTGTGTGATTTCCCATCTCCAGCCTGCTTTTGACCATCCTGAACTCAAAGGACACACCCCATTG
GAGCCGCCGGAGAGGCCAAGAGGGAACAACTCGACGGAAGGTGTGGCAGAGAGCTTCCAATTATGGTCAGATTCTGGCGAATTCTGCCCGGAGGGAACTATTCCGATAAG
AAGAACCAGAGAGACCGACATTCTTAGAGCAAGCTCTGTTCGCAGATTTGGAAGAAAACCCATTAGACGTGTGAGGAGAGATTCATCAGGCAATGGCCACGAGCATGCCG
TGGTGTTTGTAAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCACCACGTGTAACCGACCAATACGAATTCAGCTTATCTCAAATATGGGTCATT
TCAGGCTCCTTTGGCAATGATTTAAACACCATTGAAGCTGGATGGCAGGAACATATCGAGGAATTTAGACACTTGAAATTCTTGTCCTTCAATCTTCATTTTTGTTTCAT
GTTTGTTCAGGTTAGCCCTGAGCTGTATGGCGACAGCAATCCTAGGTTCTTCACGTATTGGACGACTGATGCTTATCAAGCCACTGGCTGTTACAATCTACTTTGCTCTG
GCTTTGTTCAAACTAACAACAAGATCGCCATTGGAGCAGCAATCTCCCCCATCTCCTCTTATAATGGCAAACAATTCGATATTGGTTTAATGGTTTGGAAGGACCCGAAG
CACGGGCACTGGTGGTTGGAATACGGGTCGGGTCTGCTAGTCGGGTACTGGCCGGCGTTTCTGTTCAGCCATTTAAGGAGCCATGGGAGCATGGTGCAGTTTGGAGGGGA
GATAGTGAACAGCAGATCATCAGGGTTCCACACAGCCACTCAAATGGGGAGTGGCCATTTTGCAGAAGAAGGCTTTGGAAAAGCTTCATATTTCAGGAACCTCCAAGTGG
TTGATTGGGACAATAATTTGCTTCCTCTAACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATAAGACAAGGCAACAATAATGTTTGGGGCACTTATTTT
TACTATGGAGGTCCTGGGAGGAATGTAAAATGCCCA
Protein sequenceShow/hide protein sequence
LTNHHIITVKQQTTSPKHSNQMASSSCFVVFLLVFTSLTSVFSTSMPPKNQTFHPAKELQKLKHIRAYLRKINKPSTKTIRSSDGDVIDCVISHLQPAFDHPELKGHTPL
EPPERPRGNNSTEGVAESFQLWSDSGEFCPEGTIPIRRTRETDILRASSVRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVI
SGSFGNDLNTIEAGWQEHIEEFRHLKFLSFNLHFCFMFVQVSPELYGDSNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPISSYNGKQFDIGLMVWKDPK
HGHWWLEYGSGLLVGYWPAFLFSHLRSHGSMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGNNNVWGTYF
YYGGPGRNVKCP