CuGenDBv2

MC11g0655 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC11g0655
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Peptidase_S9 domain-containing protein
Genome location: MC11:5248137..5257303
RNA-Seq Expression: MC11g0655
Synteny: MC11g0655
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domains: IPR001375 - Peptidase S9, prolyl oligopeptidase, catalytic domain
IPR029058 - Alpha/Beta hydrolase fold
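
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch (the helper name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something the record states), the string can be split and the span of the locus estimated like this:

```python
import re

def parse_location(location: str) -> tuple:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("MC11:5248137..5257303")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus covers 9167 bp on MC11; with a 0-based, half-open convention the span would be one base shorter.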


Homology
GenBank top hits (e value, % identity, alignment)
XP_008466042.1 PREDICTED: putative esterase YitV [Cucumis melo] (e value: 2.38e-226; % identity: 76.89)
Query:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
        MA LISHVRL PSL CL  C APT PWN Q     K S++VAARG   +E +QMEEA+VDADKFR+EFLRVLR+RRSGE PL+VK   PV +PLIQEA+P
Subjt:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP

Query:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT
        PTFSK                           EGEQGQLPILI+SMK+SRQQ+RP IVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK KT
Subjt:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT

Query:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE
        TYRDALIS+WK+GDTMPFIFDT WDLIKLADYLT RED+DP RIGITGESLGGMHAWFAAAADTRY+VVVPIIGVQ F WA+D+DKWQARV+SIKPVFEE
Subjt:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE

Query:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA
        ARI+LG++EINKE+V+KVWNRIAPGL SQF SIYSVPAIAPRPLLLLNGADDPRCP+ GLDAPVSR QTAY+K GCP+NFKFIAQ GIGHEMT EMVKEA
Subjt:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA

Query:  SDWFDRFLIQS
        SDWFD+FL +S
Subjt:  SDWFDRFLIQS
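
In BLAST-style alignments such as the one above, the middle line conventionally repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below (the function name `midline_stats` is hypothetical) counts these for a single alignment segment. Whether the site derives its reported % identity in exactly this way is an assumption.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment segment."""
    aligned = len(query)
    # Trailing blanks of the midline are often stripped; pad it back.
    midline = midline.ljust(aligned)
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return {"aligned": aligned, "identical": identical, "positives": positives}

# Last segment of the alignment above.
stats = midline_stats("SDWFDRFLIQS", "SDWFD+FL +S")
print(stats)
```

Summing `identical` and `aligned` over all segments of a hit and dividing gives an overall identity fraction for that alignment.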

XP_011652622.1 uncharacterized protein LOC101220970 isoform X1 [Cucumis sativus] (e value: 7.76e-221; % identity: 76.21)
Query:  MAPLISHVRLRPSLTCLCLCRAPT-LPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEAN
        MA LISHV L P+ +  CL  APT  PWN Q     K  ++VAARG   +E +QMEEA+VDADKFR EFLRVLRSRRSGE PL+VK   PV +PLIQEAN
Subjt:  MAPLISHVRLRPSLTCLCLCRAPT-LPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEAN

Query:  PPTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDK
        PPTFSK                           EGEQGQLPILI+SMK+SRQQ+RP IVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK K
Subjt:  PPTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDK

Query:  TTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFE
        TTYRDALIS+WK+GDTMPFIFDT WDLIKLADYLT RED+DP RIGITGESLGGMHAWFAAAADTRY+VVVPIIGVQ F WA+D+DKWQARVESIKPVFE
Subjt:  TTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFE

Query:  EARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKE
        EARIELG++EINKE+V+KVWNRIAPGL SQF SIYSVPAIAPRPLLLLNGADDPRCP+ GLDAPVSR QTAY+K GCP+NFKFI Q GIGHEMT EMVKE
Subjt:  EARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKE

Query:  ASDWFDRFLIQS
        ASDWFD+FL +S
Subjt:  ASDWFDRFLIQS

XP_022153900.1 uncharacterized protein LOC111021276 [Momordica charantia] (e value: 2.15e-282; % identity: 93.46)
Query:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
        MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
Subjt:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP

Query:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT
        PTFSK                           EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT
Subjt:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT

Query:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE
        TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE
Subjt:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE

Query:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA
        ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA
Subjt:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA

Query:  SDWFDRFLIQSSG
        SDWFDRFLIQSSG
Subjt:  SDWFDRFLIQSSG

XP_022936568.1 uncharacterized protein LOC111443135 isoform X2 [Cucurbita moschata] (e value: 3.67e-220; % identity: 79.05)
Query:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
        MA LI    LRPSLT LCL  A T PWN Q S+  K  ++VAA G+C     QM EA+VDADKFR+EFLRVLRSRRS E PL+VK  MP     IQE NP
Subjt:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP

Query:  PTFSK--------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALI
        P FSK                    EGEQG+LPILIISMKDSRQQ+RPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK+KTTY DALI
Subjt:  PTFSK--------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALI

Query:  SSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGL
        SSWKRGDTMPFIFDT WDLIKLADYLT+RED+DP RIGITGESLGGMHAWFAAAADTRY+VVVPIIGVQCFRWAID+DKWQARVESIKPVFEEARIELG+
Subjt:  SSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGL

Query:  SEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRF
        +EI+KE+V+KVWNRIAPGL SQF SIYSVPAIAPRPLLLLNGADDPRCPI GLDAPVSRTQ AY+K GCP+NFKFIAQPGIGHEMTPEMVKEAS WFDRF
Subjt:  SEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRF

Query:  L
        L
Subjt:  L

XP_038898085.1 putative esterase YitV [Benincasa hispida] (e value: 1.66e-233; % identity: 79.32)
Query:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
        MA L SHV LRPSLTCL  C   T+PWN + S   K S++VAARGS Q+EA+QMEEA+VDADKFR+EFL VLRSRRSGE PL+VK A PV +PLIQEANP
Subjt:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP

Query:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT
        PTFS+                           EGEQGQLP+LIISMKDSRQQ+RPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK+KT
Subjt:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT

Query:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE
        TYRDALISSWKRGDTMPFIFDT WDLIKLADYLT+RED+DP RIGITGESLGGMHAWFAAAADTRY+VVVPIIGVQCFRWA+D+DKWQARVESIKPVFEE
Subjt:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE

Query:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA
        ARIELG++EINKE+V+KVWNRIAPGL SQF SIYSVPAIAPRPLLLLNGA+DPRCPI GLDAPVSRTQTAY+K GCP+NFKFI QP IGH+MT EMVKEA
Subjt:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA

Query:  SDWFDRFLIQS
        SDWFDRFL Q+
Subjt:  SDWFDRFLIQS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LJX7 Peptidase_S9 domain-containing protein (e value: 3.76e-221; % identity: 76.21)
Query:  MAPLISHVRLRPSLTCLCLCRAPT-LPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEAN
        MA LISHV L P+ +  CL  APT  PWN Q     K  ++VAARG   +E +QMEEA+VDADKFR EFLRVLRSRRSGE PL+VK   PV +PLIQEAN
Subjt:  MAPLISHVRLRPSLTCLCLCRAPT-LPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEAN

Query:  PPTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDK
        PPTFSK                           EGEQGQLPILI+SMK+SRQQ+RP IVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK K
Subjt:  PPTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDK

Query:  TTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFE
        TTYRDALIS+WK+GDTMPFIFDT WDLIKLADYLT RED+DP RIGITGESLGGMHAWFAAAADTRY+VVVPIIGVQ F WA+D+DKWQARVESIKPVFE
Subjt:  TTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFE

Query:  EARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKE
        EARIELG++EINKE+V+KVWNRIAPGL SQF SIYSVPAIAPRPLLLLNGADDPRCP+ GLDAPVSR QTAY+K GCP+NFKFI Q GIGHEMT EMVKE
Subjt:  EARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKE

Query:  ASDWFDRFLIQS
        ASDWFD+FL +S
Subjt:  ASDWFDRFLIQS

A0A1S3CRR2 putative esterase YitV (e value: 1.15e-226; % identity: 76.89)
Query:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
        MA LISHVRL PSL CL  C APT PWN Q     K S++VAARG   +E +QMEEA+VDADKFR+EFLRVLR+RRSGE PL+VK   PV +PLIQEA+P
Subjt:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP

Query:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT
        PTFSK                           EGEQGQLPILI+SMK+SRQQ+RP IVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK KT
Subjt:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT

Query:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE
        TYRDALIS+WK+GDTMPFIFDT WDLIKLADYLT RED+DP RIGITGESLGGMHAWFAAAADTRY+VVVPIIGVQ F WA+D+DKWQARV+SIKPVFEE
Subjt:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE

Query:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA
        ARI+LG++EINKE+V+KVWNRIAPGL SQF SIYSVPAIAPRPLLLLNGADDPRCP+ GLDAPVSR QTAY+K GCP+NFKFIAQ GIGHEMT EMVKEA
Subjt:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA

Query:  SDWFDRFLIQS
        SDWFD+FL +S
Subjt:  SDWFDRFLIQS

A0A5A7TAI4 Putative esterase YitV (e value: 1.15e-226; % identity: 76.89)
Query:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
        MA LISHVRL PSL CL  C APT PWN Q     K S++VAARG   +E +QMEEA+VDADKFR+EFLRVLR+RRSGE PL+VK   PV +PLIQEA+P
Subjt:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP

Query:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT
        PTFSK                           EGEQGQLPILI+SMK+SRQQ+RP IVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK KT
Subjt:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT

Query:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE
        TYRDALIS+WK+GDTMPFIFDT WDLIKLADYLT RED+DP RIGITGESLGGMHAWFAAAADTRY+VVVPIIGVQ F WA+D+DKWQARV+SIKPVFEE
Subjt:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE

Query:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA
        ARI+LG++EINKE+V+KVWNRIAPGL SQF SIYSVPAIAPRPLLLLNGADDPRCP+ GLDAPVSR QTAY+K GCP+NFKFIAQ GIGHEMT EMVKEA
Subjt:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA

Query:  SDWFDRFLIQS
        SDWFD+FL +S
Subjt:  SDWFDRFLIQS

A0A6J1DI56 uncharacterized protein LOC111021276 (e value: 1.04e-282; % identity: 93.46)
Query:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
        MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
Subjt:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP

Query:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT
        PTFSK                           EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT
Subjt:  PTFSK---------------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKT

Query:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE
        TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE
Subjt:  TYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEE

Query:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA
        ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA
Subjt:  ARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEA

Query:  SDWFDRFLIQSSG
        SDWFDRFLIQSSG
Subjt:  SDWFDRFLIQSSG

A0A6J1FDL4 uncharacterized protein LOC111443135 isoform X2 (e value: 1.78e-220; % identity: 79.05)
Query:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP
        MA LI    LRPSLT LCL  A T PWN Q S+  K  ++VAA G+C     QM EA+VDADKFR+EFLRVLRSRRS E PL+VK  MP     IQE NP
Subjt:  MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANP

Query:  PTFSK--------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALI
        P FSK                    EGEQG+LPILIISMKDSRQQ+RPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK+KTTY DALI
Subjt:  PTFSK--------------------EGEQGQLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALI

Query:  SSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGL
        SSWKRGDTMPFIFDT WDLIKLADYLT+RED+DP RIGITGESLGGMHAWFAAAADTRY+VVVPIIGVQCFRWAID+DKWQARVESIKPVFEEARIELG+
Subjt:  SSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGL

Query:  SEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRF
        +EI+KE+V+KVWNRIAPGL SQF SIYSVPAIAPRPLLLLNGADDPRCPI GLDAPVSRTQ AY+K GCP+NFKFIAQPGIGHEMTPEMVKEAS WFDRF
Subjt:  SEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRF

Query:  L
        L
Subjt:  L

SwissProt top hits (e value, % identity, alignment)
A0A1D8EJG8 4-O-methyl-glucuronoyl methylesterase 1 (e value: 5.0e-08; % identity: 25.6)
Query:  AWDLIKLADYL--TEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVP----IIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVE
        AW + ++ D L  T    +DP+R+G+TG S  G  A  A A + R A+ +P      G  C+R +     WQ   +  + V   A+I          + E
Subjt:  AWDLIKLADYL--TEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVP----IIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVE

Query:  KVWNRIAPGLSSQFHSIYSVP--------AIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGH--EMTPEMVKEASDWFDR
         VW  + P  ++  +++ ++P         IAPR L ++  +D            ++  +T +  LG  DNF F    G  H    + +   E + + ++
Subjt:  KVWNRIAPGLSSQFHSIYSVP--------AIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGH--EMTPEMVKEASDWFDR

Query:  FLIQSSG
        FL+QS G
Subjt:  FLIQSSG

B2ABS0 4-O-methyl-glucuronoyl methylesterase (e value: 6.1e-06; % identity: 23.56)
Query:  AWDLIKLADYL---TEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVP----IIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELV
        AW + ++ D L     +  +DP+R+G+TG S  G  A  A A + R A+ +P      G  C+R A     WQ                  + +  + + 
Subjt:  AWDLIKLADYL---TEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVP----IIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELV

Query:  EKVWNRIAPGLSSQFHSIYSVP--------AIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTP--EMVKEASDWFD
        E VW   +P  +S  +++  +P         IAPR L ++   D            +   +  +  LG  DNF +    G  H   P  +   E + + +
Subjt:  EKVWNRIAPGLSSQFHSIYSVP--------AIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTP--EMVKEASDWFD

Query:  RFLIQSSG
        +FL++ SG
Subjt:  RFLIQSSG

O34973 Putative hydrolase YtaP (e value: 1.6e-11; % identity: 26.29)
Query:  SRGYVAIAIDSRYHGER--AKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQ
        S GY  +AID    G+R    +   +++ L++       M       +D +   DY+  R DV P RIG  G S+GG+ AW+ AA D R  V V +    
Subjt:  SRGYVAIAIDSRYHGER--AKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMHAWFAAAADTRYAVVVPIIGVQ

Query:  CFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGC
             +DH         IK                + L    +    P L+  F +      IAPRP L L G  D   P  G+D         Y   G 
Subjt:  CFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPVSRTQTAYRKLGC

Query:  PDNFKFIAQPGIGHEMTPEMVKEASDWFDRFL
         D ++ + +   GH  T  +  EA  +  ++L
Subjt:  PDNFKFIAQPGIGHEMTPEMVKEASDWFDRFL

P29368 Uncharacterized 31.7 kDa protein in traX-finO intergenic region (e value: 8.7e-05; % identity: 28.57)
Query:  RRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESL
        + P I+  H     +  L P    A+   G+  I  D R  GE             S  +RG  +P +     D+I + ++  ++E +D  RIG+ G SL
Subjt:  RRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESL

Query:  GGMHAWFAAAADTRYAVVV
        GG H + A A D R   +V
Subjt:  GGMHAWFAAAADTRYAVVV

Q99390 Uncharacterized 31.7 kDa protein in traX-finO intergenic region (e value: 2.3e-05; % identity: 29.41)
Query:  RRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESL
        + P I+  H     +  L P    A+   G+  I  D R  GE             S  +RG  +P +     D+I + ++  ++E +D  RIG+ G SL
Subjt:  RRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESL

Query:  GGMHAWFAAAADTRYAVVV
        GG H + AAA D R   +V
Subjt:  GGMHAWFAAAADTRYAVVV

Arabidopsis top hits (e value, % identity, alignment)
AT5G25770.1 alpha/beta-Hydrolases superfamily protein (e value: 2.2e-128; % identity: 60.85)
Query:  MEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANPPT-------------------------FSKEGEQGQLPILIISMKDSRQQRRP
        ME  +     FR +FLR+L SRRS + PL    + P+ +PL Q   P T                          +++ EQG+LP+LI+S+K+  +++RP
Subjt:  MEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANPPT-------------------------FSKEGEQGQLPILIISMKDSRQQRRP

Query:  AIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMH
        AIVF+H TN  KEWLRP LEAYASRGYVAI +DSRYHGERA  KT YRDALISSW+ G+TMPFIFDT WDLIKLA+YLT+R+D+DP +IGITG SLGGMH
Subjt:  AIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMH

Query:  AWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRC
        AWFAAAADTRY+VVVP+IGVQ FRWAI++D+W+ARV SIKP+FEEARI+LG + I+KELVEKVWNRIAPGL+S+F S YS+P IAPRPL +LNGA+DPRC
Subjt:  AWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRC

Query:  PIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRFLIQ
        P+GGL+  + R + AY++   P NFKF A+ G+GHE T  M+KE+SDWFD+FL Q
Subjt:  PIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRFLIQ

AT5G25770.2 alpha/beta-Hydrolases superfamily protein (e value: 2.2e-128; % identity: 60.85)
Query:  MEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANPPT-------------------------FSKEGEQGQLPILIISMKDSRQQRRP
        ME  +     FR +FLR+L SRRS + PL    + P+ +PL Q   P T                          +++ EQG+LP+LI+S+K+  +++RP
Subjt:  MEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANPPT-------------------------FSKEGEQGQLPILIISMKDSRQQRRP

Query:  AIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMH
        AIVF+H TN  KEWLRP LEAYASRGYVAI +DSRYHGERA  KT YRDALISSW+ G+TMPFIFDT WDLIKLA+YLT+R+D+DP +IGITG SLGGMH
Subjt:  AIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMH

Query:  AWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRC
        AWFAAAADTRY+VVVP+IGVQ FRWAI++D+W+ARV SIKP+FEEARI+LG + I+KELVEKVWNRIAPGL+S+F S YS+P IAPRPL +LNGA+DPRC
Subjt:  AWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRC

Query:  PIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRFLIQ
        P+GGL+  + R + AY++   P NFKF A+ G+GHE T  M+KE+SDWFD+FL Q
Subjt:  PIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRFLIQ

AT5G25770.3 alpha/beta-Hydrolases superfamily protein (e value: 2.2e-128; % identity: 60.85)
Query:  MEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANPPT-------------------------FSKEGEQGQLPILIISMKDSRQQRRP
        ME  +     FR +FLR+L SRRS + PL    + P+ +PL Q   P T                          +++ EQG+LP+LI+S+K+  +++RP
Subjt:  MEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANPPT-------------------------FSKEGEQGQLPILIISMKDSRQQRRP

Query:  AIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMH
        AIVF+H TN  KEWLRP LEAYASRGYVAI +DSRYHGERA  KT YRDALISSW+ G+TMPFIFDT WDLIKLA+YLT+R+D+DP +IGITG SLGGMH
Subjt:  AIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGITGESLGGMH

Query:  AWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRC
        AWFAAAADTRY+VVVP+IGVQ FRWAI++D+W+ARV SIKP+FEEARI+LG + I+KELVEKVWNRIAPGL+S+F S YS+P IAPRPL +LNGA+DPRC
Subjt:  AWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRC

Query:  PIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRFLIQ
        P+GGL+  + R + AY++   P NFKF A+ G+GHE T  M+KE+SDWFD+FL Q
Subjt:  PIGGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRFLIQ


Sequences
CDS sequence
ATGGCACCTCTCATATCTCACGTGCGACTTCGCCCTTCCCTCACGTGTCTTTGCCTTTGCCGCGCACCAACTTTGCCATGGAACCCGCAAAAATCCGATCCCAATAAACT
CTCGTTCCAGGTTGCAGCCAGGGGAAGTTGTCAACGCGAAGCCAGTCAAATGGAAGAAGCTCTTGTTGACGCCGACAAGTTCCGGAGCGAATTCCTCCGGGTTCTGCGTA
GTCGACGATCTGGAGAAGCTCCGCTAAGTGTGAAGCCCGCAATGCCTGTCCACCACCCTCTGATTCAGGAGGCCAACCCGCCAACCTTCAGTAAGGAAGGAGAGCAAGGC
CAATTGCCCATATTGATTATTAGTATGAAGGATAGCAGACAGCAAAGAAGACCTGCAATTGTTTTTCTGCACAGTACAAACAAGTGCAAAGAGTGGTTGAGACCGTTGCT
TGAGGCTTATGCATCGAGGGGATATGTAGCCATTGCCATTGATTCTCGTTACCATGGTGAAAGGGCCAAGGACAAAACCACTTACCGTGATGCTCTTATATCTTCATGGA
AAAGAGGTGACACCATGCCGTTCATATTTGACACGGCATGGGACTTGATAAAGTTGGCGGATTATCTGACGGAAAGGGAGGATGTTGACCCATCTAGAATAGGGATTACT
GGTGAATCACTTGGAGGAATGCATGCATGGTTTGCTGCTGCTGCTGACACCCGCTATGCTGTGGTTGTCCCCATAATTGGTGTACAGTGTTTTCGATGGGCCATTGATCA
TGATAAGTGGCAGGCCCGAGTTGAGAGTATAAAACCCGTTTTTGAAGAAGCACGAATTGAATTAGGCCTGAGTGAGATCAACAAAGAATTGGTGGAGAAGGTCTGGAATA
GGATTGCTCCTGGTTTGAGTTCCCAATTTCACTCGATTTATTCAGTTCCAGCTATTGCACCACGCCCTTTGTTGCTATTAAATGGTGCAGATGACCCTCGTTGTCCAATT
GGAGGTTTGGATGCTCCCGTATCAAGAACACAGACAGCTTATCGGAAGCTTGGTTGCCCAGACAATTTTAAGTTCATAGCACAACCGGGGATAGGCCACGAAATGACACC
AGAGATGGTAAAAGAAGCTAGCGATTGGTTTGACAGGTTTTTAATCCAGAGTAGTGGCTAG
mRNA sequence
CGAATCGCGCCGATGCTTGCTTGACCCGATTCGCGCCAAAATGGCTCACCGGATCACCGCTTGATGGACGAAACCTAAAACCCATTCCCATCAGACGGTACCTGCCCAGT
GATCACATCGATCTTTAATTTCACCCGTCCGATCATTAATTCCCACATCGAAATTCAATTTCAAACGCACCGACAGAGAAAACCCATTCTCATTAGACGGTACCTTACGA
GTTCTGGATTCCGGCGGCAGTAGCTCTACTAACTTCGCAAGTTTCTCTCCGCTGGAATTCAGATGTAAAGATTGATCGCGGGACATTGATCCGTTTATGTTTTTTGCTTT
CTGATATTCTTTCATCAATCTGATTAGCGTAGCAAACTGGAGCGGTTTCGAATTTTGCCCCAACCTTGGCAGAAGGAGTAGATCTCGTGGCTGATTGCATAACGACGAAT
TAGGGTAGTTCTGTTCTAACAGAGCGGCCAGCTGCACTCGAGGACGGATTCTGAAGCAAAATATCACATAATCCGTATCAATTTTGCGATTACGTTTAGAGACACGGGAA
CGACATTCGCATTTTCGGCCACACCACGAAGTCAGGATATTTCAGTCGCTTTTGTTGGTTTTTGTGGAAAGTGCTCGGGGAGTGCTTCTTCATTCCCAGAAATGTGTCCC
AGACATGACAAGAGGCAGCTCTCGTACTACCTTAGAACGTAATATGCATTGTCGGACCAAAGATTATTATTGTGTGTCAGCAATGCATGCAGTTCCGTGAATCACAAGCC
TCGAGGTATATTTCACAGTGTTCTTTTACTTGTGGATAATATCCGTAGATCATGGTGGCATGCATGTTACATTCAAAAGGGCAGCTAATAATGCAGCCAATTAAGTTCCT
TCTTTTTGGCATGCTACCCAAAAATCGTAATCGAAAGCCAAGACGGTCGTTGTCGACGTTGGGCGAGTGCCTGTTCTGTTGAGTGCAAAATAAAGCAGCCATATTTTATG
GATCAGAAAAAGGAATCGTTCACTGATTAGAAAAAAGAAAAAGAAAAAGAAAAAGAAAAAGCAGCAACTCTGCGTCAGGGTCCAGGGTGTGGACCGCCTTGGCTCGCCTC
TGCGCTCTGCTTCAATGCTCAGCGGCCCCAAATGGCACCTCTCATATCTCACGTGCGACTTCGCCCTTCCCTCACGTGTCTTTGCCTTTGCCGCGCACCAACTTTGCCAT
GGAACCCGCAAAAATCCGATCCCAATAAACTCTCGTTCCAGGTTGCAGCCAGGGGAAGTTGTCAACGCGAAGCCAGTCAAATGGAAGAAGCTCTTGTTGACGCCGACAAG
TTCCGGAGCGAATTCCTCCGGGTTCTGCGTAGTCGACGATCTGGAGAAGCTCCGCTAAGTGTGAAGCCCGCAATGCCTGTCCACCACCCTCTGATTCAGGAGGCCAACCC
GCCAACCTTCAGTAAGGAAGGAGAGCAAGGCCAATTGCCCATATTGATTATTAGTATGAAGGATAGCAGACAGCAAAGAAGACCTGCAATTGTTTTTCTGCACAGTACAA
ACAAGTGCAAAGAGTGGTTGAGACCGTTGCTTGAGGCTTATGCATCGAGGGGATATGTAGCCATTGCCATTGATTCTCGTTACCATGGTGAAAGGGCCAAGGACAAAACC
ACTTACCGTGATGCTCTTATATCTTCATGGAAAAGAGGTGACACCATGCCGTTCATATTTGACACGGCATGGGACTTGATAAAGTTGGCGGATTATCTGACGGAAAGGGA
GGATGTTGACCCATCTAGAATAGGGATTACTGGTGAATCACTTGGAGGAATGCATGCATGGTTTGCTGCTGCTGCTGACACCCGCTATGCTGTGGTTGTCCCCATAATTG
GTGTACAGTGTTTTCGATGGGCCATTGATCATGATAAGTGGCAGGCCCGAGTTGAGAGTATAAAACCCGTTTTTGAAGAAGCACGAATTGAATTAGGCCTGAGTGAGATC
AACAAAGAATTGGTGGAGAAGGTCTGGAATAGGATTGCTCCTGGTTTGAGTTCCCAATTTCACTCGATTTATTCAGTTCCAGCTATTGCACCACGCCCTTTGTTGCTATT
AAATGGTGCAGATGACCCTCGTTGTCCAATTGGAGGTTTGGATGCTCCCGTATCAAGAACACAGACAGCTTATCGGAAGCTTGGTTGCCCAGACAATTTTAAGTTCATAG
CACAACCGGGGATAGGCCACGAAATGACACCAGAGATGGTAAAAGAAGCTAGCGATTGGTTTGACAGGTTTTTAATCCAGAGTAGTGGCTAGTGGCTTTTCTTGGCCCTG
CGGATCTTGTGGAAATCTATGATGCATTTGAGAAGTTAAGTGCAAGTGAAGAAATAAAGTTGAATCTTATTCCTGTACGAGAAAAACAATTCCCATCAGTTCTAATGGAA
AGTCATTGCTTGCTTGAAACATCTGTCATTTTATGTCTTTCATGCTTGAGCTGGATCAAAACTCGATAAAGACAAGACAGTTTGAGTTGTAGTAAAGCTACGTAAGTAGA
TTTCAATGAATTTCGTGAGGAAAAAAAAATGTAAACACCCATACGGCCTTTATTACAAAATTAAATCAACACAATATTCTGAATTCCAATGACAAAATTCCTACAAGGCC
TAAAAACCTAAATCAAAGAAAGCCTAATGGTGTAGTATATACAGTGGACGTCACCCAGCCCCAGATATGCAGCGTAGGAATTGGACCAAGGAGCAAAATAGGCGGAAAAT
ATAGAAGAGAGAGCCAAAAAGATTTAGAGGAAGTGAAATCAATTTTCACCTGCTACCAGCAGTTGCAGGAGTAGGTTTCATGGACTTGAACTTTGCCAGGTCAACGAGAT
CACCAAAAAGATTATCCTCAGGCCCAGAAGGCCTGCTGGGAGGCACATTAGACGAGGTAGAAACTCGGTAAGATGAATTTCTTATGCCATTACCATCTCTAATGGATAGC
CCATACATCCGCTGCTCCAGGTATTGTGTCCTCTGTGGCTGGACGTGGCCATGGCCAGAGAATCGATTGCCAAACATTTGCAGAGGATATAAGGCTGCCATCTGCCCCGT
CTGCCGAGAAAGCATGCCCCCAGAAGATCCTCCATAATCGAGCTGATTGCTGAAACTAACCTGGCCATTAAGTGCTGCAATCTGGCTGCCCACAATTGGCTGGATATACA
CACCTACAACTTGGTCATTCAAAACTGGCTGAGGTCCGTGGGGATATGTACCACTCTGTACATGCGTGAAAACGACCTGAGTCACCTTCATTGGGTGAGGATACTCTTCA
CCTGTTACCGGGCCATTATCATCCACAGGCTCCGCCTCCCAAGGTGGTGGCGGAAATGACTCGCTGTTTTGAGAACCTGATCAACC
Protein sequence
MAPLISHVRLRPSLTCLCLCRAPTLPWNPQKSDPNKLSFQVAARGSCQREASQMEEALVDADKFRSEFLRVLRSRRSGEAPLSVKPAMPVHHPLIQEANPPTFSKEGEQG
QLPILIISMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKDKTTYRDALISSWKRGDTMPFIFDTAWDLIKLADYLTEREDVDPSRIGIT
GESLGGMHAWFAAAADTRYAVVVPIIGVQCFRWAIDHDKWQARVESIKPVFEEARIELGLSEINKELVEKVWNRIAPGLSSQFHSIYSVPAIAPRPLLLLNGADDPRCPI
GGLDAPVSRTQTAYRKLGCPDNFKFIAQPGIGHEMTPEMVKEASDWFDRFLIQSSG
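
The protein sequence above should correspond to a translation of the CDS sequence. As a rough consistency check, the sketch below translates a nucleotide string with the standard genetic code. The `translate` helper is hypothetical, and using the standard nuclear code for this gene is an assumption. Only the first and last few codons of the CDS are used here; the full CDS can be passed in the same way.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last six codons of the CDS listed above.
print(translate("ATGGCACCTCTCATATCTCACGTGCGACTT"))
print(translate("ATCCAGAGTAGTGGCTAG"))
```

The two fragments translate to the start (`MAPLISHVRL`) and end (`IQSSG`) of the listed protein sequence, which is consistent with the CDS and protein entries describing the same product.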