; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:13744753..13750292
RNA-Seq ExpressionMoc03g20370
SyntenyMoc03g20370
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.5e-27692.99Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGFSGE
        QSG KRKELARAARREVC+IRE  PTCPITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SAN+LSLPTYLALGWTRSQL++SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITQMAEFV
        SVIPEG IDLPVTLGQ+QT++TQMAEFV
Subjt:  SVIPEGCIDLPVTLGQNQTRITQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]3.6e-22595.26Show/hide
Query:  KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPI EILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVH
        WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC+IRE GPTCPITFDG D EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.6e-27376.45Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGES FTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGF
        SGGQSGHKRKELARAARREVC+IRE  PTCPITFD  DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SAN++SL TYLALGWTRSQL++S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETL-
        S ESVIPEGCIDLPVTLG +QT++TQMAEFVV+DGRSAYNAIFGRPIIHSF AIPST+HQVLKY TP+GVG VRGEQ ASRECYA+ALKG SVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEP
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+      D+   G PEP
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEP

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]2.9e-22289.91Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP
        TTIPI EILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVC+IRE  PTCPITFD  DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN+LSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQL

Query:  RRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKG
        ++SPTPLVGFSGESV+PEGCIDLPVTLGQ+QTR+TQMAEFVVVDGRSAYNAIFGRPIIHSF AIPST+HQVLKY TP+GVGTVRGEQTASRECYA+ LKG
Subjt:  RRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKG

Query:  PSVCALETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
         SVCALETL  RDGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  PSVCALETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.4e-26965.14Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAVAAEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA   EGQGH+ L  EPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAVAAEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMR

Query:  FMEAMYNDMVLAAGAGSRSENRATRIDACEQRGSHLGPAEEERPEDNGSEGYTHQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--AGII
                                                                                               AESS+NP   G+I
Subjt:  FMEAMYNDMVLAAGAGSRSENRATRIDACEQRGSHLGPAEEERPEDNGSEGYTHQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--AGII

Query:  TREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSA
        TREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE SF+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSA
Subjt:  TREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSA

Query:  RLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEV
        RLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEV
Subjt:  RLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEV

Query:  LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKL
        LQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPIFEILTNIE++GMEKLLKRPEKL
Subjt:  LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKL

Query:  RGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR
        RG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN             K+KELAR AR
Subjt:  RGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR

Query:  REVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTL
        REVC+IRE  PT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN+LSL TYLALGWTRSQL++SPTPLVGFSGES+  EGCIDLPV++
Subjt:  REVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTL

Query:  GQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE--TLRD
         Q+ T++TQMAEFVV+DGRSAYNAIFGRPIIHSF A+PST+HQVLKY T +GVGTVRGE   SRECYA+  K  SVCALE  T+RD
Subjt:  GQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE--TLRD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.2e-27692.99Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGFSGE
        QSG KRKELARAARREVC+IRE  PTCPITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SAN+LSLPTYLALGWTRSQL++SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITQMAEFV
        SVIPEG IDLPVTLGQ+QT++TQMAEFV
Subjt:  SVIPEGCIDLPVTLGQNQTRITQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.7e-27376.45Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGES FTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGF
        SGGQSGHKRKELARAARREVC+IRE  PTCPITFD  DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SAN++SL TYLALGWTRSQL++S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETL-
        S ESVIPEGCIDLPVTLG +QT++TQMAEFVV+DGRSAYNAIFGRPIIHSF AIPST+HQVLKY TP+GVG VRGEQ ASRECYA+ALKG SVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEP
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+      D+   G PEP
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEP

A0A6J1D9W7 uncharacterized protein LOC1110187081.8e-22595.26Show/hide
Query:  KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPI EILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVH
        WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC+IRE GPTCPITFDG D EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DD03 uncharacterized protein LOC1110198991.4e-22289.91Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP
        TTIPI EILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVC+IRE  PTCPITFD  DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN+LSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQL

Query:  RRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKG
        ++SPTPLVGFSGESV+PEGCIDLPVTLGQ+QTR+TQMAEFVVVDGRSAYNAIFGRPIIHSF AIPST+HQVLKY TP+GVGTVRGEQTASRECYA+ LKG
Subjt:  RRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKG

Query:  PSVCALETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
         SVCALETL  RDGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  PSVCALETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204796.8e-27065.14Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAVAAEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA   EGQGH+ L  EPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAVAAEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMR

Query:  FMEAMYNDMVLAAGAGSRSENRATRIDACEQRGSHLGPAEEERPEDNGSEGYTHQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--AGII
                                                                                               AESS+NP   G+I
Subjt:  FMEAMYNDMVLAAGAGSRSENRATRIDACEQRGSHLGPAEEERPEDNGSEGYTHQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--AGII

Query:  TREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSA
        TREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE SF+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSA
Subjt:  TREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSA

Query:  RLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEV
        RLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEV
Subjt:  RLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEV

Query:  LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKL
        LQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPIFEILTNIE++GMEKLLKRPEKL
Subjt:  LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKL

Query:  RGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR
        RG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN             K+KELAR AR
Subjt:  RGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR

Query:  REVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTL
        REVC+IRE  PT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN+LSL TYLALGWTRSQL++SPTPLVGFSGES+  EGCIDLPV++
Subjt:  REVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTL

Query:  GQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE--TLRD
         Q+ T++TQMAEFVV+DGRSAYNAIFGRPIIHSF A+PST+HQVLKY T +GVGTVRGE   SRECYA+  K  SVCALE  T+RD
Subjt:  GQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE--TLRD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTGGCGGCAGAAGGGCAAGGTCACGA
CGGCCTGGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCTTCATGGAGGCAATGTATAACGACATGGTG
CTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCATGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAA
CGGGAGTGAGGGGTACACTCACCAGAAGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAATCGCCATCCCGCTCCCACAGGAGCT
CCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAATTCGACCAGCTAAGGGGGGAGCTCGATGCTCAAGTGGAGGCCCTAAAGGCCAAA
TGTGAGCAGAAGGACGATTCACTGAACGATGGCGACTTGGGAGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAA
GCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGC
TTACTGGCAGCGCGCGTTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACGAC
AAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTTACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGA
TGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGA
AAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAAGAC
AAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTTCGAGAT
CCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTACTGCCGCTTCCATCGGG
AGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACTAGCTCAGCA
GAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCGACCGACCTGCGGTCATCAACACCATTTTTGGAGGACCAAGCGGGGGTCAATCCGGGCATAA
AAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCGTCATCAGGGAGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACATCTGC
CCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGCGCATCCGCTAACGTCCTGTCCTTACCGACCTACCTC
GCCTTGGGCTGGACGAGGTCGCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATCGACTTGCCGGTCACGCTGGG
GCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTTGGGCCATTC
CTTCAACAGTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCA
TCAGTTTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCT
TAGCCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCG
GCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACAGGCAGCTCGGTTCGTG
ATCCGAGATGGGGCATTGTACCGACGTGGTTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTTCGACAAATGAGGAAGA
GCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGACGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGTAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTC
GGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGG
ACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTGGCGGCAGAAGGGCAAGGTCACGA
CGGCCTGGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCTTCATGGAGGCAATGTATAACGACATGGTG
CTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCATGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAA
CGGGAGTGAGGGGTACACTCACCAGAAGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAATCGCCATCCCGCTCCCACAGGAGCT
CCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAATTCGACCAGCTAAGGGGGGAGCTCGATGCTCAAGTGGAGGCCCTAAAGGCCAAA
TGTGAGCAGAAGGACGATTCACTGAACGATGGCGACTTGGGAGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAA
GCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGC
TTACTGGCAGCGCGCGTTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACGAC
AAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTTACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGA
TGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGA
AAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAAGAC
AAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTTCGAGAT
CCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTACTGCCGCTTCCATCGGG
AGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACTAGCTCAGCA
GAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCGACCGACCTGCGGTCATCAACACCATTTTTGGAGGACCAAGCGGGGGTCAATCCGGGCATAA
AAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCGTCATCAGGGAGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACATCTGC
CCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGCGCATCCGCTAACGTCCTGTCCTTACCGACCTACCTC
GCCTTGGGCTGGACGAGGTCGCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATCGACTTGCCGGTCACGCTGGG
GCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTTGGGCCATTC
CTTCAACAGTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCA
TCAGTTTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCT
TAGCCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCG
GCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACAGGCAGCTCGGTTCGTG
ATCCGAGATGGGGCATTGTACCGACGTGGTTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTTCGACAAATGAGGAAGA
GCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGACGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGTAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTC
GGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGG
ACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAVAAEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRFMEAMYNDMV
LAAGAGSRSENRATRIDACEQRGSHLGPAEEERPEDNGSEGYTHQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAK
CEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYD
KKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKD
KGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSA
EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYL
ALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGP
SVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEPSWMDPIADFIRGNSPQDPKERRKLARQAARFV
IRDGALYRRGFSLPLLKCLTPEEGLVEHYEPSTNEEELLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPG
TYVLADPKGDVLAHP