; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g20260 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g20260
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:14586858..14592442
RNA-Seq ExpressionMoc07g20260
SyntenyMoc07g20260
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.7e-24886.36Show/hide
Query:  QAESSHN---PTGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASD
        +AESS N   P G+ITREEFDQLRG+LDAQVEALKAKCEQ +  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDP DYVEVFE LMDFQAASD
Subjt:  QAESSHN---PTGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFT TTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA------------
        +SGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRT+RPA            
Subjt:  DSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA------------

Query:  ----------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGE
                                PTCPITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQL++SPTPLVGFSGE
Subjt:  ----------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITQMAEFV
        SVIPEG IDLPVTLGQ+QT++TQMAEFV
Subjt:  SVIPEGCIDLPVTLGQNQTRITQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]8.0e-24671.39Show/hide
Query:  SSNQQAESSHNPT---GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQ
        SSNQQAESSHNP    G+ITREEFDQLRG+L+AQVEALKAKCEQ +  LNDGDLGESPFTSDVLE        APTVK YDG+KDP DYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPT---GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFT TTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------
        NIE+SGMEKLLKRPEKLRGA ERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR +RPA         
Subjt:  NIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------

Query:  -------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGF
                                   PTCPITFD  DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQL++S TPLVGF
Subjt:  -------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALET--
        S ESVIPEGCIDLPVTLG +QT++TQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TP+GVG VRGEQ ASRECYA+ALKG SVCALET  
Subjt:  SGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALET--

Query:  RRDGTLEFKADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEP
         RDGTLEFKA+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+      D+   G PEP
Subjt:  RRDGTLEFKADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEP

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]4.1e-25062.78Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQRKVGAAAAEGQGHDGLAAEPPRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQR+VGA   EGQGH+ L  EP  R ARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQRKVGAAAAEGQGHDGLAAEPPRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMR

Query:  TQMRSMEAMYNDMVLAAGAGSRSENRATRIDAREQRGSHLGPAEEEHPEDNGSEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNDMVLAAGAGSRSENRATRIDAREQRGSHLGPAEEEHPEDNGSEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  TGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+ + S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDP DYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  TGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +T TTIPI EILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------------------H
        PEKLRG  E+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR +RPA                      
Subjt:  PEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------------------H

Query:  GPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQ
         PT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQL++SPTPLVGFSGES+  EGCIDLPV++ Q+ T++TQ
Subjt:  GPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQ

Query:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE
        MAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T +GVGTVRGE   SRECYA+  K  SVCALE
Subjt:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]2.5e-19997.07Show/hide
Query:  GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQ DDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDP DYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFT TTIPI EILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPE

Query:  KLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA
        KLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRT+RPA
Subjt:  KLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]3.4e-19669.3Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTRTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  +++R AD KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTRTTIP

Query:  ISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---
        ISEILTNIE+SGMEKLLKRPEKLRG LE+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKRSRTPPRR +RPA   
Subjt:  ISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---

Query:  -------------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSP
                                       H PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  -------------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVC
                      GCIDLPVT+GQ+ T++TQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TP+ VG VRGEQ  SRECYA+ALKG +VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVC

Query:  ALE--TRRDGTLEFKADLP---RKEFAAPTEELELVPLLSPEKQ
        ALE  T R    E +ADLP   +++F  PTEELELVPLLSPE+Q
Subjt:  ALE--TRRDGTLEFKADLP---RKEFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088138.3e-24986.36Show/hide
Query:  QAESSHN---PTGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASD
        +AESS N   P G+ITREEFDQLRG+LDAQVEALKAKCEQ +  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDP DYVEVFE LMDFQAASD
Subjt:  QAESSHN---PTGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFT TTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA------------
        +SGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRT+RPA            
Subjt:  DSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA------------

Query:  ----------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGE
                                PTCPITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQL++SPTPLVGFSGE
Subjt:  ----------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITQMAEFV
        SVIPEG IDLPVTLGQ+QT++TQMAEFV
Subjt:  SVIPEGCIDLPVTLGQNQTRITQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.9e-24671.39Show/hide
Query:  SSNQQAESSHNPT---GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQ
        SSNQQAESSHNP    G+ITREEFDQLRG+L+AQVEALKAKCEQ +  LNDGDLGESPFTSDVLE        APTVK YDG+KDP DYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPT---GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFT TTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------
        NIE+SGMEKLLKRPEKLRGA ERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR +RPA         
Subjt:  NIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------

Query:  -------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGF
                                   PTCPITFD  DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQL++S TPLVGF
Subjt:  -------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALET--
        S ESVIPEGCIDLPVTLG +QT++TQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TP+GVG VRGEQ ASRECYA+ALKG SVCALET  
Subjt:  SGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALET--

Query:  RRDGTLEFKADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEP
         RDGTLEFKA+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+      D+   G PEP
Subjt:  RRDGTLEFKADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEP

A0A6J1DHB3 uncharacterized protein LOC1110204792.0e-25062.78Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQRKVGAAAAEGQGHDGLAAEPPRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQR+VGA   EGQGH+ L  EP  R ARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQRKVGAAAAEGQGHDGLAAEPPRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMR

Query:  TQMRSMEAMYNDMVLAAGAGSRSENRATRIDAREQRGSHLGPAEEEHPEDNGSEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNDMVLAAGAGSRSENRATRIDAREQRGSHLGPAEEEHPEDNGSEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  TGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+ + S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDP DYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  TGIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +T TTIPI EILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------------------H
        PEKLRG  E+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR +RPA                      
Subjt:  PEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---------------------H

Query:  GPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQ
         PT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQL++SPTPLVGFSGES+  EGCIDLPV++ Q+ T++TQ
Subjt:  GPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQ

Query:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE
        MAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T +GVGTVRGE   SRECYA+  K  SVCALE
Subjt:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE

A0A6J1DS95 uncharacterized protein LOC1110234211.2e-19997.07Show/hide
Query:  GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQ DDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDP DYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFT TTIPI EILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPE

Query:  KLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA
        KLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRT+RPA
Subjt:  KLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA

A0A6J1DZB9 uncharacterized protein LOC1110249041.6e-19669.3Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTRTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  +++R AD KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTRTTIP

Query:  ISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---
        ISEILTNIE+SGMEKLLKRPEKLRG LE+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKRSRTPPRR +RPA   
Subjt:  ISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTNRPA---

Query:  -------------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSP
                                       H PTC ITF   DLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  -------------------------------HGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVC
                      GCIDLPVT+GQ+ T++TQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TP+ VG VRGEQ  SRECYA+ALKG +VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVC

Query:  ALE--TRRDGTLEFKADLP---RKEFAAPTEELELVPLLSPEKQ
        ALE  T R    E +ADLP   +++F  PTEELELVPLLSPE+Q
Subjt:  ALE--TRRDGTLEFKADLP---RKEFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGTGATGCCCACCAGAGGAAGGTCGGAGCAGCGGCGGCAGAAGGGCAAGGTCACGA
CGGCCTGGCAGCGGAACCCCCCCGCAGGTTGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTA
AGAAGGGCGCCCGAGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGACATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACA
TCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAGTCGCCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCACAGGGATAATCACAAGGGAGGAATTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAACGACGATTCACTGAACGATGGCGACTTGGGAGAATCACCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAATGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGCTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGACACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGC
ACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTCGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAAGACAAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCGAACCACGATTCC
AATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTACTGCC
GCTTCCATCGAGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGG
ACTAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCAACCGACCTGCGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGA
CTTGGAGGAGGTACACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGAGCATCCGCTAACATCC
TGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATC
GACTTACCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCAT
CCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATG
CCGCCGCACTCAAAGGCCCATCGGTTTGCGCCCTCGAAACTCGCAGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAG
CTCGAGCTTGTTCCTCTGCTTAGCCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGA
ACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAA
GACAGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTACGAGCAATGGCCCAG
CTACGCCTGGCGGAATATCAGGGCAAGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGT
GGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTAGGACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGGA
ACGCGGAACACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGTGATGCCCACCAGAGGAAGGTCGGAGCAGCGGCGGCAGAAGGGCAAGGTCACGA
CGGCCTGGCAGCGGAACCCCCCCGCAGGTTGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTA
AGAAGGGCGCCCGAGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGACATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACA
TCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAGTCGCCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCACAGGGATAATCACAAGGGAGGAATTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAACGACGATTCACTGAACGATGGCGACTTGGGAGAATCACCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAATGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGCTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGACACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGC
ACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTCGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAAGACAAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCGAACCACGATTCC
AATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTACTGCC
GCTTCCATCGAGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGG
ACTAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCAACCGACCTGCGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGA
CTTGGAGGAGGTACACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGAGCATCCGCTAACATCC
TGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATC
GACTTACCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCAT
CCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATG
CCGCCGCACTCAAAGGCCCATCGGTTTGCGCCCTCGAAACTCGCAGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAG
CTCGAGCTTGTTCCTCTGCTTAGCCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGA
ACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAA
GACAGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTACGAGCAATGGCCCAG
CTACGCCTGGCGGAATATCAGGGCAAGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGT
GGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTAGGACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGGA
ACGCGGAACACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQRKVGAAAAEGQGHDGLAAEPPRRLARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRTQMRSMEAMY
NDMVLAAGAGSRSENRATRIDAREQRGSHLGPAEEEHPEDNGSEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPTGIITREEFDQLRGELDAQVEA
LKAKCEQNDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPNDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSS
RHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADP
KSKDKGSFSSGRAEYRRAENGPTRSRPYERFTRTTIPISEILTNIEDSGMEKLLKRPEKLRGALERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPG
TSSAEKKEERKRSRTPPRRTNRPAHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCI
DLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETRRDGTLEFKADLPRKEFAAPTEE
LELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEPSWMDPIADFIRGNSPQDPKERRKLARQAARFVIRDGALYRRGFSLPLLKCLTPEEGLRAMAQ
LRLAEYQGKMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPRTYVLADPKGDVLAHPWNAEHLKRYYP