; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g04540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g04540
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:3919617..3922057
RNA-Seq ExpressionMoc07g04540
SyntenyMoc07g04540
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]5.6e-23983.14Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCE K+  LNDGDLGESPFTS+VLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLR+EFLA FSSRHYDKKTATHLATI+QKEGETL EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSGRADPGEAQQG-------------------------
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E AD KSK+KGSFSSGRA+   A+ G                         
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSGRADPGEAQQG-------------------------

Query:  ------------------------KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
                                KYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ------------------------KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.2e-24274.01Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRGKL+AQVEALKAKCE K+  LNDGDLGESPFTS+VLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSGRAD----------------------------
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+ADLKSK+KGSFSSGRA+                            
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSGRAD----------------------------

Query:  ---------------------PGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG
                             P    + KYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  ---------------------PGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVG
        PSGGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS+TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAI GRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLPRKEFAAPTEELELVPLL
          RDGTLEF+A+LPR+EFAAPTEELELVPLL
Subjt:  AGRDGTLEFEADLPRKEFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.5e-24162.24Show/hide
Query:  MVQPTNSTNTTDRRTLAASDAHQREVGAATVEGQGHDGLATGPLHRSARITA----PAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMAMRT
        MVQP NSTNT DRR LAA+  HQREVGA  VEGQGH+ L T PL RSARIT     PAHP+ SK                                    
Subjt:  MVQPTNSTNTTDRRTLAASDAHQREVGAATVEGQGHDGLATGPLHRSARITA----PAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMAMRT

Query:  QMRSMEEMYNEMMLAAGAGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSCSHRSSNQQAESSHNP--AGIITREEFDQLRGKLDAQVEA
                                                                             AESS+NP   G+ITREEFDQL+ K DAQVEA
Subjt:  QMRSMEEMYNEMMLAAGAGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSCSHRSSNQQAESSHNP--AGIITREEFDQLRGKLDAQVEA

Query:  LKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQL
        LKA+CE K+ S +DGDLGE  F+S++LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQL
Subjt:  LKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQL

Query:  RKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT
        RKEF++QFSSRHYD+KT THLATI+QKEGETL EYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKT
Subjt:  RKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT

Query:  GRPERKIGRGRSGKDVERADLKSKEKGSFSSG--------------------------------------------------RADPGEAQQGKYCRFHRE
        GRPE+ I +GR+GKD  +AD KS++KG  SS                                                   R DP +    KYCRFHR+
Subjt:  GRPERKIGRGRSGKDVERADLKSKEKGSFSSG--------------------------------------------------RADPGEAQQGKYCRFHRE

Query:  HGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFD
        HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQ PT  I F+
Subjt:  HGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFD

Query:  GADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDG
         ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKS TPLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVVIDG
Subjt:  GADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDG

Query:  RSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        RSAYNAI GRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  RSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]7.8e-20966.51Show/hide
Query:  PAEEERPEDNESEGYTRQRGDLREHL-NRKRGSSLRKGQSPSCSHR--SSNQQAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDG
        P   E   ++E   Y+ +  DLR+HL ++K+ +S     S S S    +SN +A+S +    P  +I REEFD ++ + D QVEALKA+CE K+   +D 
Subjt:  PAEEERPEDNESEGYTRQRGDLREHL-NRKRGSSLRKGQSPSCSHR--SSNQQAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDG

Query:  DLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDK
        DLGESPFTS+++EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLRKEF+ QFS RHYD+
Subjt:  DLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDK

Query:  KTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD
        KTATHLATI+QKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I + R  + 
Subjt:  KTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD

Query:  VERADLKSKEKGSFSSG------RADPGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
          + D KSK+KGS SSG      R++ G ++   Y R           CWELKRQIEDLIQD YFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  VERADLKSKEKGSFSSG------RADPGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSS
        TIFGGPSGGQ  +KRKELA  ARR+V IIREQ PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLAL  TRSQLKKS 
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSS

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC
        TPLVGFS ESV PEGCIDLPVT+GQD T+VTQMAEFVVIDGR AYNAI  RPIIHSF+A+PS LHQVLKY TPNGVGTVRGEQ  SRECYASALK SSVC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADLPRK
        ALE    +D       DLPR+
Subjt:  ALETLAGRDGTLEFEADLPRK

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]3.7e-19067.4Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEF++QFSS HYD+KTATHLATI+QKE ETL EYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSG---------------------------
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  ++  +AD KS++KGS SS                            
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSG---------------------------

Query:  -----------------------RADPGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
                               R D  +  + KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  -----------------------RADPGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSS
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSS

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAI GRPIIHSFRA+PSTLHQVLKY TPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADLP---RKEFAAPTEELELVPLLSPEK
        ALE    R    E EADLP   +++F  PTEELELVPLLSPE+
Subjt:  ALETLAGRDGTLEFEADLP---RKEFAAPTEELELVPLLSPEK

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.7e-23983.14Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCE K+  LNDGDLGESPFTS+VLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLR+EFLA FSSRHYDKKTATHLATI+QKEGETL EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSGRADPGEAQQG-------------------------
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E AD KSK+KGSFSSGRA+   A+ G                         
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSGRADPGEAQQG-------------------------

Query:  ------------------------KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
                                KYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ------------------------KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.5e-24274.01Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRGKL+AQVEALKAKCE K+  LNDGDLGESPFTS+VLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSGRAD----------------------------
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+ADLKSK+KGSFSSGRA+                            
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSGRAD----------------------------

Query:  ---------------------PGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG
                             P    + KYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  ---------------------PGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVG
        PSGGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS+TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAI GRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLPRKEFAAPTEELELVPLL
          RDGTLEF+A+LPR+EFAAPTEELELVPLL
Subjt:  AGRDGTLEFEADLPRKEFAAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204791.7e-24162.24Show/hide
Query:  MVQPTNSTNTTDRRTLAASDAHQREVGAATVEGQGHDGLATGPLHRSARITA----PAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMAMRT
        MVQP NSTNT DRR LAA+  HQREVGA  VEGQGH+ L T PL RSARIT     PAHP+ SK                                    
Subjt:  MVQPTNSTNTTDRRTLAASDAHQREVGAATVEGQGHDGLATGPLHRSARITA----PAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMAMRT

Query:  QMRSMEEMYNEMMLAAGAGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSCSHRSSNQQAESSHNP--AGIITREEFDQLRGKLDAQVEA
                                                                             AESS+NP   G+ITREEFDQL+ K DAQVEA
Subjt:  QMRSMEEMYNEMMLAAGAGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSCSHRSSNQQAESSHNP--AGIITREEFDQLRGKLDAQVEA

Query:  LKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQL
        LKA+CE K+ S +DGDLGE  F+S++LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQL
Subjt:  LKAKCEPKDDSLNDGDLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQL

Query:  RKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT
        RKEF++QFSSRHYD+KT THLATI+QKEGETL EYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKT
Subjt:  RKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT

Query:  GRPERKIGRGRSGKDVERADLKSKEKGSFSSG--------------------------------------------------RADPGEAQQGKYCRFHRE
        GRPE+ I +GR+GKD  +AD KS++KG  SS                                                   R DP +    KYCRFHR+
Subjt:  GRPERKIGRGRSGKDVERADLKSKEKGSFSSG--------------------------------------------------RADPGEAQQGKYCRFHRE

Query:  HGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFD
        HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQ PT  I F+
Subjt:  HGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFD

Query:  GADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDG
         ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKS TPLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVVIDG
Subjt:  GADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDG

Query:  RSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        RSAYNAI GRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  RSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DPC9 uncharacterized protein LOC1110222803.8e-20966.51Show/hide
Query:  PAEEERPEDNESEGYTRQRGDLREHL-NRKRGSSLRKGQSPSCSHR--SSNQQAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDG
        P   E   ++E   Y+ +  DLR+HL ++K+ +S     S S S    +SN +A+S +    P  +I REEFD ++ + D QVEALKA+CE K+   +D 
Subjt:  PAEEERPEDNESEGYTRQRGDLREHL-NRKRGSSLRKGQSPSCSHR--SSNQQAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDG

Query:  DLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDK
        DLGESPFTS+++EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLRKEF+ QFS RHYD+
Subjt:  DLGESPFTSNVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDK

Query:  KTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD
        KTATHLATI+QKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I + R  + 
Subjt:  KTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD

Query:  VERADLKSKEKGSFSSG------RADPGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
          + D KSK+KGS SSG      R++ G ++   Y R           CWELKRQIEDLIQD YFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  VERADLKSKEKGSFSSG------RADPGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSS
        TIFGGPSGGQ  +KRKELA  ARR+V IIREQ PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLAL  TRSQLKKS 
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSS

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC
        TPLVGFS ESV PEGCIDLPVT+GQD T+VTQMAEFVVIDGR AYNAI  RPIIHSF+A+PS LHQVLKY TPNGVGTVRGEQ  SRECYASALK SSVC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADLPRK
        ALE    +D       DLPR+
Subjt:  ALETLAGRDGTLEFEADLPRK

A0A6J1DZB9 uncharacterized protein LOC1110249041.8e-19067.4Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEF++QFSS HYD+KTATHLATI+QKE ETL EYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSG---------------------------
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  ++  +AD KS++KGS SS                            
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSG---------------------------

Query:  -----------------------RADPGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
                               R D  +  + KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  -----------------------RADPGEAQQGKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSS
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSS

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAI GRPIIHSFRA+PSTLHQVLKY TPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADLP---RKEFAAPTEELELVPLLSPEK
        ALE    R    E EADLP   +++F  PTEELELVPLLSPE+
Subjt:  ALETLAGRDGTLEFEADLP---RKEFAAPTEELELVPLLSPEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCACAAACTCGACCAATACAACGGACCGAAGGACCCTAGCAGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAACGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCCACAGGACCTCTCCACAGGTCGGCGCGGATCACCGCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCC
GGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGACGCACTCCAGAGAGAGATGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTA
GCTGCAGGCGCAGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTC
GTCTCTCCGAAAAGGGCAGTCACCATCCTGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGC
TGAGGGGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCCGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGAACGTT
TTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCA
AGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTACCAGCCAGGTCGATCTCGACCTACTCTCAGC
TGAGAAAGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAAGCAGAAGGAGGGTGAGACGCTGCCGGAATATGTC
ACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGA
GGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGTCGACCGGAACGAAAGATCGGCCGGG
GCAGAAGTGGAAAAGATGTAGAAAGGGCAGATCTCAAGTCCAAAGAGAAGGGATCCTTTTCCAGCGGCCGAGCTGACCCCGGAGAGGCGCAGCAAGGAAAGTATTGCCGC
TTCCATCGGGAGCACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGAC
CAGCTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGT
CCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAG
GTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGAAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACC
GACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCTCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGG
TCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCCTTGGGAGACCCATCATCCACTCATTT
CGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACT
CAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCG
AGCTTGTTCCTCTGCTTAGTCCCGAGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCACAAACTCGACCAATACAACGGACCGAAGGACCCTAGCAGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAACGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCCACAGGACCTCTCCACAGGTCGGCGCGGATCACCGCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCC
GGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGACGCACTCCAGAGAGAGATGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTA
GCTGCAGGCGCAGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTC
GTCTCTCCGAAAAGGGCAGTCACCATCCTGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGC
TGAGGGGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCCGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGAACGTT
TTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCA
AGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTACCAGCCAGGTCGATCTCGACCTACTCTCAGC
TGAGAAAGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAAGCAGAAGGAGGGTGAGACGCTGCCGGAATATGTC
ACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGA
GGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGTCGACCGGAACGAAAGATCGGCCGGG
GCAGAAGTGGAAAAGATGTAGAAAGGGCAGATCTCAAGTCCAAAGAGAAGGGATCCTTTTCCAGCGGCCGAGCTGACCCCGGAGAGGCGCAGCAAGGAAAGTATTGCCGC
TTCCATCGGGAGCACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGAC
CAGCTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGT
CCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAG
GTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGAAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACC
GACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCTCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGG
TCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCCTTGGGAGACCCATCATCCACTCATTT
CGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACT
CAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCG
AGCTTGTTCCTCTGCTTAGTCCCGAGAAGTAG
Protein sequenceShow/hide protein sequence
MVQPTNSTNTTDRRTLAASDAHQREVGAATVEGQGHDGLATGPLHRSARITAPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMAMRTQMRSMEEMYNEMML
AAGAGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSCSHRSSNQQAESSHNPAGIITREEFDQLRGKLDAQVEALKAKCEPKDDSLNDGDLGESPFTSNV
LEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIKQKEGETLPEYV
TRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADLKSKEKGSFSSGRADPGEAQQGKYCR
FHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEE
VHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSSTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAILGRPIIHSF
RAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEK