; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g09720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g09720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:8158135..8163714
RNA-Seq ExpressionMoc09g09720
SyntenyMoc09g09720
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.3e-26794.84Show/hide
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPA
        G+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYD +KD KDYVEVFE LMDFQAASDAIKCR F+IALTGSARLWYRRLPA
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPA

Query:  GSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVID
         SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEALTVKLGEEAPA+FAEVLQKAKKVID
Subjt:  GSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKD
        GQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKD
Subjt:  GQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG
        KYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCIIREQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG

Query:  PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQM
        PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQDQT+VTQM
Subjt:  PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQM

Query:  AEFV
        AEFV
Subjt:  AEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]4.7e-22294.79Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPAGSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD +KD KDYVEVFEGLMDF AASDAIKCR FQIALTGSARLWYRRLPA SISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPAGSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEALTVKLGE+AP +FAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.7e-26481.56Show/hide
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPA
        G+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YD +KD KDYVEVFEGLMDFQAASDAIKCR FQIALTGSARLW      
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPA

Query:  GSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVID
                                                       FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPA+FAEVLQKAKKVID
Subjt:  GSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDK
        GQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+KDK
Subjt:  GQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDK

Query:  YCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP
        YCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ P
Subjt:  YCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP

Query:  TCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMA
        TCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG DQT+VTQMA
Subjt:  TCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMA

Query:  EFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAIETL--RDGTLEFEADLPRKEFAAPTEELELVP
        EFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCA+ETL  RDGTLEF+A+LPR+EFAAPTEELELVP
Subjt:  EFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAIETL--RDGTLEFEADLPRKEFAAPTEELELVP

Query:  LL
        LL
Subjt:  LL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]7.4e-22892.6Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPA+FAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYA+ LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKG

Query:  SSVCAIETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQV
        +SVCA+ETL  RDGTLEFEADLP +EFAAP EELELVPLLS EKQV
Subjt:  SSVCAIETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQV

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.8e-26466.62Show/hide
Query:  MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPFRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        M QPANSTNT DRR LAA++ HQREVGA  VEGQGH+ L TEP  RSARIT P LPPAHP+ SKA                                   
Subjt:  MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPFRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAEGELDAQVEALKAKCEQ
              E+ YN +                                           TR+  D                     + + DAQVEALKA+CE+
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAEGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPAGSISTYSQLRREFLAQ
        K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYD +KD KDYVEVFE LMDFQAA+DAIKC  FQIALTGSARLWYRRLPA  ISTYSQLR+EF++Q
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPAGSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH SDDSAMCYFLTGLADE LTVKL EEAPA+FAEVLQK KKVIDGQELLRTKTGRPE+ I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD
         +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+
Subjt:  GRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD

Query:  CWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEV
         WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQ PT  I F+ ADLE V
Subjt:  CWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEV

Query:  HLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAI
        HLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVV+DGRSAYNAI
Subjt:  HLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAI

Query:  FGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAIE--TLRD
        FGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCA+E  T+RD
Subjt:  FGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAIE--TLRD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088136.1e-26894.84Show/hide
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPA
        G+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYD +KD KDYVEVFE LMDFQAASDAIKCR F+IALTGSARLWYRRLPA
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPA

Query:  GSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVID
         SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEALTVKLGEEAPA+FAEVLQKAKKVID
Subjt:  GSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKD
        GQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKD
Subjt:  GQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG
        KYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCIIREQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG

Query:  PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQM
        PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQDQT+VTQM
Subjt:  PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQM

Query:  AEFV
        AEFV
Subjt:  AEFV

A0A6J1D9E1 uncharacterized protein LOC1110188238.2e-26581.56Show/hide
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPA
        G+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YD +KD KDYVEVFEGLMDFQAASDAIKCR FQIALTGSARLW      
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPA

Query:  GSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVID
                                                       FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPA+FAEVLQKAKKVID
Subjt:  GSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDK
        GQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+KDK
Subjt:  GQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDK

Query:  YCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP
        YCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ P
Subjt:  YCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP

Query:  TCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMA
        TCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG DQT+VTQMA
Subjt:  TCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMA

Query:  EFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAIETL--RDGTLEFEADLPRKEFAAPTEELELVP
        EFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCA+ETL  RDGTLEF+A+LPR+EFAAPTEELELVP
Subjt:  EFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAIETL--RDGTLEFEADLPRKEFAAPTEELELVP

Query:  LL
        LL
Subjt:  LL

A0A6J1D9W7 uncharacterized protein LOC1110187082.3e-22294.79Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPAGSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD +KD KDYVEVFEGLMDF AASDAIKCR FQIALTGSARLWYRRLPA SISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPAGSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEALTVKLGE+AP +FAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DD03 uncharacterized protein LOC1110198993.6e-22892.6Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPA+FAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYA+ LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKG

Query:  SSVCAIETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQV
        +SVCA+ETL  RDGTLEFEADLP +EFAAP EELELVPLLS EKQV
Subjt:  SSVCAIETL--RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQV

A0A6J1DHB3 uncharacterized protein LOC1110204791.8e-26466.62Show/hide
Query:  MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPFRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        M QPANSTNT DRR LAA++ HQREVGA  VEGQGH+ L TEP  RSARIT P LPPAHP+ SKA                                   
Subjt:  MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPFRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAEGELDAQVEALKAKCEQ
              E+ YN +                                           TR+  D                     + + DAQVEALKA+CE+
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAEGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPAGSISTYSQLRREFLAQ
        K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYD +KD KDYVEVFE LMDFQAA+DAIKC  FQIALTGSARLWYRRLPA  ISTYSQLR+EF++Q
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPAGSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH SDDSAMCYFLTGLADE LTVKL EEAPA+FAEVLQK KKVIDGQELLRTKTGRPE+ I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD
         +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+
Subjt:  GRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD

Query:  CWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEV
         WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQ PT  I F+ ADLE V
Subjt:  CWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEV

Query:  HLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAI
        HLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVV+DGRSAYNAI
Subjt:  HLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAI

Query:  FGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAIE--TLRD
        FGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCA+E  T+RD
Subjt:  FGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAIE--TLRD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAAGGGCAAGGTCACGA
CGGCCTACCAACGGAACCCTTCCGCAGGTCGGCACGGATCACCGCGCCCGCCCTACCGCCCGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTG
GAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGAGACGAAGGACCTCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGC
GGCATCAGACGCAATCAAATGCCGCACCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCGGGTCGATCTCGACCTACTCTCAGCTGA
GAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGCGAGACGCTGCGGGAGTATGTCACC
AGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTACTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGA
GGCCCCGGCCAGCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCA
GAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCT
TACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCC
GGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCT
ACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCTCGGCGCACCGACCGACCTGCGGTCATCAAT
ACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCC
AATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACG
GGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCG
GTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGC
CATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACTCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGA
CCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCAATCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAG
TTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTCGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGAT
GGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACAGACTTCATTAGGGGCAATTCACCACAAGATCCCAAGGAGCGCAAAAAGTTGGCAAGGCGGGCAGCTC
GAGTAGAGCATTACGAGCCTACGACGAATGAGGATGGGTTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGATAATCAGGGC
AGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGA
GGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATT
ATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAAGGGCAAGGTCACGA
CGGCCTACCAACGGAACCCTTCCGCAGGTCGGCACGGATCACCGCGCCCGCCCTACCGCCCGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTG
GAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGAGACGAAGGACCTCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGC
GGCATCAGACGCAATCAAATGCCGCACCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCGGGTCGATCTCGACCTACTCTCAGCTGA
GAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGCGAGACGCTGCGGGAGTATGTCACC
AGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTACTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGA
GGCCCCGGCCAGCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCA
GAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCT
TACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCC
GGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCT
ACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCTCGGCGCACCGACCGACCTGCGGTCATCAAT
ACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCC
AATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACG
GGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCG
GTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGC
CATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACTCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGA
CCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCAATCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAG
TTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTCGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGAT
GGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACAGACTTCATTAGGGGCAATTCACCACAAGATCCCAAGGAGCGCAAAAAGTTGGCAAGGCGGGCAGCTC
GAGTAGAGCATTACGAGCCTACGACGAATGAGGATGGGTTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGATAATCAGGGC
AGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGA
GGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATT
ATCCTTGA
Protein sequenceShow/hide protein sequence
MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPFRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMY
NEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSDNHKGGVRPAEGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL
EAPIPPKFKAPTVKPYDETKDLKDYVEVFEGLMDFQAASDAIKCRTFQIALTGSARLWYRRLPAGSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT
RFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPASFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRP
YERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGES
VIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAIETLRDGTLEFEADLPRKE
FAAPTEELELVPLLSPEKQVVPVEILDNPSILEPDLMEIGAPESSWMDPITDFIRGNSPQDPKERKKLARRAARVEHYEPTTNEDGLLLNLDLLEERRAMAQLRLADNQG
RMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYP