CuGenDBv2

Moc04g20250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g20250
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:14743834..14748058
RNA-Seq Expression: Moc04g20250
Synteny: Moc04g20250
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
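
The genome location above follows a `chromosome:start..end` pattern. The following is a minimal Python sketch of splitting such a string into its parts. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper name `parse_location` is hypothetical.

```python
import re

# Pattern for strings such as "chr4:14743834..14748058".
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    match = _LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return match.group("chrom"), start, end


def span_length(start, end):
    """Length of the interval under the 1-based inclusive assumption."""
    return end - start + 1


chrom, start, end = parse_location("chr4:14743834..14748058")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans more bases than the CDS listed further down, which would be consistent with the locus containing non-coding sequence.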


Homology
GenBank top hits (hit, e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]; e value: 1.1e-264; % identity: 90.34
Query:  QVESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASD
        + ESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLND DLGESPFTSDVLEA IPPKFKAPTVKPYD SKDPKDYVEVFE LMDFQAASD
Subjt:  QVESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA +ISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  T-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIE
        T                         ELLRTKTG+PERKIG GRSGKDIE ADPKSKDKGSFSSGRAEYR AENGPTR +PYERFTPTTIPISEILTNIE
Subjt:  T-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAP+RRSKDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKE+RKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVD G SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]; e value: 4.6e-202; % identity: 87.2
Query:  KEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQ
        K+  LND DLGES FTSDVLEA IPPKFKAPTVKPYD SKDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPAR+ISTYSQLRREFLAQ
Subjt:  KEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT-------------------------ELLRTKTGQPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT                         ELLRTKTG+P+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT-------------------------ELLRTKTGQPERKI

Query:  GWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDC
        G GRSGKD+E+ADPKSKDKGSFSSGRAEYR AE+GPT+ +PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP+RRSKDKYCRFHREH HNTSDC
Subjt:  GWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE+RKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ PTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]; e value: 5.0e-265; % identity: 77.42
Query:  SSTQQVESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQ
        SS QQ ESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLND DLGESPFTSDVLE        APTVK YD SKDPKDYVEVFEGLMDFQ
Subjt:  SSTQQVESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALT-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEIL
        DEALT                         ELLRTKTG+PER I  GRSGKD EKAD KSKDKGSFSSGRAE+R A NGPTR +PYERFTPTTIPISEIL
Subjt:  DEALT-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAP+RR+KDKYCRFHREHDHNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKE+RK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  ASRDGTLEFKVNLPRWEFAAPTEELELVPLLSPKKQKPLRVGSKNLANEQD
         SRDGTLEFK NLPR EFAAPTEELELVPLL  K        ++N+ +EQ+
Subjt:  ASRDGTLEFKVNLPRWEFAAPTEELELVPLLSPKKQKPLRVGSKNLANEQD

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]; e value: 3.9e-209; % identity: 85.27
Query:  MCYFLTGLADEALT-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTP
        MCYFLTGLADEALT                         ELLRTK GQ       GRSGKD+E  DPKSKDKGSFS+GRAEYR AENGPTR +PYERFTP
Subjt:  MCYFLTGLADEALT-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAP+RRSKDKYCRFHREH HNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKE+RKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVD GASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLASRDGTLEFKVNLPRWEFAAPTEELELVPLLSPKKQKPL
        +SVCALETL SRDGTLEF+ +LP  EFAAP EELELVPLLS +KQ  L
Subjt:  SSVCALETLASRDGTLEFKVNLPRWEFAAPTEELELVPLLSPKKQKPL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]; e value: 1.0e-257; % identity: 63.89
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHTPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAH P+ SKA                                  
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHTPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSMEEMYNGMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSTQQVESSRNPA
                                                                                                     ESS NP 
Subjt:  RTKMRSMEEMYNGMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSTQQVESSRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQVEALKA+CE+KE   +D DLGE  F+SD+LEALIPPKFK PT+KPYD SKDPKDYVEVFE LMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT--------
        ALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LT        
Subjt:  ALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT--------

Query:  -----------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKG-SFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIEESGMEKLL
                         ELLRTKTG+PE+ I  GR+GKD  KAD KS+DKG S SS R +YR + +   + +PYE +TPTTIPI EILTNIEE+GMEKLL
Subjt:  -----------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKG-SFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIEESGMEKLL

Query:  KRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE
        KRPEKLRG P++R+ DKYCRFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKE+RKR RTPPRR DRPAVIN             K+KE
Subjt:  KRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE

Query:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCI
        LAR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVD GASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCI
Subjt:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCI

Query:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD
        DLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD
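
Each alignment block above pairs a Query row with a match row in which an identical residue is repeated, a conservative substitution is shown as `+`, and a mismatch or gap is left blank. The following is a minimal Python sketch of counting identities from those two rows. The function name `percent_identity` is hypothetical, and dividing by the full aligned length (gaps included) is an assumption; the % identity values reported on this page cover the whole hit and may be computed differently.

```python
def percent_identity(query_row, match_row):
    """Percent of aligned columns where the match row repeats the query residue.

    Assumption: the denominator is the full aligned length, gaps included.
    """
    if not query_row:
        raise ValueError("query row is empty")
    # Trailing blanks in the match row are often lost when text is copied,
    # so pad it back to the length of the query row.
    padded = match_row.ljust(len(query_row))
    identical = sum(
        1
        for q, m in zip(query_row, padded)
        if q != "-" and q == m
    )
    return 100.0 * identical / len(query_row)


# Last block of the first GenBank hit: one column differs.
query = "SVIPEGCIDLPVTLGQDQTQVTQMAEFV"
match = "SVIPEG IDLPVTLGQDQTQVTQMAEFV"
print(round(percent_identity(query, match), 2))
```

A `+` in the match row is not counted as an identity here, since it marks a similar but different residue.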

TrEMBL top hits (hit, e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813; e value: 5.3e-265; % identity: 90.34
Query:  QVESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASD
        + ESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLND DLGESPFTSDVLEA IPPKFKAPTVKPYD SKDPKDYVEVFE LMDFQAASD
Subjt:  QVESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA +ISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  T-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIE
        T                         ELLRTKTG+PERKIG GRSGKDIE ADPKSKDKGSFSSGRAEYR AENGPTR +PYERFTPTTIPISEILTNIE
Subjt:  T-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAP+RRSKDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKE+RKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVD G SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823; e value: 2.4e-265; % identity: 77.42
Query:  SSTQQVESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQ
        SS QQ ESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLND DLGESPFTSDVLE        APTVK YD SKDPKDYVEVFEGLMDFQ
Subjt:  SSTQQVESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALT-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEIL
        DEALT                         ELLRTKTG+PER I  GRSGKD EKAD KSKDKGSFSSGRAE+R A NGPTR +PYERFTPTTIPISEIL
Subjt:  DEALT-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAP+RR+KDKYCRFHREHDHNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKE+RK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  ASRDGTLEFKVNLPRWEFAAPTEELELVPLLSPKKQKPLRVGSKNLANEQD
         SRDGTLEFK NLPR EFAAPTEELELVPLL  K        ++N+ +EQ+
Subjt:  ASRDGTLEFKVNLPRWEFAAPTEELELVPLLSPKKQKPLRVGSKNLANEQD

A0A6J1D9W7 uncharacterized protein LOC111018708; e value: 2.2e-202; % identity: 87.2
Query:  KEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQ
        K+  LND DLGES FTSDVLEA IPPKFKAPTVKPYD SKDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPAR+ISTYSQLRREFLAQ
Subjt:  KEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT-------------------------ELLRTKTGQPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT                         ELLRTKTG+P+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT-------------------------ELLRTKTGQPERKI

Query:  GWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDC
        G GRSGKD+E+ADPKSKDKGSFSSGRAEYR AE+GPT+ +PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP+RRSKDKYCRFHREH HNTSDC
Subjt:  GWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE+RKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ PTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DD03 uncharacterized protein LOC111019899; e value: 1.9e-209; % identity: 85.27
Query:  MCYFLTGLADEALT-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTP
        MCYFLTGLADEALT                         ELLRTK GQ       GRSGKD+E  DPKSKDKGSFS+GRAEYR AENGPTR +PYERFTP
Subjt:  MCYFLTGLADEALT-------------------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAENGPTRIQPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAP+RRSKDKYCRFHREH HNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKE+RKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVD GASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLASRDGTLEFKVNLPRWEFAAPTEELELVPLLSPKKQKPL
        +SVCALETL SRDGTLEF+ +LP  EFAAP EELELVPLLS +KQ  L
Subjt:  SSVCALETLASRDGTLEFKVNLPRWEFAAPTEELELVPLLSPKKQKPL

A0A6J1DHB3 uncharacterized protein LOC111020479; e value: 4.8e-258; % identity: 63.89
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHTPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAH P+ SKA                                  
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHTPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSMEEMYNGMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSTQQVESSRNPA
                                                                                                     ESS NP 
Subjt:  RTKMRSMEEMYNGMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSTQQVESSRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQVEALKA+CE+KE   +D DLGE  F+SD+LEALIPPKFK PT+KPYD SKDPKDYVEVFE LMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT--------
        ALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LT        
Subjt:  ALTGSARLWYRRLPARTISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT--------

Query:  -----------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKG-SFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIEESGMEKLL
                         ELLRTKTG+PE+ I  GR+GKD  KAD KS+DKG S SS R +YR + +   + +PYE +TPTTIPI EILTNIEE+GMEKLL
Subjt:  -----------------ELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKG-SFSSGRAEYRGAENGPTRIQPYERFTPTTIPISEILTNIEESGMEKLL

Query:  KRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE
        KRPEKLRG P++R+ DKYCRFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKE+RKR RTPPRR DRPAVIN             K+KE
Subjt:  KRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE

Query:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCI
        LAR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVD GASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCI
Subjt:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCI

Query:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD
        DLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCTAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACACCCCAAGAACATCCAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATGGAGGAAATG
TATAACGGAATGATATTAGCTGCAGGCGCGGGGTCCCGATCTGAGAATCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCTGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCCCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCACCCAGCAGGTTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGTCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATAGCGACTTGGGAGAATCGCCCTTCACCTCAGACGTTTTGGAAGCACTGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGAGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTCGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCA
AATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGGCTGCCAGCCAGGACGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCA
ATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGAGCTCCTCCGAACCAAAACCGGCCAACCAGAAC
GAAAGATCGGTTGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAAGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAGGGGCGGAGAAC
GGACCTACCAGGATCCAACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCC
TGAGAAGCTTCGGGGAGCCCCGAAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGACCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTG
AGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCCGAGAAAAAGGAACAGCGGAAGCGTTCGAGGACGCCGCCCCGACGCACT
GACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAG
GGAGCAGAGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGG
TCAGGAGGGTGCTGGTAGACAGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTTGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTG
GTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGA
CGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGTGTGG
GCACAGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCAGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGGGACGCTCGAG
TTTAAGGTCAACCTGCCGAGGTGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTACTTAGTCCCAAGAAGCAAAAACCCTTGAGAGTTGGATCTAAGAA
CCTTGCTAACGAACAAGACCCATAG
mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCTAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACACCCCAAGAACATCCAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATGGAGGAAATG
TATAACGGAATGATATTAGCTGCAGGCGCGGGGTCCCGATCTGAGAATCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCTGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCCCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCACCCAGCAGGTTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGTCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATAGCGACTTGGGAGAATCGCCCTTCACCTCAGACGTTTTGGAAGCACTGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGAGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTCGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCA
AATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGGCTGCCAGCCAGGACGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCA
ATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGAGCTCCTCCGAACCAAAACCGGCCAACCAGAAC
GAAAGATCGGTTGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAAGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAGGGGCGGAGAAC
GGACCTACCAGGATCCAACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCC
TGAGAAGCTTCGGGGAGCCCCGAAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGACCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTG
AGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCCGAGAAAAAGGAACAGCGGAAGCGTTCGAGGACGCCGCCCCGACGCACT
GACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAG
GGAGCAGAGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGG
TCAGGAGGGTGCTGGTAGACAGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTTGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTG
GTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGA
CGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGTGTGG
GCACAGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCAGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGGGACGCTCGAG
TTTAAGGTCAACCTGCCGAGGTGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTACTTAGTCCCAAGAAGCAAAAACCCTTGAGAGTTGGATCTAAGAA
CCTTGCTAACGAACAAGACCCATAG
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHTPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAMRTKMRSMEEM
YNGMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSTQQVESSRNPATPAGVITREEFDQLRGQLDA
QVEALKAKCEQKEGPLNDSDLGESPFTSDVLEALIPPKFKAPTVKPYDESKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARTISTYSQLRREFLA
QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTELLRTKTGQPERKIGWGRSGKDIEKADPKSKDKGSFSSGRAEYRGAEN
GPTRIQPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPKRRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEQRKRSRTPPRRT
DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKKSPTPL
VGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLE
FKVNLPRWEFAAPTEELELVPLLSPKKQKPLRVGSKNLANEQDP
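
The protein sequence above corresponds to a codon-by-codon translation of the CDS sequence. The following is a minimal Python sketch of that translation. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and the function name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base T, C, A, G at each of the three positions.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate_cds(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last three codons of the CDS listed above.
print(translate_cds("ATGGTTCAACCCGCAAACTCGACCAATACG"))
print(translate_cds("GACCCATAG"))
```

Pasting the full CDS into `translate_cds` and comparing the result with the listed protein sequence is a quick consistency check on the record.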