; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g01200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g01200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:856002..858361
RNA-Seq ExpressionMoc03g01200
SyntenyMoc03g01200
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.9e-25086.93Show/hide
Query:  QAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASD
        +AESS NPATP GVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL+APIPPK KAPTVKPYDGSK+PKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASD

Query:  AIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCR+F+IAL GSARLWYRRLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKD-EKADPKSKNKGSFSSGRAEFRRA----------------------------
        TVKLGEEAP TFAEVLQKAKKVIDGQE LRTKT RPER+IGR RS KD E ADPKSK+KGSFSSGRAE+RRA                            
Subjt:  TVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKD-EKADPKSKNKGSFSSGRAEFRRA----------------------------

Query:  -SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGG
         SGMEKLLKRPEK RGA ERRSKDKY RF+REHGHNTSD WELKRQIE+LIQD YFKKFVGKPRTSSAEKKEER+RSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  -SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SIIPEGCIDLPVTLGQDQTQVTQMAEFV
        S+IPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SIIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]3.2e-18984.12Show/hide
Query:  KEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQ
        K+  LNDGDLGES FTSDVL+APIPPK KAPTVKPYDGSK+PKDYVEVFEGLMDF AASDAIKCR+FQIAL GSARLWYRRLPARSISTY+QLRREFLAQ
Subjt:  KEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+APTTFAEVLQKAKKVIDGQE LRTKT RP+R+I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRI

Query:  GRDRSEKD-EKADPKSKNKGSFSSGRAEFRRA-----------------------------SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDC
        GR RS KD E+ADPKSK+KGSFSSGRAE+RRA                             SGMEKLLKRPEK RGA ERRSKDKY RF+REHGHNTSDC
Subjt:  GRDRSEKD-EKADPKSKNKGSFSSGRAEFRRA-----------------------------SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDC

Query:  WELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVH
        WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEER+RSRTPPRRTDRPAVINTIFGGPSGGQSG+KRKELARAARREVCIIREQ PTCPITFD  D EEVH
Subjt:  WELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]6.1e-24177.26Show/hide
Query:  SSNQQAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGESPFTSDV        L+APTVK YDGSK+PKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQ

Query:  AASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCR+FQIAL GSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKDEKADPKSKNKGSFSSGRAEFRRA-------------------------
        DEALTVKLG+EAP TFAEVLQKAKKVIDGQE LRTKT RPER I R RS KDEKAD KSK+KGSFSSGRAEFRRA                         
Subjt:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKDEKADPKSKNKGSFSSGRAEFRRA-------------------------

Query:  ----SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGP
            SGMEKLLKRPEK RGA ERR+KDKY RF+REH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEER+ SRTP RR DRPAVINTIFGGP
Subjt:  ----SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF
        SGGQSG+KRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGF
Subjt:  SGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF

Query:  SGESIIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET
        S ES+IPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALET
Subjt:  SGESIIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]9.3e-25063.18Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGASVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L T+PL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGASVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGPHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGPHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIA
        P GVITR EFDQL+ K DAQVEALKA+CE+KE   +DGDLGE  F+SD+L+A IPPK K PT+KPYDGSK+PKDYVEVFE LMDFQAA+DAIKC +FQIA
Subjt:  PVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIA

Query:  LAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPT
        L GSARLWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP 
Subjt:  LAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPT

Query:  TFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKDE-KADPKSKNKG-SFSSGRAEFRRA-----------------------------SGMEKLLK
        TFAEVLQK KKVIDGQE LRTKT RPE+ I + R+ KD+ KAD KS++KG S SS R ++RR+                             +GMEKLLK
Subjt:  TFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKDE-KADPKSKNKG-SFSSGRAEFRRA-----------------------------SGMEKLLK

Query:  RPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKEL
        RPEK RG  E+R+ DKY RF+R+HGHNTS+ WELKRQIEDLIQD YFKKFVGKPR++S EKKEER+R RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKEL

Query:  ARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESIIPEGCID
        AR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGESI  EGCID
Subjt:  ARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESIIPEGCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALE
        LPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALE

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]4.7e-20966.45Show/hide
Query:  EQRGPHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQ
        E+  P + P   E   ++E   ++ R  DLR+HL  K+  +  + +   S SR   +SN +A+S + P  P  VI R EFD ++ + D QVEALKA+CE+
Subjt:  EQRGPHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQ
        KE P +D DLGESPFTSD+++APIPPK K PT+KPYDGSK+PKDYVEVFEGLMDFQAA+DAIKC +FQIAL GSARLW RRLPARSISTY+QLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRI
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAP TFAEVLQ AKKVIDGQE LRTKT RPE++I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRI

Query:  GRDR-SEKDEKADPKSKNKGSFSSG-RAEFRRASGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSA
         + R S+K  K D KSK+KGS SSG R E+RR+         P + R          Y R           CWELKRQIEDLIQD YFKKFVGKPR++S 
Subjt:  GRDR-SEKDEKADPKSKNKGSFSSG-RAEFRRASGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSA

Query:  EKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS
        EKKEER+RSRTPPRR DRPAVINTIFGGPSGGQ   KRKELA  ARR+V IIREQ+PTC ITF  TDLE VHLPHNDALVIAPLIDHV+VRRVLVDGGAS
Subjt:  EKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS

Query:  ANILSLPTYLALGWTRSQLKKSPTPLVGFSGESIIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVG
        ANILSLPTYLAL  TRSQLKKSPTPLVGFS ES+ PEGCIDLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYSTPNGVG
Subjt:  ANILSLPTYLALGWTRSQLKKSPTPLVGFSGESIIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVG

Query:  TVRGEQTASRECYASALKGSSVCALE
        TVRGEQ  SRECYASALK SSVCALE
Subjt:  TVRGEQTASRECYASALKGSSVCALE

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088139.1e-25186.93Show/hide
Query:  QAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASD
        +AESS NPATP GVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL+APIPPK KAPTVKPYDGSK+PKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASD

Query:  AIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCR+F+IAL GSARLWYRRLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKD-EKADPKSKNKGSFSSGRAEFRRA----------------------------
        TVKLGEEAP TFAEVLQKAKKVIDGQE LRTKT RPER+IGR RS KD E ADPKSK+KGSFSSGRAE+RRA                            
Subjt:  TVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKD-EKADPKSKNKGSFSSGRAEFRRA----------------------------

Query:  -SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGG
         SGMEKLLKRPEK RGA ERRSKDKY RF+REHGHNTSD WELKRQIE+LIQD YFKKFVGKPRTSSAEKKEER+RSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  -SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SIIPEGCIDLPVTLGQDQTQVTQMAEFV
        S+IPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SIIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188232.9e-24177.26Show/hide
Query:  SSNQQAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEGPLNDGDLGESPFTSDV        L+APTVK YDGSK+PKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQ

Query:  AASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCR+FQIAL GSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKDEKADPKSKNKGSFSSGRAEFRRA-------------------------
        DEALTVKLG+EAP TFAEVLQKAKKVIDGQE LRTKT RPER I R RS KDEKAD KSK+KGSFSSGRAEFRRA                         
Subjt:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKDEKADPKSKNKGSFSSGRAEFRRA-------------------------

Query:  ----SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGP
            SGMEKLLKRPEK RGA ERR+KDKY RF+REH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEER+ SRTP RR DRPAVINTIFGGP
Subjt:  ----SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF
        SGGQSG+KRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGF
Subjt:  SGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF

Query:  SGESIIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET
        S ES+IPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALET
Subjt:  SGESIIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET

A0A6J1D9W7 uncharacterized protein LOC1110187081.5e-18984.12Show/hide
Query:  KEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQ
        K+  LNDGDLGES FTSDVL+APIPPK KAPTVKPYDGSK+PKDYVEVFEGLMDF AASDAIKCR+FQIAL GSARLWYRRLPARSISTY+QLRREFLAQ
Subjt:  KEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+APTTFAEVLQKAKKVIDGQE LRTKT RP+R+I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRI

Query:  GRDRSEKD-EKADPKSKNKGSFSSGRAEFRRA-----------------------------SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDC
        GR RS KD E+ADPKSK+KGSFSSGRAE+RRA                             SGMEKLLKRPEK RGA ERRSKDKY RF+REHGHNTSDC
Subjt:  GRDRSEKD-EKADPKSKNKGSFSSGRAEFRRA-----------------------------SGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDC

Query:  WELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVH
        WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEER+RSRTPPRRTDRPAVINTIFGGPSGGQSG+KRKELARAARREVCIIREQ PTCPITFD  D EEVH
Subjt:  WELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204794.5e-25063.18Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGASVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L T+PL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGASVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGPHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGPHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIA
        P GVITR EFDQL+ K DAQVEALKA+CE+KE   +DGDLGE  F+SD+L+A IPPK K PT+KPYDGSK+PKDYVEVFE LMDFQAA+DAIKC +FQIA
Subjt:  PVGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIA

Query:  LAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPT
        L GSARLWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP 
Subjt:  LAGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPT

Query:  TFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKDE-KADPKSKNKG-SFSSGRAEFRRA-----------------------------SGMEKLLK
        TFAEVLQK KKVIDGQE LRTKT RPE+ I + R+ KD+ KAD KS++KG S SS R ++RR+                             +GMEKLLK
Subjt:  TFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKDE-KADPKSKNKG-SFSSGRAEFRRA-----------------------------SGMEKLLK

Query:  RPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKEL
        RPEK RG  E+R+ DKY RF+R+HGHNTS+ WELKRQIEDLIQD YFKKFVGKPR++S EKKEER+R RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKEL

Query:  ARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESIIPEGCID
        AR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGESI  EGCID
Subjt:  ARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESIIPEGCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALE
        LPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALE

A0A6J1DPC9 uncharacterized protein LOC1110222802.3e-20966.45Show/hide
Query:  EQRGPHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQ
        E+  P + P   E   ++E   ++ R  DLR+HL  K+  +  + +   S SR   +SN +A+S + P  P  VI R EFD ++ + D QVEALKA+CE+
Subjt:  EQRGPHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPATPVGVITRAEFDQLRGKLDAQVEALKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQ
        KE P +D DLGESPFTSD+++APIPPK K PT+KPYDGSK+PKDYVEVFEGLMDFQAA+DAIKC +FQIAL GSARLW RRLPARSISTY+QLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRI
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAP TFAEVLQ AKKVIDGQE LRTKT RPE++I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRI

Query:  GRDR-SEKDEKADPKSKNKGSFSSG-RAEFRRASGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSA
         + R S+K  K D KSK+KGS SSG R E+RR+         P + R          Y R           CWELKRQIEDLIQD YFKKFVGKPR++S 
Subjt:  GRDR-SEKDEKADPKSKNKGSFSSG-RAEFRRASGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSA

Query:  EKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS
        EKKEER+RSRTPPRR DRPAVINTIFGGPSGGQ   KRKELA  ARR+V IIREQ+PTC ITF  TDLE VHLPHNDALVIAPLIDHV+VRRVLVDGGAS
Subjt:  EKKEERQRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS

Query:  ANILSLPTYLALGWTRSQLKKSPTPLVGFSGESIIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVG
        ANILSLPTYLAL  TRSQLKKSPTPLVGFS ES+ PEGCIDLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYSTPNGVG
Subjt:  ANILSLPTYLALGWTRSQLKKSPTPLVGFSGESIIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVG

Query:  TVRGEQTASRECYASALKGSSVCALE
        TVRGEQ  SRECYASALK SSVCALE
Subjt:  TVRGEQTASRECYASALKGSSVCALE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACACTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCATCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAAAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATATTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTCCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCGGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGTAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCATTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGAAAGCACCGATCCCTCCGAA
GTTAAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGAACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GTCGCTCCTTTCAGATCGCGCTTGCCGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAG
TTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAATT
GAAGGTCGCACACTGCTCTGATGACTCGGCCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCTCTCACGGTGAAGCTTGGAGAGGAGGCCCCGACCACCTTCGCCG
AAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGTTCCTCCGAACCAAAACCAGCCGACCAGAACGAAGGATCGGCCGGGATAGAAGCGAAAAAGATGAAAAG
GCGGATCCCAAGTCCAAGAACAAGGGATCTTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCATCGGGGAGC
CCTGGAGAGGCGCAGCAAGGACAAGTATGTCCGCTTCTATCGGGAGCACGGCCACAACACGTCGGATTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATT
GCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGCAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATC
AACACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGATATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTG
CCCAATCACCTTCGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAG
ACGGAGGCGCGTCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAA
TCGATCATCCCGGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAAGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAA
CGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAAC
AGACCGCTTCGAGGGAGTGCTACGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCGCCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACACTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCATCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAAAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATATTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTCCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCGGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGTAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCATTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGAAAGCACCGATCCCTCCGAA
GTTAAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGAACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GTCGCTCCTTTCAGATCGCGCTTGCCGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAG
TTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAATT
GAAGGTCGCACACTGCTCTGATGACTCGGCCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCTCTCACGGTGAAGCTTGGAGAGGAGGCCCCGACCACCTTCGCCG
AAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGTTCCTCCGAACCAAAACCAGCCGACCAGAACGAAGGATCGGCCGGGATAGAAGCGAAAAAGATGAAAAG
GCGGATCCCAAGTCCAAGAACAAGGGATCTTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCATCGGGGAGC
CCTGGAGAGGCGCAGCAAGGACAAGTATGTCCGCTTCTATCGGGAGCACGGCCACAACACGTCGGATTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATT
GCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGCAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATC
AACACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGATATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTG
CCCAATCACCTTCGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAG
ACGGAGGCGCGTCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAA
TCGATCATCCCGGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAAGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAA
CGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAAC
AGACCGCTTCGAGGGAGTGCTACGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCGCCAGTAG
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGASVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEEMY
NEMILAAGAGSRSENRMTRIDIREQRGPHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPVGVITRAEFDQLRGKLDAQ
VEALKAKCEQKEGPLNDGDLGESPFTSDVLKAPIPPKLKAPTVKPYDGSKNPKDYVEVFEGLMDFQAASDAIKCRSFQIALAGSARLWYRRLPARSISTYAQLRREFLAQ
FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQEFLRTKTSRPERRIGRDRSEKDEK
ADPKSKNKGSFSSGRAEFRRASGMEKLLKRPEKHRGALERRSKDKYVRFYREHGHNTSDCWELKRQIEDLIQDCYFKKFVGKPRTSSAEKKEERQRSRTPPRRTDRPAVI
NTIFGGPSGGQSGYKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
SIIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETRQ