; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g17700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g17700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr11:13583051..13588624
RNA-Seq ExpressionMoc11g17700
SyntenyMoc11g17700
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.9e-27192.23Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES FTSDVLEAPIP KFKAPT+KPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKD-ERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRG SGKD E ADPKSKDKGSFSSGR EYRR E+GPTRSRPYERFTPT IPISEIL NIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKD-ERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIE

Query:  DSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERR+KDKYCRFHREHGHN SD WELKRQIE+LIQDGYFK FVGKPRTSSAEKKEERKRSRTPPRR DRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGG

Query:  QSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARA RREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVILEGCIDLPVTLGQDQTQVTQMAEFV
        SVI EG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVILEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]2.6e-22093.36Show/hide
Query:  KDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIP KFKAPT+KPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGSSGKD-ERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDC
        GRG SGKD ERADPKSKDKGSFSSGR EYRR ESGPT+SRPYERFTPT IPISEIL NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREHGHN SDC
Subjt:  GRGSSGKD-ERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDC

Query:  WELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFK FVGKPRTSSAEKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSGHKRKELARA RREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVKRVL
        LPHNDA VIAPLIDHVVV+RVL
Subjt:  LPHNDALVIAPLIDHVVVKRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.0e-26980Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGES FTSDVLE        APT+K YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILM
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RG SGKDE+AD KSKDKGSFSSGR E+RR  +GPTRSRPYERFTPT IPISEIL 
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILM

Query:  NIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERRNKDKYCRFHREH HN SD WELKRQIEDLIQD YFK FVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGP

Query:  SGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARA RREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV+RVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVILEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVCALETL-
        S ESVI EGCIDLPVTLG DQTQVTQMAEFVV+DGRS YNAIFGRPIIHSFRAIP+TLHQVLK STPNGVG VRGEQ ASRECYA+ALKG SVCALETL 
Subjt:  SGESVILEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVCALETL-

Query:  -RDGTLEFEADLPRKEFAASTEELELVPLL
         RDGTLEF+A+LPR+EFAA TEELELVPLL
Subjt:  -RDGTLEFEADLPRKEFAASTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.0e-26965.57Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPPTDPTSENLDALKREMEAMC
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPPTDPTSENLDALKREMEAMC

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRATRGDVPEQRGSHLGPTEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRATRGDVPEQRGSHLGPTEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IP KFK PTMKPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDE-RADPKSKDKG-SFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +G +GKD+ +AD KS+DKG S SS R +YRR  S   +SRPYE +TPT IPI EIL NIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDE-RADPKSKDKG-SFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIEDSGMEKLLKR

Query:  PEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+RN DKYCRFHR+HGHN S+ WELKRQIEDLIQDGYFK FVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVILEGCIDL
        R  RREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+V+R+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+ LEGCIDL
Subjt:  RAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVILEGCIDL

Query:  PVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVCALE--TLRD
        PV++ QD TQVTQMAEFVV+DGRS YNAIFGRPIIHSFRA+P+TLHQVLK ST NGVGTVRGE   SRECYA+  K  SVCALE  T+RD
Subjt:  PVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVCALE--TLRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.2e-21773.35Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDER-ADPKSKDKGSFSS-GRPEYRRVESGPTRSRPYERFTPTMIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I +    +++R AD KS+DKGS SS  R EYRR+ESGP+RSRPYER+T + IP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDER-ADPKSKDKGSFSS-GRPEYRRVESGPTRSRPYERFTPTMIP

Query:  ISEILMNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVIN
        ISEIL NIE+SGMEKLLKRPEKLRG  E+RNK+KYCRFHR+HGHN + CWELKRQIEDLIQDGYFK FVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILMNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSP
        TIFGGP+GGQSG+KRKELAR  RREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH +V+RVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVC
                      GCIDLPVT+GQD TQVTQMAEFVV+DGRS YNAIFGRPIIHSFRA+P+TLHQVLK STPN VG VRGEQ  SRECYA+ALKG +VC
Subjt:  TPLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVC

Query:  ALE--TLRDGTLEFEADLP---RKEFAASTEELELVPLLSPEKQPD------LMEIGAP
        ALE  T R    E EADLP   +++F   TEELELVPLLSPE+Q +      ++E+ AP
Subjt:  ALE--TLRDGTLEFEADLP---RKEFAASTEELELVPLLSPEKQPD------LMEIGAP

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088139.1e-27292.23Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES FTSDVLEAPIP KFKAPT+KPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKD-ERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRG SGKD E ADPKSKDKGSFSSGR EYRR E+GPTRSRPYERFTPT IPISEIL NIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKD-ERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIE

Query:  DSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERR+KDKYCRFHREHGHN SD WELKRQIE+LIQDGYFK FVGKPRTSSAEKKEERKRSRTPPRR DRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGG

Query:  QSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARA RREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVILEGCIDLPVTLGQDQTQVTQMAEFV
        SVI EG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVILEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188235.0e-27080Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGES FTSDVLE        APT+K YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILM
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RG SGKDE+AD KSKDKGSFSSGR E+RR  +GPTRSRPYERFTPT IPISEIL 
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILM

Query:  NIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERRNKDKYCRFHREH HN SD WELKRQIEDLIQD YFK FVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGP

Query:  SGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELARA RREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV+RVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVILEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVCALETL-
        S ESVI EGCIDLPVTLG DQTQVTQMAEFVV+DGRS YNAIFGRPIIHSFRAIP+TLHQVLK STPNGVG VRGEQ ASRECYA+ALKG SVCALETL 
Subjt:  SGESVILEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVCALETL-

Query:  -RDGTLEFEADLPRKEFAASTEELELVPLL
         RDGTLEF+A+LPR+EFAA TEELELVPLL
Subjt:  -RDGTLEFEADLPRKEFAASTEELELVPLL

A0A6J1D9W7 uncharacterized protein LOC1110187081.3e-22093.36Show/hide
Query:  KDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIP KFKAPT+KPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGSSGKD-ERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDC
        GRG SGKD ERADPKSKDKGSFSSGR EYRR ESGPT+SRPYERFTPT IPISEIL NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREHGHN SDC
Subjt:  GRGSSGKD-ERADPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDC

Query:  WELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFK FVGKPRTSSAEKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSGHKRKELARA RREVCIIREQGPTCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVKRVL
        LPHNDA VIAPLIDHVVV+RVL
Subjt:  LPHNDALVIAPLIDHVVVKRVL

A0A6J1DHB3 uncharacterized protein LOC1110204791.5e-26965.57Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPPTDPTSENLDALKREMEAMC
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPPTDPTSENLDALKREMEAMC

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRATRGDVPEQRGSHLGPTEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRATRGDVPEQRGSHLGPTEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IP KFK PTMKPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDE-RADPKSKDKG-SFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +G +GKD+ +AD KS+DKG S SS R +YRR  S   +SRPYE +TPT IPI EIL NIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDE-RADPKSKDKG-SFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIEDSGMEKLLKR

Query:  PEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+RN DKYCRFHR+HGHN S+ WELKRQIEDLIQDGYFK FVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVILEGCIDL
        R  RREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+V+R+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+ LEGCIDL
Subjt:  RAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVILEGCIDL

Query:  PVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVCALE--TLRD
        PV++ QD TQVTQMAEFVV+DGRS YNAIFGRPIIHSFRA+P+TLHQVLK ST NGVGTVRGE   SRECYA+  K  SVCALE  T+RD
Subjt:  PVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVCALE--TLRD

A0A6J1DZB9 uncharacterized protein LOC1110249045.8e-21873.35Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDER-ADPKSKDKGSFSS-GRPEYRRVESGPTRSRPYERFTPTMIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I +    +++R AD KS+DKGS SS  R EYRR+ESGP+RSRPYER+T + IP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDER-ADPKSKDKGSFSS-GRPEYRRVESGPTRSRPYERFTPTMIP

Query:  ISEILMNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVIN
        ISEIL NIE+SGMEKLLKRPEKLRG  E+RNK+KYCRFHR+HGHN + CWELKRQIEDLIQDGYFK FVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILMNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPRTSSAEKKEERKRSRTPPRRADRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSP
        TIFGGP+GGQSG+KRKELAR  RREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH +V+RVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVC
                      GCIDLPVT+GQD TQVTQMAEFVV+DGRS YNAIFGRPIIHSFRA+P+TLHQVLK STPN VG VRGEQ  SRECYA+ALKG +VC
Subjt:  TPLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAALKGPSVC

Query:  ALE--TLRDGTLEFEADLP---RKEFAASTEELELVPLLSPEKQPD------LMEIGAP
        ALE  T R    E EADLP   +++F   TEELELVPLLSPE+Q +      ++E+ AP
Subjt:  ALE--TLRDGTLEFEADLP---RKEFAASTEELELVPLLSPEKQPD------LMEIGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGACCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAGGGTCACGA
CGGCCTAGTAACGGAACCCCTCCGCAGGTCAGCACGGATCACCGCACCTGCCCTACCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCACCTACGGATCCAACAAGCGAGAACCTGGATGCGCTCAAGAGAGAGATGGAGGCAATGTGCACACAAATGCGCTCCATGGAGGAAATGTAT
AACGAAATGATGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGCGACGCGCGGGGACGTACCCGAGCAAAGGGGTTCCCACCTCGGCCCGACCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTATACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGTTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGACGGCGACTTGGGAGAATCTCTTTTCACCTCGGATGTTTTGGAAGCACCAATCCCTTCGAAGTTCAAAGC
TCCTACCATGAAGCCTTACGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCT
CGGCACTACGACAAAAAGACAGCGACCCATCTTGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTACTCC
AGAAGGCGAAGAAAGTCATCGACGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGTCGGGGCAGCAGCGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCGACCTGAGTATCGAAGGGTGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCATGATTCC
AATTTCCGAGATCCTAATGAACATCGAGGATTCTGGAATGGAAAAGCTACTCAAGCGTCCGGAAAAACTTCGGGGAGCCCCGGAAAGGCGCAACAAGGACAAGTATTGCC
GCTTCCATCGGGAGCACGGCCACAATATGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGATGTTCGTGGGAAAGCCCAGG
ACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCACCCAGGCGCGCCGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGAGGGCA
ATCCGGACATAAGAGGAAGGAGTTAGCCCGTGCAGTCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGATTTGGAGG
AGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATCGATCATGTGGTGGTCAAGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTA
CCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCTAGAGGGTTGCATCGACTTGCC
GGTCACGCTGGGGCAGGACCAAACTCAGGTCACTCAAATGGCCGAGTTCGTAGTAGTTGACGGCAGATCGACCTATAACGCCATCTTTGGGAGACCCATCATCCACTCAT
TTCGGGCCATTCCTGCAACACTTCATCAAGTTTTGAAGTCTTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCA
CTCAAAGGCCCGTCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAATTTGCCGCATCCACTGAAGAGCTCGAACT
TGTTCCTCTGCTTAGTCCCGAGAAGCAGCCAGATCTGATGGAGATCGGCGCTCCAGCATCCTCATGGATGGACCCGATCGCGGACTTCATTAAGGACAACTCACCACAAG
ACCCCAAGGAGCGCAAAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGA
AGAGCAATGGCCCAGCTTCGCCTGGCGGAATATCAGGGCAGAATGGCCATACATTACAATGCCCGTGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAG
GGTCCAAACGCATGTGGGTGCCCTTGACCCGGCCTGGGAGGGCCCGTTTGAGATCAAAGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCC
TCGCACACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGACCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAGGGTCACGA
CGGCCTAGTAACGGAACCCCTCCGCAGGTCAGCACGGATCACCGCACCTGCCCTACCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCACCTACGGATCCAACAAGCGAGAACCTGGATGCGCTCAAGAGAGAGATGGAGGCAATGTGCACACAAATGCGCTCCATGGAGGAAATGTAT
AACGAAATGATGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGCGACGCGCGGGGACGTACCCGAGCAAAGGGGTTCCCACCTCGGCCCGACCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTATACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGTTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGACGGCGACTTGGGAGAATCTCTTTTCACCTCGGATGTTTTGGAAGCACCAATCCCTTCGAAGTTCAAAGC
TCCTACCATGAAGCCTTACGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCT
CGGCACTACGACAAAAAGACAGCGACCCATCTTGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTACTCC
AGAAGGCGAAGAAAGTCATCGACGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGTCGGGGCAGCAGCGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCGACCTGAGTATCGAAGGGTGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCATGATTCC
AATTTCCGAGATCCTAATGAACATCGAGGATTCTGGAATGGAAAAGCTACTCAAGCGTCCGGAAAAACTTCGGGGAGCCCCGGAAAGGCGCAACAAGGACAAGTATTGCC
GCTTCCATCGGGAGCACGGCCACAATATGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGATGTTCGTGGGAAAGCCCAGG
ACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCACCCAGGCGCGCCGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGAGGGCA
ATCCGGACATAAGAGGAAGGAGTTAGCCCGTGCAGTCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGATTTGGAGG
AGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATCGATCATGTGGTGGTCAAGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTA
CCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCTAGAGGGTTGCATCGACTTGCC
GGTCACGCTGGGGCAGGACCAAACTCAGGTCACTCAAATGGCCGAGTTCGTAGTAGTTGACGGCAGATCGACCTATAACGCCATCTTTGGGAGACCCATCATCCACTCAT
TTCGGGCCATTCCTGCAACACTTCATCAAGTTTTGAAGTCTTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCA
CTCAAAGGCCCGTCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAATTTGCCGCATCCACTGAAGAGCTCGAACT
TGTTCCTCTGCTTAGTCCCGAGAAGCAGCCAGATCTGATGGAGATCGGCGCTCCAGCATCCTCATGGATGGACCCGATCGCGGACTTCATTAAGGACAACTCACCACAAG
ACCCCAAGGAGCGCAAAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGA
AGAGCAATGGCCCAGCTTCGCCTGGCGGAATATCAGGGCAGAATGGCCATACATTACAATGCCCGTGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAG
GGTCCAAACGCATGTGGGTGCCCTTGACCCGGCCTGGGAGGGCCCGTTTGAGATCAAAGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCC
TCGCACACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPPTDPTSENLDALKREMEAMCTQMRSMEEMY
NEMMLAAGAGSRSENRATRGDVPEQRGSHLGPTEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLNDGDLGESLFTSDVLEAPIPSKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSS
RHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGSSGKDERADP
KSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTMIPISEILMNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNMSDCWELKRQIEDLIQDGYFKMFVGKPR
TSSAEKKEERKRSRTPPRRADRPAVINTIFGGPSGGQSGHKRKELARAVRREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVKRVLVDGGASANILSL
PTYLALGWTRSQLKRSPTPLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSTYNAIFGRPIIHSFRAIPATLHQVLKSSTPNGVGTVRGEQTASRECYAAA
LKGPSVCALETLRDGTLEFEADLPRKEFAASTEELELVPLLSPEKQPDLMEIGAPASSWMDPIADFIKDNSPQDPKERKKLARRAARVEHYEPTTNEDGLLLNLDLLEER
RAMAQLRLAEYQGRMAIHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYP