CuGenDBv2

Moc08g00120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g00120
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr8:77112..83769
RNA-Seq Expression: Moc08g00120
Synteny: Moc08g00120
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
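The genome location field above packs the chromosome and the coordinate range into one string. A minimal Python sketch of splitting it apart follows. The names `Locus` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a string of the form 'chr8:77112..83769' into a Locus."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Locus(chrom, start, end)


locus = parse_location("chr8:77112..83769")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene region spans 6658 bp.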


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 1.3e-250; % identity: 86.96)
Query:  QAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQL+G+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKE----------------------------------ADEAL
        AIKCRAF+I LTG+ARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKE                                  ADEAL
Subjt:  AIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKE----------------------------------ADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTNI
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG GRSGKDIE +DPKSKDKGSFS GRAEYRRAEN PTRSR PYERFTPTTIPISEILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPSG
        EESGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAE KEERKRSRTPP+RTDRPAVINTIFGGPSG
Subjt:  EESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPSG

Query:  GQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFSG
        GQSG KRKELARAA+REVCIIREQRPTCPITFD  DLE+VHLPHNDALVIAPLIDHVVV RVL+DGG  ANILSLPTYLALGWTRSQL KSPTPLVGFSG
Subjt:  GQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFSG

Query:  ESAIPEGCIDLPVTLGQDQTQVTQMAEFV
        ES IPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  ESAIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 3.9e-266; % identity: 79.91)
Query:  SSNQQAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQL+GKL+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLR
        AASDAIKCRAFQI LTG+ARLW++           QL+   +AQ S    D      L  +    ADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLR
Subjt:  AASDAIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLR

Query:  TKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRF
        TKTGRPER I  GRSGKD EK+D KSKDKGSFS GRAE+RRA N PTRSR PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGA ERR+KDKYCRF
Subjt:  TKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRF

Query:  HREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQRPTCPI
        HREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAE KEERK SRTP +R DRPAVINTIFGGPSGGQSGHKRKELARAA+REVCIIREQRPTCPI
Subjt:  HREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQRPTCPI

Query:  TFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVV
        TFDS DLE+VHLPHNDALVIAPLIDHVVVRRVL+D G  ANI+SL TYLALGWTRSQL KS TPLVGFS ES IPEGCIDLPVTLG DQTQVTQMAEFVV
Subjt:  TFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVV

Query:  IDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSP
        IDGRSAYNAIFGRPIIHSFRAIPST HQ+LKYSTPNGVG VRGEQ AS+ECYASALKGSSVCALETL  RDGTLEF+A+LPRREFAAPTEELELVPLL  
Subjt:  IDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSP

Query:  EKQTDLARSVPVEILDNPSISESDLMKIGAPE
        +   ++     ++   + +  + D+   G PE
Subjt:  EKQTDLARSVPVEILDNPSISESDLMKIGAPE

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 2.0e-233; % identity: 73.09)
Query:  QAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS+NP TP GVITR EFDQLK K DAQVEALKA+CE+KE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKE----------------------------------ADEAL
        AIKC AFQI LTG+ARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKE                                  ADE L
Subjt:  AIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKE----------------------------------ADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKG-SFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTN
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I  GR+GKD  K+D KS+DKG S S  R +YRR+ +   +SR PYE +TPTTIPI EILTN
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKG-SFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPS
        IEE+GMEKLLKRPEKLRG  E+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S E KEERKR RTPP+R DRPAVIN       
Subjt:  IEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPS

Query:  GGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFS
              K+KELAR A+REVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+L+DGGA ANILSL TYLALGWTRSQL KSPTPLVGFS
Subjt:  GGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFS

Query:  GESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALETLAG
        GES   EGCIDLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PST HQ+LKYST NGVGTVRGE   S+ECYAS  K SSVCALE    
Subjt:  GESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALETLAG

Query:  RD
        RD
Subjt:  RD

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 2.5e-02; % identity: 39.77)
Query:  GQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPS----ENFDALQ----REMEAMRTQMRSMEEMYNE
        GQGH+ L TEPL RSARIT PVLPPAHP+ SKA                P  P     E FD L+     ++EA++ +    E  +++
Subjt:  GQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPS----ENFDALQ----REMEAMRTQMRSMEEMYNE
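
The % identity column can be checked against the alignment text. The sketch below assumes the middle line follows the usual BLAST convention, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. It also assumes the denominator is the full alignment length including gap columns. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one aligned segment.

    Assumptions (not stated on the page): letters in the midline mark
    identical positions, and the denominator is the number of alignment
    columns, i.e. the length of the gapped query line.
    """
    if not query:
        raise ValueError("empty query line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and midline of the short segment aligned to XP_022152854.1.
QUERY = ("GQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPS"
         "----ENFDALQ----REMEAMRTQMRSMEEMYNE")
MIDLINE = ("GQGH+ L TEPL RSARIT PVLPPAHP+ SKA                P  P"
           "     E FD L+     ++EA++ +    E  +++")

print(round(percent_identity(QUERY, MIDLINE), 2))
```

Only letters in the midline are counted, so the result does not depend on how many spaces survived in the extracted midline. For this segment the count is 35 identical positions over 88 columns, which agrees with the listed 39.77.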

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 6.7e-218; % identity: 89.8)
Query:  ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISE
        ADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG GRSGKD+E +DPKSKDKGSFS GRAEYRRAEN PTRSR PYERFTPTTIPISE
Subjt:  ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIF
        ILTNIEESGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAE KEERKRSRTPP+RTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIF

Query:  GGPSGGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPL
        GGPSGGQSGHKRK+LARAA+REVCIIREQRPTCPITFD  DL +VHLPHNDALVIAPLIDHVVVRRVL+DGGA ANILSLPTYLALGWTRSQL KSPTPL
Subjt:  GGPSGGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPL

Query:  VGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALE
        VGFSGES +PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPST HQ+LKYSTPNGVGTVRGEQTAS+ECYAS LKG+SVCALE
Subjt:  VGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALE

Query:  TLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDL
        TL  RDGTLEFEADLP REFAAP EELELVPLLS EKQ  L
Subjt:  TLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDL

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia] (e value: 5.9e-214; % identity: 65.89)
Query:  SPVEEGHPE----------DNESEGHTRQRGDLREHL-NRKRGSSF--QKGQSPSRSHKSSNQQAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAK
        +P E+G P           ++E   ++ +  DLR+HL ++K+ +S+  +   S SR   +SN +A+S + P  P  VI R EFD +K + D QVEALKA+
Subjt:  SPVEEGHPE----------DNESEGHTRQRGDLREHL-NRKRGSSF--QKGQSPSRSHKSSNQQAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAK

Query:  CEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREF
        CE+KE   +D DLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQI LTG+ARLW RRLPARSISTYSQLR+EF
Subjt:  CEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREF

Query:  LAQFSSRHYDKKTATHLATIRQKEADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRG-RAEYR
        + QFS RHYD+KTATHLATIRQKE DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I   R  +   K D KSKDKGS S G R EYR
Subjt:  LAQFSSRHYDKKTATHLATIRQKEADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRG-RAEYR

Query:  RAENRPTRSRPPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAE
        R+E+ P+RSR PYER                                                       CWELKRQIEDLIQD YFKKFVGKPR++S E
Subjt:  RAENRPTRSRPPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAE

Query:  NKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFA
         KEERKRSRTPP+R DRPAVINTIFGGPSGGQ  +KRKELA  A+R+V IIREQ+PTC ITF  TDLE VHLPHNDALVIAPLIDHV+VRRVL+DGGA A
Subjt:  NKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFA

Query:  NILSLPTYLALGWTRSQLTKSPTPLVGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGT
        NILSLPTYLAL  TRSQL KSPTPLVGFS ES  PEGCIDLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS  HQ+LKYSTPNGVGT
Subjt:  NILSLPTYLALGWTRSQLTKSPTPLVGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGT

Query:  VRGEQTASKECYASALKGSSVCALETLAGRDGTLEFEADLPR
        VRGEQ  S+ECYASALK SSVCALE    +D       DLPR
Subjt:  VRGEQTASKECYASALKGSSVCALETLAGRDGTLEFEADLPR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 6.5e-251; % identity: 86.96)
Query:  QAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQL+G+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKE----------------------------------ADEAL
        AIKCRAF+I LTG+ARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKE                                  ADEAL
Subjt:  AIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKE----------------------------------ADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTNI
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG GRSGKDIE +DPKSKDKGSFS GRAEYRRAEN PTRSR PYERFTPTTIPISEILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPSG
        EESGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAE KEERKRSRTPP+RTDRPAVINTIFGGPSG
Subjt:  EESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPSG

Query:  GQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFSG
        GQSG KRKELARAA+REVCIIREQRPTCPITFD  DLE+VHLPHNDALVIAPLIDHVVV RVL+DGG  ANILSLPTYLALGWTRSQL KSPTPLVGFSG
Subjt:  GQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFSG

Query:  ESAIPEGCIDLPVTLGQDQTQVTQMAEFV
        ES IPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  ESAIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 1.9e-266; % identity: 79.91)
Query:  SSNQQAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQL+GKL+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLR
        AASDAIKCRAFQI LTG+ARLW++           QL+   +AQ S    D      L  +    ADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLR
Subjt:  AASDAIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLR

Query:  TKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRF
        TKTGRPER I  GRSGKD EK+D KSKDKGSFS GRAE+RRA N PTRSR PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGA ERR+KDKYCRF
Subjt:  TKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRF

Query:  HREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQRPTCPI
        HREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAE KEERK SRTP +R DRPAVINTIFGGPSGGQSGHKRKELARAA+REVCIIREQRPTCPI
Subjt:  HREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQRPTCPI

Query:  TFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVV
        TFDS DLE+VHLPHNDALVIAPLIDHVVVRRVL+D G  ANI+SL TYLALGWTRSQL KS TPLVGFS ES IPEGCIDLPVTLG DQTQVTQMAEFVV
Subjt:  TFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVV

Query:  IDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSP
        IDGRSAYNAIFGRPIIHSFRAIPST HQ+LKYSTPNGVG VRGEQ AS+ECYASALKGSSVCALETL  RDGTLEF+A+LPRREFAAPTEELELVPLL  
Subjt:  IDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSP

Query:  EKQTDLARSVPVEILDNPSISESDLMKIGAPE
        +   ++     ++   + +  + D+   G PE
Subjt:  EKQTDLARSVPVEILDNPSISESDLMKIGAPE

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 9.4e-234; % identity: 73.09)
Query:  QAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS+NP TP GVITR EFDQLK K DAQVEALKA+CE+KE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKE----------------------------------ADEAL
        AIKC AFQI LTG+ARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKE                                  ADE L
Subjt:  AIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKE----------------------------------ADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKG-SFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTN
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I  GR+GKD  K+D KS+DKG S S  R +YRR+ +   +SR PYE +TPTTIPI EILTN
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKG-SFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPS
        IEE+GMEKLLKRPEKLRG  E+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S E KEERKR RTPP+R DRPAVIN       
Subjt:  IEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIFGGPS

Query:  GGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFS
              K+KELAR A+REVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+VRR+L+DGGA ANILSL TYLALGWTRSQL KSPTPLVGFS
Subjt:  GGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPLVGFS

Query:  GESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALETLAG
        GES   EGCIDLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PST HQ+LKYST NGVGTVRGE   S+ECYAS  K SSVCALE    
Subjt:  GESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALETLAG

Query:  RD
        RD
Subjt:  RD

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 1.2e-02; % identity: 39.77)
Query:  GQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPS----ENFDALQ----REMEAMRTQMRSMEEMYNE
        GQGH+ L TEPL RSARIT PVLPPAHP+ SKA                P  P     E FD L+     ++EA++ +    E  +++
Subjt:  GQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPS----ENFDALQ----REMEAMRTQMRSMEEMYNE

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 3.3e-218; % identity: 89.8)
Query:  ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISE
        ADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG GRSGKD+E +DPKSKDKGSFS GRAEYRRAEN PTRSR PYERFTPTTIPISE
Subjt:  ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAEYRRAENRPTRSRPPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIF
        ILTNIEESGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAE KEERKRSRTPP+RTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKEERKRSRTPPQRTDRPAVINTIF

Query:  GGPSGGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPL
        GGPSGGQSGHKRK+LARAA+REVCIIREQRPTCPITFD  DL +VHLPHNDALVIAPLIDHVVVRRVL+DGGA ANILSLPTYLALGWTRSQL KSPTPL
Subjt:  GGPSGGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTYLALGWTRSQLTKSPTPL

Query:  VGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALE
        VGFSGES +PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPST HQ+LKYSTPNGVGTVRGEQTAS+ECYAS LKG+SVCALE
Subjt:  VGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYASALKGSSVCALE

Query:  TLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDL
        TL  RDGTLEFEADLP REFAAP EELELVPLLS EKQ  L
Subjt:  TLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDL

A0A6J1DPC9 uncharacterized protein LOC111022280 (e value: 2.9e-214; % identity: 65.89)
Query:  SPVEEGHPE----------DNESEGHTRQRGDLREHL-NRKRGSSF--QKGQSPSRSHKSSNQQAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAK
        +P E+G P           ++E   ++ +  DLR+HL ++K+ +S+  +   S SR   +SN +A+S + P  P  VI R EFD +K + D QVEALKA+
Subjt:  SPVEEGHPE----------DNESEGHTRQRGDLREHL-NRKRGSSF--QKGQSPSRSHKSSNQQAESSHNPATPAGVITRAEFDQLKGKLDAQVEALKAK

Query:  CEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREF
        CE+KE   +D DLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQI LTG+ARLW RRLPARSISTYSQLR+EF
Subjt:  CEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIGLTGNARLWYRRLPARSISTYSQLRREF

Query:  LAQFSSRHYDKKTATHLATIRQKEADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRG-RAEYR
        + QFS RHYD+KTATHLATIRQKE DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I   R  +   K D KSKDKGS S G R EYR
Subjt:  LAQFSSRHYDKKTATHLATIRQKEADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRG-RAEYR

Query:  RAENRPTRSRPPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAE
        R+E+ P+RSR PYER                                                       CWELKRQIEDLIQD YFKKFVGKPR++S E
Subjt:  RAENRPTRSRPPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAE

Query:  NKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFA
         KEERKRSRTPP+R DRPAVINTIFGGPSGGQ  +KRKELA  A+R+V IIREQ+PTC ITF  TDLE VHLPHNDALVIAPLIDHV+VRRVL+DGGA A
Subjt:  NKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFA

Query:  NILSLPTYLALGWTRSQLTKSPTPLVGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGT
        NILSLPTYLAL  TRSQL KSPTPLVGFS ES  PEGCIDLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS  HQ+LKYSTPNGVGT
Subjt:  NILSLPTYLALGWTRSQLTKSPTPLVGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGT

Query:  VRGEQTASKECYASALKGSSVCALETLAGRDGTLEFEADLPR
        VRGEQ  S+ECYASALK SSVCALE    +D       DLPR
Subjt:  VRGEQTASKECYASALKGSSVCALETLAGRDGTLEFEADLPR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCAAGGTCCGACCTACCGGGAAGCTCGAGGGAGGTCGGAGCAGCAGTGGTAGGGGGCAAGGTCACGACGGCCTA
GCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAG
AAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATG
TATAACGAGATAATACTAGCTGCAGGCGTAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCCAACAAAGGGGTTCCCACCTCAGCCCGGTCGAG
GAGGGACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCGTCTTTCCAAAAAGGACAG
TCACCATCCCGCTCACACAAGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCAGAGTTCGACCAGCTGAAG
GGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTT
TTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTTATGGAT
TTCCAAGCGGCCTCAGACGCAATCAAATGCCGCGCCTTTCAGATCGGGCTTACTGGCAACGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACC
TACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGCCGACGAG
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACC
GGCCGACCAGAACGAAAGATCGGCTGGGGCAGAAGTGGAAAAGATATAGAAAAGTCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGAGGCCGAGCTGAG
TATCGAAGGGCGGAGAACAGACCTACTAGGAGCCGACCTCCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCTGAGATCCTAACGAACATCGAGGAGTCT
GGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCAGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACC
TCGGACTGTTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAAAACAAGGAA
GAGCGGAAGCGTTCGAGGACGCCGCCCCAGCGTACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAG
GAGTTAGCTCGTGCAGCCAAGCGCGAGGTGTGCATCATCAGGGAGCAAAGGCCGACCTGCCCAATCACCTTCGACAGTACAGACTTAGAGAAGGTCCACCTGCCC
CACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATTTGCTAACATCCTGTCCTTACCGACCTAC
CTCGCCCTGGGTTGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGCCATCCCAGAGGGTTGCATCGACTTGCCGGTC
ACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCA
TTTCGGGCCATTCCCTCGACATTTCATCAAATTCTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAAGGAGTGTTATGCC
TCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTACCGAGGAGGGAGTTTGCCGCACCC
ACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGTCA
GATCTGATGAAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACCCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCA
AGACGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCCTTTCCCTGCCTCTATTGAGATGCATAACCCCTGAAGAGGGCCTGCGCGACCTC
AAAAGTAAGGGGTGCGAGGTGACATTGGCTGCAGTTCAAAGAAAAAGCAAAGAAATGAGAGATGTTGCCAACAACAAAGTAAAAACAAATGGGAGCTTCTTTATT
GAACGGAGAGCAGAGCCAAGGCTTATAGACCTTGCAGCTCTGCCCTGA
mRNA sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCAAGGTCCGACCTACCGGGAAGCTCGAGGGAGGTCGGAGCAGCAGTGGTAGGGGGCAAGGTCACGACGGCCTA
GCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAG
AAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATG
TATAACGAGATAATACTAGCTGCAGGCGTAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCCAACAAAGGGGTTCCCACCTCAGCCCGGTCGAG
GAGGGACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCGTCTTTCCAAAAAGGACAG
TCACCATCCCGCTCACACAAGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCAGAGTTCGACCAGCTGAAG
GGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTT
TTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTTATGGAT
TTCCAAGCGGCCTCAGACGCAATCAAATGCCGCGCCTTTCAGATCGGGCTTACTGGCAACGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACC
TACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGCCGACGAG
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACC
GGCCGACCAGAACGAAAGATCGGCTGGGGCAGAAGTGGAAAAGATATAGAAAAGTCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGAGGCCGAGCTGAG
TATCGAAGGGCGGAGAACAGACCTACTAGGAGCCGACCTCCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCTGAGATCCTAACGAACATCGAGGAGTCT
GGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCAGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACC
TCGGACTGTTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAAAACAAGGAA
GAGCGGAAGCGTTCGAGGACGCCGCCCCAGCGTACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAG
GAGTTAGCTCGTGCAGCCAAGCGCGAGGTGTGCATCATCAGGGAGCAAAGGCCGACCTGCCCAATCACCTTCGACAGTACAGACTTAGAGAAGGTCCACCTGCCC
CACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATTTGCTAACATCCTGTCCTTACCGACCTAC
CTCGCCCTGGGTTGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGCCATCCCAGAGGGTTGCATCGACTTGCCGGTC
ACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCA
TTTCGGGCCATTCCCTCGACATTTCATCAAATTCTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAAGGAGTGTTATGCC
TCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTACCGAGGAGGGAGTTTGCCGCACCC
ACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGTCA
GATCTGATGAAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACCCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCA
AGACGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCCTTTCCCTGCCTCTATTGAGATGCATAACCCCTGAAGAGGGCCTGCGCGACCTC
AAAAGTAAGGGGTGCGAGGTGACATTGGCTGCAGTTCAAAGAAAAAGCAAAGAAATGAGAGATGTTGCCAACAACAAAGTAAAAACAAATGGGAGCTTCTTTATT
GAACGGAGAGCAGAGCCAAGGCTTATAGACCTTGCAGCTCTGCCCTGA
Protein sequence
MLSMRAEVNLAKVRPTGKLEGGRSSSGRGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMRTQMRSMEEM
YNEIILAAGVGSRSENRMTRIDIRQQRGSHLSPVEEGHPEDNESEGHTRQRGDLREHLNRKRGSSFQKGQSPSRSHKSSNQQAESSHNPATPAGVITRAEFDQLK
GKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIGLTGNARLWYRRLPARSIST
YSQLRREFLAQFSSRHYDKKTATHLATIRQKEADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGWGRSGKDIEKSDPKSKDKGSFSRGRAE
YRRAENRPTRSRPPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAQERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAENKE
ERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKELARAAKREVCIIREQRPTCPITFDSTDLEKVHLPHNDALVIAPLIDHVVVRRVLIDGGAFANILSLPTY
LALGWTRSQLTKSPTPLVGFSGESAIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTFHQILKYSTPNGVGTVRGEQTASKECYA
SALKGSSVCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSISESDLMKIGAPESSWMDPIADFIRGNPPQDPKERRKLA
RRAARFVVRGGALYRRGLSLPLLRCITPEEGLRDLKSKGCEVTLAAVQRKSKEMRDVANNKVKTNGSFFIERRAEPRLIDLAALP
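
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is a plain-Python illustration that assumes the standard genetic code (NCBI translation table 1) and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 48 nucleotides of the CDS are used in the example call.

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    outside ACGT are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 48 nucleotides of the CDS sequence listed on this page.
print(translate("ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCAAGGTCCGACCTACC"))
```

Pasting the whole CDS block into `translate` (line breaks are ignored) and comparing the result with the protein sequence is a quick consistency check on a copied record.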