CuGenDBv2

Moc01g33040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g33040
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr1:23223987..23228064
RNA-Seq Expression: Moc01g33040
Synteny: Moc01g33040
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits | e-value | % identity | Alignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] | e-value: 1.4e-257 | identity: 88.07%
Query:  QAESSRNPVTPAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNP TPAG+I REEFDQLRGQLD QVEALK KCEQ+EGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPVTPAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTAT+LATIRQKEGETLREYVTRFQEEQLKVAHCSD+SAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAE-------------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAE                                      DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAE-------------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP+RTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTC ITFDGADLEEVHLPHNDALVIA LIDHVVV RVL+DGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIREGCIDLPVTLGQDQTRVTQMAEFV
        SVI EG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIREGCIDLPVTLGQDQTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] | e-value: 9.9e-256 | identity: 76.83%
Query:  SSNQQAESSRNPVTPAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NP TP G+I REEFDQLRG+L+ QVEALK KCEQ+EGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPVTPAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SD+SAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAE------------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAE                                     D KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAE------------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGP
        NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP +R DRPAVINTIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF
        SGGQSGHKRKELARAARREVCIIREQRPTC ITFD ADLEEVHLPHNDALVIA LIDHVVVRRVL+D G SANI+SL TYLALGWTRSQLKKS TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF

Query:  SGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALETLT
        S ESVI EGCIDLPVTLG DQT+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR IPSTLHQVLKYST NGVG VRGEQ ASRECYASALKGSSVCALETL 
Subjt:  SGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALETLT

Query:  SRDGMLEFEADLPRREFAAPTEELELVPLL
        SRDG LEF+A+LPRREFAAPTEELELVPLL
Subjt:  SRDGMLEFEADLPRREFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] | e-value: 1.1e-211 | identity: 87.76%
Query:  MCYFLTGLADEALTVKLGEEAPATFAE------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EEAPATFAE                              TDPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAE------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIF
        ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP+RTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIF

Query:  GGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPL
        GGPSGGQSGHKRK+LARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVIA LIDHVVVRRVL+DGGASANILSLPTYLALGWTRSQLKKSPTPL
Subjt:  GGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPL

Query:  VGFSGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALE
        VGFSGESV+ EGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHSFR IPSTLHQVLKYST NGVGTVRGEQTASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALE

Query:  TLTSRDGMLEFEADLPRREFAAPTEELELVPLLSPEKQYGL
        TLTSRDG LEFEADLP REFAAP EELELVPLLS EKQ  L
Subjt:  TLTSRDGMLEFEADLPRREFAAPTEELELVPLLSPEKQYGL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | e-value: 6.0e-253 | identity: 62.58%
Query:  MVQPANSTNTADRRTLAASDAHQREDEVAVVEGQGHDGLATEPLRRSARTTAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQRE    VVEGQGH+ L TEPL RSAR T PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREDEVAVVEGQGHDGLATEPLRRSARTTAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEAMYKEMILTAGAGSRSENRMTRIDMREQMGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKEQSPSRSHQSSNQQAESSRNPVT
                                                                                                   AESS NP+T
Subjt:  TQMRSMEAMYKEMILTAGAGSRSENRMTRIDMREQMGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKEQSPSRSHQSSNQQAESSRNPVT

Query:  PAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P G+I REEFDQL+ + D QVEALK +CE++E   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT T+LATIRQKEGETLREYVTRF EEQLKVAHCSD+SAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAE-------------------------------------TDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLK
        TFAE                                      D KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLK
Subjt:  TFAE-------------------------------------TDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPP+R DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIREGCID
        AR ARREVCIIREQRPT SI F+ ADLE VHLPHNDALVIA LID V+VRR+L+DGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCID
Subjt:  ARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIREGCID

Query:  LPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALETLTSRD
        LPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKYSTLNGVGTVRGE   SRECYAS  K SSVCALE  T RD
Subjt:  LPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALETLTSRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia] | e-value: 1.1e-206 | identity: 71.58%
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT+LATIRQKE ETLREYVTRFQEEQLKVAHCSD+SAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAE-------------------------------------TDPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF E                                      D KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAE-------------------------------------TDPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPP+R DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTCSITF  ADLE VHLPHNDALVIASLIDH +VRRVLIDG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKYST N VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLTSRDGMLEFEADLP---RREFAAPTEELELVPLLSPEKQYGLEK
        ALE  T+R  + E EADLP   +R+F  PTEELELVPLLSPE+Q   EK
Subjt:  ALETLTSRDGMLEFEADLP---RREFAAPTEELELVPLLSPEKQYGLEK

TrEMBL top hits | e-value | % identity | Alignment
A0A6J1C7X5 uncharacterized protein LOC111008813 | e-value: 6.7e-258 | identity: 88.07%
Query:  QAESSRNPVTPAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNP TPAG+I REEFDQLRGQLD QVEALK KCEQ+EGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPVTPAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTAT+LATIRQKEGETLREYVTRFQEEQLKVAHCSD+SAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAE-------------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAE                                      DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAE-------------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP+RTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTC ITFDGADLEEVHLPHNDALVIA LIDHVVV RVL+DGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIREGCIDLPVTLGQDQTRVTQMAEFV
        SVI EG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIREGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 | e-value: 4.8e-256 | identity: 76.83%
Query:  SSNQQAESSRNPVTPAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NP TP G+I REEFDQLRG+L+ QVEALK KCEQ+EGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPVTPAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SD+SAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAE------------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAE                                     D KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAE------------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGP
        NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP +R DRPAVINTIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF
        SGGQSGHKRKELARAARREVCIIREQRPTC ITFD ADLEEVHLPHNDALVIA LIDHVVVRRVL+D G SANI+SL TYLALGWTRSQLKKS TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF

Query:  SGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALETLT
        S ESVI EGCIDLPVTLG DQT+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR IPSTLHQVLKYST NGVG VRGEQ ASRECYASALKGSSVCALETL 
Subjt:  SGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALETLT

Query:  SRDGMLEFEADLPRREFAAPTEELELVPLL
        SRDG LEF+A+LPRREFAAPTEELELVPLL
Subjt:  SRDGMLEFEADLPRREFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC111019899 | e-value: 5.5e-212 | identity: 87.76%
Query:  MCYFLTGLADEALTVKLGEEAPATFAE------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EEAPATFAE                              TDPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAE------------------------------TDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIF
        ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP+RTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIF

Query:  GGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPL
        GGPSGGQSGHKRK+LARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVIA LIDHVVVRRVL+DGGASANILSLPTYLALGWTRSQLKKSPTPL
Subjt:  GGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPL

Query:  VGFSGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALE
        VGFSGESV+ EGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHSFR IPSTLHQVLKYST NGVGTVRGEQTASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALE

Query:  TLTSRDGMLEFEADLPRREFAAPTEELELVPLLSPEKQYGL
        TLTSRDG LEFEADLP REFAAP EELELVPLLS EKQ  L
Subjt:  TLTSRDGMLEFEADLPRREFAAPTEELELVPLLSPEKQYGL

A0A6J1DHB3 uncharacterized protein LOC111020479 | e-value: 2.9e-253 | identity: 62.58%
Query:  MVQPANSTNTADRRTLAASDAHQREDEVAVVEGQGHDGLATEPLRRSARTTAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQRE    VVEGQGH+ L TEPL RSAR T PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREDEVAVVEGQGHDGLATEPLRRSARTTAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEAMYKEMILTAGAGSRSENRMTRIDMREQMGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKEQSPSRSHQSSNQQAESSRNPVT
                                                                                                   AESS NP+T
Subjt:  TQMRSMEAMYKEMILTAGAGSRSENRMTRIDMREQMGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKEQSPSRSHQSSNQQAESSRNPVT

Query:  PAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P G+I REEFDQL+ + D QVEALK +CE++E   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGMIIREEFDQLRGQLDDQVEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT T+LATIRQKEGETLREYVTRF EEQLKVAHCSD+SAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAE-------------------------------------TDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLK
        TFAE                                      D KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLK
Subjt:  TFAE-------------------------------------TDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPP+R DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIREGCID
        AR ARREVCIIREQRPT SI F+ ADLE VHLPHNDALVIA LID V+VRR+L+DGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCID
Subjt:  ARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIREGCID

Query:  LPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALETLTSRD
        LPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKYSTLNGVGTVRGE   SRECYAS  K SSVCALE  T RD
Subjt:  LPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALETLTSRD

A0A6J1DZB9 uncharacterized protein LOC111024904 | e-value: 5.4e-207 | identity: 71.58%
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT+LATIRQKE ETLREYVTRFQEEQLKVAHCSD+SAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAE-------------------------------------TDPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF E                                      D KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAE-------------------------------------TDPKSKDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPP+R DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTCSITF  ADLE VHLPHNDALVIASLIDH +VRRVLIDG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKYST N VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIREGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLTSRDGMLEFEADLP---RREFAAPTEELELVPLLSPEKQYGLEK
        ALE  T+R  + E EADLP   +R+F  PTEELELVPLLSPE+Q   EK
Subjt:  ALETLTSRDGMLEFEADLP---RREFAAPTEELELVPLLSPEKQYGLEK

SwissProt top hits | e-value | % identity | Alignment
No hits found
Arabidopsis top hits | e-value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTGGCCGCCAGCGATGCCCACCAGAGGGAGGACGAAGTAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAACCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGCAATGTAT
AAGGAGATGATACTAACTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATGCGGGAGCAAATGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGGCACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGAACAGTCACCATCCCGCT
CACACCAGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAGGAATGATTATAAGGGAGGAGTTCGATCAGCTGAGGGGCCAGCTCGACGATCAG
GTGGAGGCGTTAAAGACCAAATGTGAGCAAAGAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCTATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTTCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCGACAACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCTGCCACCTTTGCCG
AGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGGAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCG
ACCACGATTCCAATTTCCGAGATCCTAACGAACATTGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGA
CAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGG
GAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCAGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCA
AGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGC
AGACTTGGAGGAAGTTCACCTGCCCCACAATGATGCACTTGTGATCGCTTCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATCTGCTAACA
TCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCCGGAGAATCAGTCATCCGAGAGGGTTGC
ATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCGGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCCTATAATGCCATCTTTGGGAGACCCAT
CATCCACTCATTTCGGGACATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCTCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCT
ATGCCTCTGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCACGAGTAGGGATGGGATGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCC
ACTGAAGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATATGGCCTGGAAAAGAACCTTGTGGTCTATTCTTCAATTCATTGGCGAGCTGTCCCAATTGGGTC
TCGAAATTCCTCATTGATGCCGCTTGGGATTGTATCACTGCGTCGGTTCGGGCCATGTACTCCTTCATCATATTCTCAAGATTTGAGTTGTTATTTTGAACTGGTGGAGT
CTGTGTTCTCTGATTATACTGCTGTTGCGGTGGTGGGATGTATTGCTGTGTAG
mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTGGCCGCCAGCGATGCCCACCAGAGGGAGGACGAAGTAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAACCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGCAATGTAT
AAGGAGATGATACTAACTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATGCGGGAGCAAATGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGGCACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGAACAGTCACCATCCCGCT
CACACCAGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAGGAATGATTATAAGGGAGGAGTTCGATCAGCTGAGGGGCCAGCTCGACGATCAG
GTGGAGGCGTTAAAGACCAAATGTGAGCAAAGAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCTATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTTCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCGACAACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCTGCCACCTTTGCCG
AGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGGAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCG
ACCACGATTCCAATTTCCGAGATCCTAACGAACATTGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGA
CAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGG
GAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCAGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCA
AGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGC
AGACTTGGAGGAAGTTCACCTGCCCCACAATGATGCACTTGTGATCGCTTCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATCTGCTAACA
TCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCCGGAGAATCAGTCATCCGAGAGGGTTGC
ATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCGGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCCTATAATGCCATCTTTGGGAGACCCAT
CATCCACTCATTTCGGGACATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCTCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCT
ATGCCTCTGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCACGAGTAGGGATGGGATGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCC
ACTGAAGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATATGGCCTGGAAAAGAACCTTGTGGTCTATTCTTCAATTCATTGGCGAGCTGTCCCAATTGGGTC
TCGAAATTCCTCATTGATGCCGCTTGGGATTGTATCACTGCGTCGGTTCGGGCCATGTACTCCTTCATCATATTCTCAAGATTTGAGTTGTTATTTTGAACTGGTGGAGT
CTGTGTTCTCTGATTATACTGCTGTTGCGGTGGTGGGATGTATTGCTGTGTAG
Protein sequence
MVQPANSTNTADRRTLAASDAHQREDEVAVVEGQGHDGLATEPLRRSARTTAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMRTQMRSMEAMY
KEMILTAGAGSRSENRMTRIDMREQMGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKEQSPSRSHQSSNQQAESSRNPVTPAGMIIREEFDQLRGQLDDQ
VEALKTKCEQREGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
FSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDNSAMCYFLTGLADEALTVKLGEEAPATFAETDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGP
SGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIASLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIREGC
IDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRDIPSTLHQVLKYSTLNGVGTVRGEQTASRECYASALKGSSVCALETLTSRDGMLEFEADLPRREFAAP
TEELELVPLLSPEKQYGLEKNLVVYSSIHWRAVPIGSRNSSLMPLGIVSLRRFGPCTPSSYSQDLSCYFELVESVFSDYTAVAVVGCIAV