; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g26480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g26480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr5:18948019..18953620
RNA-Seq ExpressionMoc05g26480
SyntenyMoc05g26480
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.6e-28395.45Show/hide
Query:  QAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNP TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLP  SIST SQLRREFLA FSSRHYDKKTAT+LATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKDIE ADPKSKDKGSFSSGR EYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG
        ESG EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGP+GG
Subjt:  ESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTC ITFDGADL EVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPESCIDLPVTLGQDQTQVTQMAEFV
        SVIPE  IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPESCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.7e-28380.76Show/hide
Query:  SNQQAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA
        SNQQAESS NP TP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQA
Subjt:  SNQQAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA

Query:  ASDAIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD
        ASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLAD
Subjt:  ASDAIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD

Query:  EALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILT
        EALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER IDRGRSGKD EKAD KSKDKGSFSSGR E+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  EALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIEESG EKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  NGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF
        +GGQSGHKRKELARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGF
Subjt:  NGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF

Query:  SGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLA
        S ESVIPE CIDLPVTLG DQTQVTQMAEFVVIDGRSAYN IFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL 
Subjt:  SGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLA

Query:  GRDGTLEFKADLPRREFAAPTEELELVPLLSSEKQVTSAYETDLARSVPVEILDN
         RDGTLEFKA+LPRREFAAPTEELELVPLL  +      +E +L     +  +D+
Subjt:  GRDGTLEFKADLPRREFAAPTEELELVPLLSSEKQVTSAYETDLARSVPVEILDN

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]1.6e-22791.7Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRTK G       +GRSGKD+E  DPKSKDKGSFS+GR EYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESG EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPNGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGP+GGQSGHKRK+LARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPNGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PE CIDLPVTLGQDQT+VTQMAEFVV+DGRSAYN IFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTLEFKADLPRREFAAPTEELELVPLLSSEKQV
        +SVCALETL  RDGTLEF+ADLP REFAAP EELELVPLLS EKQV
Subjt:  SSVCALETLAGRDGTLEFKADLPRREFAAPTEELELVPLLSSEKQV

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.2e-27867.51Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRRSNQQAESSRNPVT
                                                                                                   AESS NP+T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRRSNQQAESSRNPVT

Query:  PAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P GVITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLP R IST SQLR+EF++QFSSRHYD+KT T+LATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKG-SFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGREKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPE+ ID+GR+GKD  KAD KS+DKG S SS RV+YRR+ +   +SRPYE +TPTTIPI EILTNIEE+G EKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKG-SFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGREKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPESCID
        AR ARREVCIIREQRPT SI F+ ADL  VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  E CID
Subjt:  ARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPESCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        LPV++ QD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]3.2e-22376.1Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLP RSIST SQLR+EF++QFSS HYD+KTAT+LATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSS-GRVEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++ID+ +  ++  KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSS-GRVEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESG EKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPNGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGPNGGQSG+KRKELAR ARREVCIIRE +PTCSITF  ADL  VHLPHNDALVIA LIDH +VRRVL+DGG                         
Subjt:  TIFGGPNGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                       CIDLPVT+GQD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFKADLP---RREFAAPTEELELVPLLSSEKQ
        ALE    R    E +ADLP   +R+F  PTEELELVPLLS E+Q
Subjt:  ALETLAGRDGTLEFKADLP---RREFAAPTEELELVPLLSSEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088137.8e-28495.45Show/hide
Query:  QAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNP TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLP  SIST SQLRREFLA FSSRHYDKKTAT+LATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKDIE ADPKSKDKGSFSSGR EYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG
        ESG EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGP+GG
Subjt:  ESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTC ITFDGADL EVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPESCIDLPVTLGQDQTQVTQMAEFV
        SVIPE  IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPESCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.3e-28380.76Show/hide
Query:  SNQQAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA
        SNQQAESS NP TP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQA
Subjt:  SNQQAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA

Query:  ASDAIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD
        ASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLAD
Subjt:  ASDAIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD

Query:  EALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILT
        EALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER IDRGRSGKD EKAD KSKDKGSFSSGR E+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  EALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIEESG EKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  NGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF
        +GGQSGHKRKELARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGF
Subjt:  NGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF

Query:  SGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLA
        S ESVIPE CIDLPVTLG DQTQVTQMAEFVVIDGRSAYN IFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL 
Subjt:  SGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLA

Query:  GRDGTLEFKADLPRREFAAPTEELELVPLLSSEKQVTSAYETDLARSVPVEILDN
         RDGTLEFKA+LPRREFAAPTEELELVPLL  +      +E +L     +  +D+
Subjt:  GRDGTLEFKADLPRREFAAPTEELELVPLLSSEKQVTSAYETDLARSVPVEILDN

A0A6J1DD03 uncharacterized protein LOC1110198997.9e-22891.7Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRTK G       +GRSGKD+E  DPKSKDKGSFS+GR EYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESG EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPNGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGP+GGQSGHKRK+LARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPNGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PE CIDLPVTLGQDQT+VTQMAEFVV+DGRSAYN IFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTLEFKADLPRREFAAPTEELELVPLLSSEKQV
        +SVCALETL  RDGTLEF+ADLP REFAAP EELELVPLLS EKQV
Subjt:  SSVCALETLAGRDGTLEFKADLPRREFAAPTEELELVPLLSSEKQV

A0A6J1DHB3 uncharacterized protein LOC1110204795.8e-27967.51Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRRSNQQAESSRNPVT
                                                                                                   AESS NP+T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRRSNQQAESSRNPVT

Query:  PAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P GVITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLP R IST SQLR+EF++QFSSRHYD+KT T+LATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKG-SFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGREKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPE+ ID+GR+GKD  KAD KS+DKG S SS RV+YRR+ +   +SRPYE +TPTTIPI EILTNIEE+G EKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKG-SFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGREKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPESCID
        AR ARREVCIIREQRPT SI F+ ADL  VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  E CID
Subjt:  ARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPESCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        LPV++ QD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC1110249041.5e-22376.1Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLP RSIST SQLR+EF++QFSS HYD+KTAT+LATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQFSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSS-GRVEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++ID+ +  ++  KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIEKADPKSKDKGSFSS-GRVEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESG EKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPNGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGPNGGQSG+KRKELAR ARREVCIIRE +PTCSITF  ADL  VHLPHNDALVIA LIDH +VRRVL+DGG                         
Subjt:  TIFGGPNGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                       CIDLPVT+GQD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFKADLP---RREFAAPTEELELVPLLSSEKQ
        ALE    R    E +ADLP   +R+F  PTEELELVPLLS E+Q
Subjt:  ALETLAGRDGTLEFKADLP---RREFAAPTEELELVPLLSSEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCTAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACATAGGAGATCCAACCAACAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAACTAGGTCGATCTCGACCTGCTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAGAAGACAGCGACCTATCTCGCCACCATCAGACAGAAGGAGGGCGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATTGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGACCGGGGCAGAAGTGGAAAAGATATAGAA
AAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGTTGAGTATCGGAGGGCAGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCC
GACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAAGGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGG
ACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTG
GGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCC
AAACGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTG
CAGACTTGAATGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCGTCTGCTAAC
ATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGTCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGAGTTG
CATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTGATCGACGGTAGATCGGCCTATAACGTCATCTTTGGGAGACCCA
TCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGC
TATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACC
CACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTTCCGAGAAGCAAGTAACATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCT
CGATCTCAGAACCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAACGCAGA
AAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCTCGTTTGAGGTCAAGGGCAT
AGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCTAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACATAGGAGATCCAACCAACAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAACTAGGTCGATCTCGACCTGCTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAGAAGACAGCGACCTATCTCGCCACCATCAGACAGAAGGAGGGCGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATTGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGACCGGGGCAGAAGTGGAAAAGATATAGAA
AAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGTTGAGTATCGGAGGGCAGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCC
GACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAAGGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGG
ACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTG
GGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCC
AAACGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTG
CAGACTTGAATGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCGTCTGCTAAC
ATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGTCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGAGTTG
CATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTGATCGACGGTAGATCGGCCTATAACGTCATCTTTGGGAGACCCA
TCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGC
TATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACC
CACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTTCCGAGAAGCAAGTAACATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCT
CGATCTCAGAACCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAACGCAGA
AAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCTCGTTTGAGGTCAAGGGCAT
AGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMRTQMRSMEEMY
NEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRRSNQQAESSRNPVTPAGVITREEFDQLRGQLDAQ
VEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPTRSISTCSQLRREFLAQ
FSSRHYDKKTATYLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDIE
KADPKSKDKGSFSSGRVEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV
GKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLNEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN
ILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASREC
YASALKGSSVCALETLAGRDGTLEFKADLPRREFAAPTEELELVPLLSSEKQVTSAYETDLARSVPVEILDNPSISEPDLMEIGAPESSWMDPIADFIRGNSPQDPKERR
KLARRAARFVVRGGALVQTHVGALDPTWEGSFEVKGIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP