CuGenDBv2

Moc01g01800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g01800
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr1:1209304..1214181
RNA-Seq Expression: Moc01g01800
Synteny: Moc01g01800
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR005162 - Retrotransposon gag domain
  IPR021109 - Aspartic peptidase domain superfamily
  IPR036397 - Ribonuclease H superfamily
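
The genome location above is written as `chrom:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention) and using a hypothetical helper named `parse_location`, the string can be split into its parts and the gene span computed:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string (hypothetical helper, not a CuGenDB API)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr1:1209304..1214181")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 4878 bp on chr1; with a half-open convention the figure would be one less.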


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value 1.5e-258, 88.64% identity)
Query:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N    AG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQ---------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQ                                 GSFSSGRAEYRR ENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQ---------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value 5.1e-259, 77.46% identity)
Query:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHN A   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQ--------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQ                                GSFSSGRAE+RR  NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQ--------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  NGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        +GGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  NGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALETLA
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG V GEQ ASRECYASALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALETLA

Query:  GRDGPLEFEADLPRKEFAAPTEELELVPLV
         RDG LEF+A+LPR+EFAAPTEELELVPL+
Subjt:  GRDGPLEFEADLPRKEFAAPTEELELVPLV

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] (e value 2.3e-214, 87.73% identity)
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ--------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQ                          GSFS+GRAEYRR ENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ--------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIF
        ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIF

Query:  GGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPL
        GGP+GGQSGHKRK+LARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPL
Subjt:  GGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTV GEQ ASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALE

Query:  TLAGRDGPLEFEADLPRKEFAAPTEELELVPLVSLEKQIK
        TL  RDG LEFEADLP +EFAAP EELELVPL+S EKQ++
Subjt:  TLAGRDGPLEFEADLPRKEFAAPTEELELVPLVSLEKQIK

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value 5.3e-256, 63.16% identity)
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGQAPAPTSENFDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L  EPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGQAPAPTSENFDALKREMEAMR

Query:  TQIRSMEEMYNEMMLAAGAGSRSENRVTLVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHN--L
                                                                                                   AESS+N   
Subjt:  TQIRSMEEMYNEMMLAAGAGSRSENRVTLVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHN--L

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQ----------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR
        FAEVLQK KKVIDGQ                                   S SS R +YRR  +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQ----------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL
        R ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALETLAGRD
        PV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTV GE   SRECYAS  K SSVCALE    RD
Subjt:  PVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia] (e value 2.9e-209, 71.9% identity)
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ----------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQ                                   S S+ R EYRR+E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ----------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP
        TIFGGPNGGQSG+KRKELAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TPN VG V GEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVC

Query:  ALETLAGRDGPLEFEADLP---RKEFAAPTEELELVPLVSLEKQIKDE
        ALE    R    E EADLP   +++F  PTEELELVPL+S E+Q   E
Subjt:  ALETLAGRDGPLEFEADLP---RKEFAAPTEELELVPLVSLEKQIKDE
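
The % identity figures summarize alignments like the ones above. As a rough sketch (not the database's own code), identity for one alignment block can be estimated from the BLAST-style midline: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The function name `alignment_stats` is hypothetical, and the example uses only the final block of the XP_022137317.1 alignment, so it is not expected to reproduce the reported whole-alignment value.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives for one alignment block (illustrative only)."""
    length = len(query)
    # Trailing blanks of the midline are often stripped in copied text.
    midline = midline.ljust(length)
    if len(midline) != length:
        raise ValueError("midline is longer than the query line")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / length, 2),
    }


query = "SVIPEGCIDLPVTLGQDQTRVTQMAEFV"
midline = "SVIPEG IDLPVTLGQDQT+VTQMAEFV"
print(alignment_stats(query, midline))
```

For this 28-column block the midline has 26 identities and one `+`, which gives 92.86% identity for the block alone.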

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value 7.2e-259, 88.64% identity)
Query:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N    AG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---LAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQ---------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQ                                 GSFSSGRAEYRR ENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQ---------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value 2.5e-259, 77.46% identity)
Query:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHN A   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNLA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQ--------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQ                                GSFSSGRAE+RR  NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQ--------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  NGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        +GGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  NGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALETLA
        S ESVIPEGCIDLPVTLG DQT+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG V GEQ ASRECYASALKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALETLA

Query:  GRDGPLEFEADLPRKEFAAPTEELELVPLV
         RDG LEF+A+LPR+EFAAPTEELELVPL+
Subjt:  GRDGPLEFEADLPRKEFAAPTEELELVPLV

A0A6J1DD03 uncharacterized protein LOC111019899 (e value 1.1e-214, 87.73% identity)
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ--------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQ                          GSFS+GRAEYRR ENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ--------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIF
        ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIF

Query:  GGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPL
        GGP+GGQSGHKRK+LARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPL
Subjt:  GGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTV GEQ ASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALE

Query:  TLAGRDGPLEFEADLPRKEFAAPTEELELVPLVSLEKQIK
        TL  RDG LEFEADLP +EFAAP EELELVPL+S EKQ++
Subjt:  TLAGRDGPLEFEADLPRKEFAAPTEELELVPLVSLEKQIK

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value 2.6e-256, 63.16% identity)
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGQAPAPTSENFDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L  EPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGQAPAPTSENFDALKREMEAMR

Query:  TQIRSMEEMYNEMMLAAGAGSRSENRVTLVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHN--L
                                                                                                   AESS+N   
Subjt:  TQIRSMEEMYNEMMLAAGAGSRSENRVTLVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHN--L

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQ----------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR
        FAEVLQK KKVIDGQ                                   S SS R +YRR  +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQ----------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELA
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL
        R ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALETLAGRD
        PV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTV GE   SRECYAS  K SSVCALE    RD
Subjt:  PVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC111024904 (e value 1.4e-209, 71.9% identity)
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ----------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQ                                   S S+ R EYRR+E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQ----------------------------------GSFSSGRAEYRRVENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP
        TIFGGPNGGQSG+KRKELAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TPN VG V GEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECYASALKGSSVC

Query:  ALETLAGRDGPLEFEADLP---RKEFAAPTEELELVPLVSLEKQIKDE
        ALE    R    E EADLP   +++F  PTEELELVPL+S E+Q   E
Subjt:  ALETLAGRDGPLEFEADLP---RKEFAAPTEELELVPLVSLEKQIKDE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCAAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATACGCTCC
ATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCTCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCATGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACATAAGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCTCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTAGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAGTGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCA
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCTCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCGGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGGATCCTTTTCCAGCGGCCGA
GCTGAGTATCGAAGGGTGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAG
TCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAAC
ACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAG
GAAGAGCGAAAGCGTTCAAGGACGCCACCCCGACGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAGTCCGGACATAAAAGA
AAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTG
CCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACC
TACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGGAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCAGTCATCCCAGAGGGTTGCATCGACTTGCCG
GTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATCGACGGTAGATCGGCCTATAACGCCATCTTTGGGCGACCCATCATCCAC
TCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACAGTCCTAGGAGAACAGGCCGCTTCGAGGGAGTGTTAT
GCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGCAGAGATGGGCCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCA
CCCACTGAGGAGCTCGAGCTTGTTCCTCTAGTTAGTCTCGAGAAGCAGATCAAGGACGAGTACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGCAAGGTC
AGATCGTACCTTGCCCAGTTTCGAACTTACGAAGTAAGCCGGATTCCGCGAGCAGAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGACGTACGAGGCC
GACCTGGCTAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGGTCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATC
GCGGACTTCATTAGGGGTAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGT
GGCTTTTCCCTGGCTCTATTGAGATTAGAGCATTACGAGCCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCC
CAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGAATTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAA
ACGCATGTGGGTGCCCTTGATCCGGCCTGGGATGGCCCGTTTGAGATTAAGGGCATAGTCCGACCTGGGACGCACATATTGGCCGATCTGAAAGGAGACGTCCTC
GCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCAAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATACGCTCC
ATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCTCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCATGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACATAAGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCTCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTAGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAGTGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCA
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCTCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCGGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGGATCCTTTTCCAGCGGCCGA
GCTGAGTATCGAAGGGTGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAG
TCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAAC
ACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAG
GAAGAGCGAAAGCGTTCAAGGACGCCACCCCGACGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAGTCCGGACATAAAAGA
AAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTG
CCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACC
TACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGGAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCAGTCATCCCAGAGGGTTGCATCGACTTGCCG
GTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATCGACGGTAGATCGGCCTATAACGCCATCTTTGGGCGACCCATCATCCAC
TCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACAGTCCTAGGAGAACAGGCCGCTTCGAGGGAGTGTTAT
GCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGCAGAGATGGGCCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCA
CCCACTGAGGAGCTCGAGCTTGTTCCTCTAGTTAGTCTCGAGAAGCAGATCAAGGACGAGTACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGCAAGGTC
AGATCGTACCTTGCCCAGTTTCGAACTTACGAAGTAAGCCGGATTCCGCGAGCAGAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGACGTACGAGGCC
GACCTGGCTAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGGTCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATC
GCGGACTTCATTAGGGGTAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGT
GGCTTTTCCCTGGCTCTATTGAGATTAGAGCATTACGAGCCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCC
CAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGAATTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAA
ACGCATGTGGGTGCCCTTGATCCGGCCTGGGATGGCCCGTTTGAGATTAAGGGCATAGTCCGACCTGGGACGCACATATTGGCCGATCTGAAAGGAGACGTCCTC
GCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGQAPAPTSENFDALKREMEAMRTQIRS
MEEMYNEMMLAAGAGSRSENRVTLVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNLAGIITREEFDQL
RGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSIS
TYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQGSFSSGR
AEYRRVENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKK
EERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPT
YLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVLGEQAASRECY
ASALKGSSVCALETLAGRDGPLEFEADLPRKEFAAPTEELELVPLVSLEKQIKDEYQAKDTRMEKYLGKVRSYLAQFRTYEVSRIPRAENSNADALAKLASTYEA
DLARSVPVEILDNPSILEPGLMEIGAPESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRDGALYRRGFSLALLRLEHYEPTTNEEELLLNLDLLEERRAMA
QLRLAEYQGRMARHYNARIRPRAFQVGHLVLRRVQTHVGALDPAWDGPFEIKGIVRPGTHILADLKGDVLAHPWNAEHLKRYYP
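
The protein sequence above corresponds to the conceptual translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1, which the page does not state explicitly), the relationship can be checked with a small translator. The function name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered by T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGTTCAACCAGCGAACTCGACCAATACG"))
print(translate("TATTATCCTTGA"))
```

The two fragments translate to `MVQPANSTNT` and `YYP`, matching the start and end of the listed protein; pasting the full CDS into `translate` would allow the same comparison over the whole sequence.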