; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g17660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g17660
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:13993945..13999479
RNA-Seq ExpressionMoc09g17660
SyntenyMoc09g17660
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.0e-23984.47Show/hide
Query:  QVESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------
        + ESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKA TVKPYDG+KDPKDYVE             
Subjt:  QVESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL
                IALTGSARLWYRRLPA SISTYSQLR+EFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEAL
Subjt:  --------IALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTG P+RKIGRGRSGKD+E ADPKSKDKGSFSSGRAEY+RAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK-------------------------
        ES  EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERK                         
Subjt:  ESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK-------------------------

Query:  -LGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
          G KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  -LGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMVEFV
        SVIPEG IDLPVTLGQD+T+VTQM EFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMVEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]6.5e-24274.8Show/hide
Query:  SSNQQVESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------
        SSNQQ ESSHNPA   G+ITREEFDQLRGKL+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAP        TVK YDG+KDPKDYVE         
Subjt:  SSNQQVESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------

Query:  ------------IALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA
                    IALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  ------------IALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTG P+R I RGRSGKD EKAD KSKDKGSFSSGRAE++RA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKL--------------------
        TNIEES  EKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERKL                    
Subjt:  TNIEESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKL--------------------

Query:  ------GHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
              GHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  ------GHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG D+T+VTQM EFVVIDGRSAYNAIFGRPII SFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  VGRDETLEFEADLPRKEFAAPTEELELVPLL
        V RD TLEF+A+LPR+EFAAPTEELELVPLL
Subjt:  VGRDETLEFEADLPRKEFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]7.5e-20685.84Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKDME  DPKSKDKGSFS+GRAEY+RAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK------------
        TTIPISEILTNIEES  EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERK            
Subjt:  TTIPISEILTNIEESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK------------

Query:  --------------LGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
                       GHKRK+LARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  --------------LGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQM EFVV+DGRSAYNAIFGRPII SFRAIPSTLHQVLKY TPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLVGRDETLEFEADLPRKEFAAPTEELELVPLLSPEKQ
        +SVCALETL  RD TLEFEADLP +EFAAP EELELVPLLS EKQ
Subjt:  SSVCALETLVGRDETLEFEADLPRKEFAAPTEELELVPLLSPEKQ

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]4.7e-24862.98Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATRPLRRSARITAPALPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA  VEGQGH+ L T PL RSARIT P LPPAHP+ SKA                                   
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATRPLRRSARITAPALPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMR

Query:  AQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVRKQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNP--
                                                                                                    ESS+NP  
Subjt:  AQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVRKQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNP--

Query:  AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------IAL
         G+ITREEFDQL+ K DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK  T+KPYDG+KDPKDYVE                     IAL
Subjt:  AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------IAL

Query:  TGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLRKEF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH SDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKG-SFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESAREKLLKR
        FAEVLQK KKVIDGQELLRTKTG P++ I +GR+GKD  KAD KS+DKG S SS R +Y+R+ +   +SRPYE +TPTTIPI EILTNIEE+  EKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKG-SFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESAREKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK-------------LGHKRKELARAARREVCIIREQ
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERK             + +K+KELAR ARREVCIIREQ
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK-------------LGHKRKELARAARREVCIIREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ
         PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ

Query:  MVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDE
        M EFVVIDGRSAYNAIFGRPII SFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RDE
Subjt:  MVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDE

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]5.2e-19169.75Show/hide
Query:  EIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEE
        +IALTGSARLWYRRLPARSISTYSQLRKEF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAH SDDSAMCYFLT LADE LTVKLGEE
Subjt:  EIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEE

Query:  APATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSS-GRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESAREK
        AP TF EVLQKAKKVIDGQELLRTKTG P+++I + +  ++  KAD KS+DKGS SS  R EY+R E+GP+RSRPYER+T +TIPISEILTNIEES  EK
Subjt:  APATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSS-GRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESAREK

Query:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK--------------------------LGHKR
        LLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERK                           G+KR
Subjt:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK--------------------------LGHKR

Query:  KELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG
        KELAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                                        G
Subjt:  KELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG

Query:  CIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDETLEFE
        CIDLPVT+GQD T+VTQM EFVVIDGRSAYNAIFGRPII SFRA+PSTLHQVLKY TPN VG VRGEQ  SRECYASALKGS+VCALE    R +  E E
Subjt:  CIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDETLEFE

Query:  ADLP---RKEFAAPTEELELVPLLSPEKQ
        ADLP   +++F  PTEELELVPLLSPE+Q
Subjt:  ADLP---RKEFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088135.0e-24084.47Show/hide
Query:  QVESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------
        + ESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKA TVKPYDG+KDPKDYVE             
Subjt:  QVESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL
                IALTGSARLWYRRLPA SISTYSQLR+EFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEAL
Subjt:  --------IALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTG P+RKIGRGRSGKD+E ADPKSKDKGSFSSGRAEY+RAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK-------------------------
        ES  EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERK                         
Subjt:  ESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK-------------------------

Query:  -LGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
          G KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  -LGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMVEFV
        SVIPEG IDLPVTLGQD+T+VTQM EFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMVEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.2e-24274.8Show/hide
Query:  SSNQQVESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------
        SSNQQ ESSHNPA   G+ITREEFDQLRGKL+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAP        TVK YDG+KDPKDYVE         
Subjt:  SSNQQVESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------

Query:  ------------IALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA
                    IALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  ------------IALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTG P+R I RGRSGKD EKAD KSKDKGSFSSGRAE++RA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKL--------------------
        TNIEES  EKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERKL                    
Subjt:  TNIEESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKL--------------------

Query:  ------GHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
              GHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  ------GHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG D+T+VTQM EFVVIDGRSAYNAIFGRPII SFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  VGRDETLEFEADLPRKEFAAPTEELELVPLL
        V RD TLEF+A+LPR+EFAAPTEELELVPLL
Subjt:  VGRDETLEFEADLPRKEFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198993.6e-20685.84Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKDME  DPKSKDKGSFS+GRAEY+RAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK------------
        TTIPISEILTNIEES  EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERK            
Subjt:  TTIPISEILTNIEESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK------------

Query:  --------------LGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
                       GHKRK+LARAARREVCIIREQ PTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  --------------LGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQM EFVV+DGRSAYNAIFGRPII SFRAIPSTLHQVLKY TPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLVGRDETLEFEADLPRKEFAAPTEELELVPLLSPEKQ
        +SVCALETL  RD TLEFEADLP +EFAAP EELELVPLLS EKQ
Subjt:  SSVCALETLVGRDETLEFEADLPRKEFAAPTEELELVPLLSPEKQ

A0A6J1DHB3 uncharacterized protein LOC1110204792.3e-24862.98Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATRPLRRSARITAPALPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA  VEGQGH+ L T PL RSARIT P LPPAHP+ SKA                                   
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATRPLRRSARITAPALPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMR

Query:  AQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVRKQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNP--
                                                                                                    ESS+NP  
Subjt:  AQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVRKQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNP--

Query:  AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------IAL
         G+ITREEFDQL+ K DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK  T+KPYDG+KDPKDYVE                     IAL
Subjt:  AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------IAL

Query:  TGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLRKEF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH SDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKG-SFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESAREKLLKR
        FAEVLQK KKVIDGQELLRTKTG P++ I +GR+GKD  KAD KS+DKG S SS R +Y+R+ +   +SRPYE +TPTTIPI EILTNIEE+  EKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKG-SFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESAREKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK-------------LGHKRKELARAARREVCIIREQ
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERK             + +K+KELAR ARREVCIIREQ
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK-------------LGHKRKELARAARREVCIIREQ

Query:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ
         PT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  GPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ

Query:  MVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDE
        M EFVVIDGRSAYNAIFGRPII SFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RDE
Subjt:  MVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDE

A0A6J1DZB9 uncharacterized protein LOC1110249042.5e-19169.75Show/hide
Query:  EIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEE
        +IALTGSARLWYRRLPARSISTYSQLRKEF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAH SDDSAMCYFLT LADE LTVKLGEE
Subjt:  EIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEE

Query:  APATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSS-GRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESAREK
        AP TF EVLQKAKKVIDGQELLRTKTG P+++I + +  ++  KAD KS+DKGS SS  R EY+R E+GP+RSRPYER+T +TIPISEILTNIEES  EK
Subjt:  APATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSS-GRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESAREK

Query:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK--------------------------LGHKR
        LLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERK                           G+KR
Subjt:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK--------------------------LGHKR

Query:  KELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG
        KELAR ARREVCIIRE  PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                                        G
Subjt:  KELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG

Query:  CIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDETLEFE
        CIDLPVT+GQD T+VTQM EFVVIDGRSAYNAIFGRPII SFRA+PSTLHQVLKY TPN VG VRGEQ  SRECYASALKGS+VCALE    R +  E E
Subjt:  CIDLPVTLGQDRTRVTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDETLEFE

Query:  ADLP---RKEFAAPTEELELVPLLSPEKQ
        ADLP   +++F  PTEELELVPLLSPE+Q
Subjt:  ADLP---RKEFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGACCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCTTAGCCACAAGACCCCTCCGCAGGTCGGCGCGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAAGCCACCCGTGGACGAGGTGAGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCACTCCAGAGAGAGATGGAGGCAATGCGCGCACAAATGCGCTCCATGGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCAAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGTTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCTTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCT
CGACCTACTCTCAGCTGAGAAAGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACG
CTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTACTCCGATGACTCGGCTATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCAC
GGTGAAACTTGGAGAGGAGGCCCCGGCCACGTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCTACCAAAAC
GAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATGGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCAAAGGGCGGAGAAC
GGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGCAAGGGAAAAACTACTCAAGCGTCC
TGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTG
AGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCTCGGACATAAAAGAAAGGAGTTAGCC
CGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTACCCCACAATGATGCACT
TGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCCAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGA
GGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGATTTGCCGGTCACGCTGGGGCAGGACCGAACTCGG
GTCACTCAAATGGTCGAGTTCGTGGTAATTGACGGTAGATCAGCCTATAACGCCATCTTTGGGAGACCCATCATCCGCTCATTTCGGGCCATTCCCTCAACACTTCATCA
AGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCG
AAACTCTCGTCGGTAGGGATGAGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAG
AAGCAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGAGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAGATGCCTAACCCCTGA
AGAGGGCCTAGTAGAACATTACGAACCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAAT
ATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCG
GCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACATACGTATTGGTGATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGACCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCTTAGCCACAAGACCCCTCCGCAGGTCGGCGCGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAAGCCACCCGTGGACGAGGTGAGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCACTCCAGAGAGAGATGGAGGCAATGCGCGCACAAATGCGCTCCATGGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCAAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGTTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCTTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCT
CGACCTACTCTCAGCTGAGAAAGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACG
CTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTACTCCGATGACTCGGCTATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCAC
GGTGAAACTTGGAGAGGAGGCCCCGGCCACGTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCTACCAAAAC
GAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATGGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCAAAGGGCGGAGAAC
GGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGCAAGGGAAAAACTACTCAAGCGTCC
TGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTG
AGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCTCGGACATAAAAGAAAGGAGTTAGCC
CGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTACCCCACAATGATGCACT
TGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCCAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGA
GGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGATTTGCCGGTCACGCTGGGGCAGGACCGAACTCGG
GTCACTCAAATGGTCGAGTTCGTGGTAATTGACGGTAGATCAGCCTATAACGCCATCTTTGGGAGACCCATCATCCGCTCATTTCGGGCCATTCCCTCAACACTTCATCA
AGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCG
AAACTCTCGTCGGTAGGGATGAGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAG
AAGCAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGAGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAGATGCCTAACCCCTGA
AGAGGGCCTAGTAGAACATTACGAACCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAAT
ATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCG
GCCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACATACGTATTGGTGATTTAA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATRPLRRSARITAPALPPAHPRTSKATRGRGETSKKGARGPAPAPTSENFDALQREMEAMRAQMRSMEEMY
NEMMLAAGAGSRSENRVTRVDVRKQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSHNPAGIITREEFDQLRGKLDAQVEA
LKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVEIALTGSARLWYRRLPARSISTYSQLRKEFLAQFSSRHYDKKTATHLATIRQKEGET
LREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGLPKRKIGRGRSGKDMEKADPKSKDKGSFSSGRAEYQRAEN
GPTRSRPYERFTPTTIPISEILTNIEESAREKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKLGHKRKELA
RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTR
VTQMVEFVVIDGRSAYNAIFGRPIIRSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDETLEFEADLPRKEFAAPTEELELVPLLSPE
KQERRKLARRAARFVVRDEALYRRGFSLPLLRCLTPEEGLVEHYEPTTNEEELLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDP
AWEGPFEVKGIVRPGTYVLVI