CuGenDBv2

Moc03g15770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g15770
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr3:10525786..10531239
RNA-Seq Expression: Moc03g15770
Synteny: Moc03g15770
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] | e value: 2.5e-254 | % identity: 90.67
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPA
        G+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD KDYVE                     IALTGSARLWYRRLPA
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPA

Query:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID
         SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID
Subjt:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
        GQELLRTKTGRPERKIGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRA +GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
Subjt:  GQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG
        KYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG KRKELARAARREVCIIREQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG

Query:  PTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM
        PTCPITFDGAD EEVHLP+NDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQD+T+VTQM
Subjt:  PTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM

Query:  ADFV
        A+FV
Subjt:  ADFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia] | e value: 6.2e-213 | % identity: 91.94
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKD KDYVE                     IALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKDVERADPKSKDKGSFSSGRAEYRRA SGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVH

Query:  LPYNDALVIAPLIDHVVVRRVL
        LP+NDA VIAPLIDHVVVRRVL
Subjt:  LPYNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] | e value: 4.3e-246 | % identity: 73.68
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPA
        G+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDGSKD KDYVE                     IALTGSARLW      
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPA

Query:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID
                                                       FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVID
Subjt:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
        GQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA +GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+KD
Subjt:  GQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG
        KYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG

Query:  PTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM
        PTCPITFD AD EEVHLP+NDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQM
Subjt:  PTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM

Query:  ADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKGSSVCALENLTGGDGTLEFEVDLPRKEFAAPTEELELV
        A+FVVIDGRSAYNAIFGR IIHSFRAIPSTLHQVLKY TPNGVG V+GEQ ASRECYASALKGSSVCALE L   DGTLEF+ +LPR+EFAAPTEELELV
Subjt:  ADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKGSSVCALENLTGGDGTLEFEVDLPRKEFAAPTEELELV

Query:  PLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG
        PLL  +      +E +L     +  +D+   +E  P+ + +G
Subjt:  PLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] | e value: 7.8e-224 | % identity: 89.91
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRA +GPT+SRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRR DRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD AD  EVHLP+NDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMA+FVV+DGRSAYNAIFGR IIHSFRAIPSTLHQVLKY TPNGVGTV+GEQ ASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKG

Query:  SSVCALENLTGGDGTLEFEVDLPRKEFAAPTEELELVPLLSPEKQL
        +SVCALE LT  DGTLEFE DLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALENLTGGDGTLEFEVDLPRKEFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | e value: 1.1e-257 | % identity: 69.83
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARRPAPAPTRSRSENRVTRVDVRE
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L  EPL RSARIT P LPPAHP+ SKA                       S N +T      
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARRPAPAPTRSRSENRVTRVDVRE

Query:  HRGSHLGPVEEERPEDNESEGDNHKGGVRPAEGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE-----
              G +  E  +  +S              + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKD KDYVE     
Subjt:  HRGSHLGPVEEERPEDNESEGDNHKGGVRPAEGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE-----

Query:  ----------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
                        IALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFL
Subjt:  ----------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAGSGPTKSRPYERFTPTTIP
        TGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS R +YRR+ S   +SRPYE +TPTTIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAGSGPTKSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVIN
        I EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP
                     K+KELAR ARREVCIIREQ PT  I F+ AD E VHLP+NDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SP
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKGSSVC
        TPLVGFSGES+  EGCIDLPV++ QD T+VTQMA+FVVIDGRSAYNAIFGR IIHSFRA+PSTLHQVLKY T NGVGTV+GE   SRECYAS  K SSVC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKGSSVC

Query:  ALENLT
        ALE  T
Subjt:  ALENLT

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 | e value: 1.2e-254 | % identity: 90.67
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPA
        G+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD KDYVE                     IALTGSARLWYRRLPA
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPA

Query:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID
         SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID
Subjt:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
        GQELLRTKTGRPERKIGRGRSGKD+E ADPKSKDKGSFSSGRAEYRRA +GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
Subjt:  GQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG
        KYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG KRKELARAARREVCIIREQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG

Query:  PTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM
        PTCPITFDGAD EEVHLP+NDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQD+T+VTQM
Subjt:  PTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM

Query:  ADFV
        A+FV
Subjt:  ADFV

A0A6J1D9E1 uncharacterized protein LOC111018823 | e value: 2.1e-246 | % identity: 73.68
Query:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPA
        G+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDGSKD KDYVE                     IALTGSARLW      
Subjt:  GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPA

Query:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID
                                                       FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVID
Subjt:  RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVID

Query:  GQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
        GQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSGRAE+RRA +GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+KD
Subjt:  GQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD

Query:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG
        KYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ 
Subjt:  KYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQG

Query:  PTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM
        PTCPITFD AD EEVHLP+NDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQM
Subjt:  PTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQM

Query:  ADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKGSSVCALENLTGGDGTLEFEVDLPRKEFAAPTEELELV
        A+FVVIDGRSAYNAIFGR IIHSFRAIPSTLHQVLKY TPNGVG V+GEQ ASRECYASALKGSSVCALE L   DGTLEF+ +LPR+EFAAPTEELELV
Subjt:  ADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKGSSVCALENLTGGDGTLEFEVDLPRKEFAAPTEELELV

Query:  PLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG
        PLL  +      +E +L     +  +D+   +E  P+ + +G
Subjt:  PLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG

A0A6J1D9W7 uncharacterized protein LOC111018708 | e value: 3.0e-213 | % identity: 91.94
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKD KDYVE                     IALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKDVERADPKSKDKGSFSSGRAEYRRA SGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVH

Query:  LPYNDALVIAPLIDHVVVRRVL
        LP+NDA VIAPLIDHVVVRRVL
Subjt:  LPYNDALVIAPLIDHVVVRRVL

A0A6J1DD03 uncharacterized protein LOC111019899 | e value: 3.8e-224 | % identity: 89.91
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRA +GPT+SRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRR DRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD AD  EVHLP+NDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMA+FVV+DGRSAYNAIFGR IIHSFRAIPSTLHQVLKY TPNGVGTV+GEQ ASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKG

Query:  SSVCALENLTGGDGTLEFEVDLPRKEFAAPTEELELVPLLSPEKQL
        +SVCALE LT  DGTLEFE DLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCALENLTGGDGTLEFEVDLPRKEFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC111020479 | e value: 5.2e-258 | % identity: 69.83
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARRPAPAPTRSRSENRVTRVDVRE
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L  EPL RSARIT P LPPAHP+ SKA                       S N +T      
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARRPAPAPTRSRSENRVTRVDVRE

Query:  HRGSHLGPVEEERPEDNESEGDNHKGGVRPAEGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE-----
              G +  E  +  +S              + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKD KDYVE     
Subjt:  HRGSHLGPVEEERPEDNESEGDNHKGGVRPAEGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVE-----

Query:  ----------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
                        IALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFL
Subjt:  ----------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAGSGPTKSRPYERFTPTTIP
        TGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS R +YRR+ S   +SRPYE +TPTTIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAGSGPTKSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVIN
        I EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP
                     K+KELAR ARREVCIIREQ PT  I F+ AD E VHLP+NDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SP
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKGSSVC
        TPLVGFSGES+  EGCIDLPV++ QD T+VTQMA+FVVIDGRSAYNAIFGR IIHSFRA+PSTLHQVLKY T NGVGTV+GE   SRECYAS  K SSVC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMADFVVIDGRSAYNAIFGRSIIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKGSSVC

Query:  ALENLT
        ALE  T
Subjt:  ALENLT

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAAGGGCAAGGT
CACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCAGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGCGTCCAGCCCCGGCTCCAACAAGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCACAGGGGTTCCCAC
CTCGGCCCAGTCGAGGAGGAACGGCCCGAAGACAACGAGAGCGAGGGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGCGAGCTCGATGCTCAGGTGGAG
GCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGATGTTTTGGAAGCACCAATCCCTCCGAAG
TTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCTCAAGGACTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCA
GCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCCTCTCGGCACTATGACAAAAAGACAGCAACCCATCTTGCCACCATCAGG
CAGAAGGAGGGTGAAACTCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACC
GGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGGGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTC
CTCCGAACCAAAACCGGCCGACCGGAGCGAAAGATCGGCCGGGGTAGAAGTGGAAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCC
AGCGGCCGAGCTGAGTATCGAAGGGCGGGGAGCGGACCTACCAAGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAAC
ATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCAC
GGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCA
GAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCATCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGAGGTCAGTCCGGA
CATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTTTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGATGGTGCCGACTCGGAGGAG
GTCCACCTGCCCTACAATGATGCACTTGTGATCGCGCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGTGCATCTGCCAACATCCTGTCC
TTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATC
GACTTGCCGGTCACGCTGGGGCAGGATCGAACTCGAGTCACTCAAATGGCCGACTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGATCC
ATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCAAGGAGAACAGGCCGCTTCGAGG
GAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAAATCTCACCGGTGGGGATGGGACGCTCGAGTTCGAGGTCGACCTGCCGAGGAAGGAG
TTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTTTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAG
ATCTTAGATAATCCTTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATTGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCA
CAAGACCCCAAGGAGCGCAGAGAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCTCTGCCCCTCTTGAGATGC
CTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAACCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAG
CTACGCCTGGTGGAATATCAAAGCAGAATGGCCCCCACATTACAACGCCCGCGTTCGACCTCGGACCTTTCAGGTCAGACATCTGGTCTTAAGGCGGGTCCAAAC
CCATGTGGGTGCTCTTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAAGGGCAAGGT
CACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCAGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGCGTCCAGCCCCGGCTCCAACAAGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCACAGGGGTTCCCAC
CTCGGCCCAGTCGAGGAGGAACGGCCCGAAGACAACGAGAGCGAGGGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGCGAGCTCGATGCTCAGGTGGAG
GCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGATGTTTTGGAAGCACCAATCCCTCCGAAG
TTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCTCAAGGACTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCA
GCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCCTCTCGGCACTATGACAAAAAGACAGCAACCCATCTTGCCACCATCAGG
CAGAAGGAGGGTGAAACTCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACC
GGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGGGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTC
CTCCGAACCAAAACCGGCCGACCGGAGCGAAAGATCGGCCGGGGTAGAAGTGGAAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCC
AGCGGCCGAGCTGAGTATCGAAGGGCGGGGAGCGGACCTACCAAGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAAC
ATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCAC
GGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCA
GAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCATCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGAGGTCAGTCCGGA
CATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTTTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGATGGTGCCGACTCGGAGGAG
GTCCACCTGCCCTACAATGATGCACTTGTGATCGCGCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGTGCATCTGCCAACATCCTGTCC
TTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATC
GACTTGCCGGTCACGCTGGGGCAGGATCGAACTCGAGTCACTCAAATGGCCGACTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGATCC
ATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCAAGGAGAACAGGCCGCTTCGAGG
GAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAAATCTCACCGGTGGGGATGGGACGCTCGAGTTCGAGGTCGACCTGCCGAGGAAGGAG
TTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTTTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAG
ATCTTAGATAATCCTTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATTGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCA
CAAGACCCCAAGGAGCGCAGAGAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCTCTGCCCCTCTTGAGATGC
CTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAACCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAG
CTACGCCTGGTGGAATATCAAAGCAGAATGGCCCCCACATTACAACGCCCGCGTTCGACCTCGGACCTTTCAGGTCAGACATCTGGTCTTAAGGCGGGTCCAAAC
CCATGTGGGTGCTCTTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARRPAPAPTRSRSENRVTRVDVREHRGSH
LGPVEEERPEDNESEGDNHKGGVRPAEGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDLKDYVEIALTGSARLWYRRLP
ARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQEL
LRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAGSGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH
GHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEE
VHLPYNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMADFVVIDGRSAYNAIFGRS
IIHSFRAIPSTLHQVLKYPTPNGVGTVQGEQAASRECYASALKGSSVCALENLTGGDGTLEFEVDLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVE
ILDNPSILEPDLMEIGAPESSLMDPIADFIRGNSPQDPKERRELARRAARFVVRDGALYRRGFSLPLLRCLTPEEGLVEHYEPTTNEEELLLNLDLLEERRAMAQ
LRLVEYQSRMAPTLQRPRSTSDLSGQTSGLKAGPNPCGCS