; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g06100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g06100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr10:4368561..4373069
RNA-Seq ExpressionMoc10g06100
SyntenyMoc10g06100
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.2e-25889.96Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGES-------------SFKAPTVKPYDGSKDPKDYVE-------------
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGES              FKAPTVKPYDGSKDPKDYVE             
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGES-------------SFKAPTVKPYDGSKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEAL
                IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATH+ATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA EAL
Subjt:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAEN PT SRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGE
        QSG+KRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKK  TPLVGFSGE
Subjt:  QSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGE

Query:  SVIPEGFIDLPVTLGQNQTQVTQMAEFV
        SVIPEGFIDLPVTLGQ+QTQVTQMAEFV
Subjt:  SVIPEGFIDLPVTLGQNQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.1e-25979.78Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESSF-----KAPTVKPYDGSKDPKDYVE-----------------
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGES F     +APTVK YDGSKDPKDYVE                 
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESSF-----KAPTVKPYDGSKDPKDYVE-----------------

Query:  ----IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKL
            IALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA EALTVKL
Subjt:  ----IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKL

Query:  GEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGM
        G+EAPATFAEVLQKAKKVIDGQELLRTKT RPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA N PT SRPYERFTPTTIPISEILTNIEESGM
Subjt:  GEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGM

Query:  EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQ
        EKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSG 
Subjt:  EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQ

Query:  KRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIP
        KRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKK  TPLVGFS ESVIP
Subjt:  KRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIP

Query:  EGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLE
        EG IDLPVTLG +QTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCA+ETL S DGTLE
Subjt:  EGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLE

Query:  FEADLPRMEFAAPTEELELVPLL
        F+A+LPR EFAAPTEELELVPLL
Subjt:  FEADLPRMEFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]3.5e-22391.28Show/hide
Query:  MCYFLTGLAYEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTP
        MCYFLTGLA EALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAEN PT SRPYERFTP
Subjt:  MCYFLTGLAYEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSG KRK+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKILTPLVGFSGESVIPEGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KK  TPLVGFSGESV+PEG IDLPVTLGQ+QT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKILTPLVGFSGESVIPEGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCAIETLASGDGTLEFEADLPRMEFAAPTEELELVPLLSPEKQGQ
        +SVCA+ETL S DGTLEFEADLP  EFAAP EELELVPLLS EKQ Q
Subjt:  SSVCAIETLASGDGTLEFEADLPRMEFAAPTEELELVPLLSPEKQGQ

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]6.6e-25466.45Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSNKGARGPALASTSAGSRSENRVTRVG
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP                                        
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSNKGARGPALASTSAGSRSENRVTRVG

Query:  IREQRGSHLGPVEEEHAEDNESEGHTRQRGDLREHLNRKRGSSLRKRQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQK
                                                         PS+        AESS NP TP GVITREEFDQL+ + DAQVEALKA+CE+K
Subjt:  IREQRGSHLGPVEEEHAEDNESEGHTRQRGDLREHLNRKRGSSLRKRQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQK

Query:  EGPLNDGDLGESS-------------FKAPTVKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQF
        E   +DGDLGE S             FK PT+KPYDGSKDPKDYVE                     IALTGSARLWYRRLPAR ISTYSQLR+EF++QF
Subjt:  EGPLNDGDLGESS-------------FKAPTVKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQF

Query:  SSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIG
        SSRHYD+KT TH+ATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLA E LTVKL EEAPATFAEVLQK KKVIDGQELLRTKT RPE+ I 
Subjt:  SSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIG

Query:  RGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        +GR+GKD  KAD KS+DKG S SS R +YRR+ +    SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ 
Subjt:  RGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQRPT  I F+ ADLE VH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIPEGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIF
        LPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKK  TPLVGFSGES+  EG IDLPV++ Q+ TQVTQMAEFVVIDGRSAYNAIF
Subjt:  LPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIPEGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIF

Query:  GRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE
        GRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCA+E
Subjt:  GRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.0e-21074.48Show/hide
Query:  EIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKLGEE
        +IALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATH+ATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LA E LTVKLGEE
Subjt:  EIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKLGEE

Query:  APATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGMEK
        AP TF EVLQKAKKVIDGQELLRTKT RPE++I + +  ++  KAD KS+DKGS SS  R EYRR E+ P+ SRPYER+T +TIPISEILTNIEESGMEK
Subjt:  APATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGMEK

Query:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQKR
        LLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGP+GGQSG KR
Subjt:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQKR

Query:  KELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIPEG
        KELAR ARREVCIIRE +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DGG                                        
Subjt:  KELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIPEG

Query:  FIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFE
         IDLPVT+GQ+ TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VCA+E   +     E E
Subjt:  FIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFE

Query:  ADLP---RMEFAAPTEELELVPLLSPEKQ
        ADLP   + +F  PTEELELVPLLSPE+Q
Subjt:  ADLP---RMEFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088135.6e-25989.96Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGES-------------SFKAPTVKPYDGSKDPKDYVE-------------
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGES              FKAPTVKPYDGSKDPKDYVE             
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGES-------------SFKAPTVKPYDGSKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEAL
                IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATH+ATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA EAL
Subjt:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAEN PT SRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGE
        QSG+KRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKK  TPLVGFSGE
Subjt:  QSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGE

Query:  SVIPEGFIDLPVTLGQNQTQVTQMAEFV
        SVIPEGFIDLPVTLGQ+QTQVTQMAEFV
Subjt:  SVIPEGFIDLPVTLGQNQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.5e-25979.78Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESSF-----KAPTVKPYDGSKDPKDYVE-----------------
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGES F     +APTVK YDGSKDPKDYVE                 
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESSF-----KAPTVKPYDGSKDPKDYVE-----------------

Query:  ----IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKL
            IALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA EALTVKL
Subjt:  ----IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKL

Query:  GEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGM
        G+EAPATFAEVLQKAKKVIDGQELLRTKT RPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA N PT SRPYERFTPTTIPISEILTNIEESGM
Subjt:  GEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGM

Query:  EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQ
        EKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSG 
Subjt:  EKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQ

Query:  KRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIP
        KRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKK  TPLVGFS ESVIP
Subjt:  KRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIP

Query:  EGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLE
        EG IDLPVTLG +QTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCA+ETL S DGTLE
Subjt:  EGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLE

Query:  FEADLPRMEFAAPTEELELVPLL
        F+A+LPR EFAAPTEELELVPLL
Subjt:  FEADLPRMEFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198991.7e-22391.28Show/hide
Query:  MCYFLTGLAYEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTP
        MCYFLTGLA EALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAEN PT SRPYERFTP
Subjt:  MCYFLTGLAYEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSG KRK+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKILTPLVGFSGESVIPEGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KK  TPLVGFSGESV+PEG IDLPVTLGQ+QT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKILTPLVGFSGESVIPEGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCAIETLASGDGTLEFEADLPRMEFAAPTEELELVPLLSPEKQGQ
        +SVCA+ETL S DGTLEFEADLP  EFAAP EELELVPLLS EKQ Q
Subjt:  SSVCAIETLASGDGTLEFEADLPRMEFAAPTEELELVPLLSPEKQGQ

A0A6J1DHB3 uncharacterized protein LOC1110204793.2e-25466.45Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSNKGARGPALASTSAGSRSENRVTRVG
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP                                        
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSNKGARGPALASTSAGSRSENRVTRVG

Query:  IREQRGSHLGPVEEEHAEDNESEGHTRQRGDLREHLNRKRGSSLRKRQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQK
                                                         PS+        AESS NP TP GVITREEFDQL+ + DAQVEALKA+CE+K
Subjt:  IREQRGSHLGPVEEEHAEDNESEGHTRQRGDLREHLNRKRGSSLRKRQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQK

Query:  EGPLNDGDLGESS-------------FKAPTVKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQF
        E   +DGDLGE S             FK PT+KPYDGSKDPKDYVE                     IALTGSARLWYRRLPAR ISTYSQLR+EF++QF
Subjt:  EGPLNDGDLGESS-------------FKAPTVKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQF

Query:  SSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIG
        SSRHYD+KT TH+ATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLA E LTVKL EEAPATFAEVLQK KKVIDGQELLRTKT RPE+ I 
Subjt:  SSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIG

Query:  RGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        +GR+GKD  KAD KS+DKG S SS R +YRR+ +    SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ 
Subjt:  RGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQRPT  I F+ ADLE VH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQKRKELARAARREVCIIREQRPTCPITFDSADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIPEGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIF
        LPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKK  TPLVGFSGES+  EG IDLPV++ Q+ TQVTQMAEFVVIDGRSAYNAIF
Subjt:  LPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIPEGFIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIF

Query:  GRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE
        GRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCA+E
Subjt:  GRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE

A0A6J1DZB9 uncharacterized protein LOC1110249049.7e-21174.48Show/hide
Query:  EIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKLGEE
        +IALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATH+ATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LA E LTVKLGEE
Subjt:  EIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKLGEE

Query:  APATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGMEK
        AP TF EVLQKAKKVIDGQELLRTKT RPE++I + +  ++  KAD KS+DKGS SS  R EYRR E+ P+ SRPYER+T +TIPISEILTNIEESGMEK
Subjt:  APATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSS-GRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGMEK

Query:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQKR
        LLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGP+GGQSG KR
Subjt:  LLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQKR

Query:  KELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIPEG
        KELAR ARREVCIIRE +PTC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DGG                                        
Subjt:  KELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIPEG

Query:  FIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFE
         IDLPVT+GQ+ TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VCA+E   +     E E
Subjt:  FIDLPVTLGQNQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFE

Query:  ADLP---RMEFAAPTEELELVPLLSPEKQ
        ADLP   + +F  PTEELELVPLLSPE+Q
Subjt:  ADLP---RMEFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGTAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAAGCCACCCGTGGCCGAGGTGGAACCT
CTAATAAGGGCGCCCGGGGTCCAGCCCTGGCCTCGACAAGCGCAGGGTCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGC
CCAGTCGAGGAGGAACATGCCGAGGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAAG
ACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGG
GCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGTCCTTCAAAGCTCCTACCGTGAAA
CCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCA
GCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATATCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATG
TCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCTACGAAGCCCTCACGGTGAAGCTTGGA
GAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCAGCCGACCTGAACGAAAGATCGGCCG
GGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACAGACCTACCGGGA
GCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGG
GGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAACGCCAAATTGAGGATCTAATTCA
AGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGG
TCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACAGAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCG
ACCTGCCCAATCACCTTCGACAGTGCAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCT
GGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCACAATTGAAGAAAATCCTGACACCGTTGGTTGGGTTTTCTG
GAGAATCGGTCATCCCAGAAGGTTTTATCGACTTGCCGGTCACACTCGGGCAGAACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCC
TATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGG
AGAACAGACCGCTTCGAGAGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTGCGCCATCGAAACTCTCGCCAGTGGGGATGGGACGCTCGAGTTCGAGGCCGACC
TGCCGAGGATGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTT
GGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGACTTTTCCCTGCCTCTATTGAGATGCCTAACCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGTAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAAGCCACCCGTGGCCGAGGTGGAACCT
CTAATAAGGGCGCCCGGGGTCCAGCCCTGGCCTCGACAAGCGCAGGGTCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGC
CCAGTCGAGGAGGAACATGCCGAGGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAAG
ACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGG
GCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGTCCTTCAAAGCTCCTACCGTGAAA
CCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCA
GCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATATCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATG
TCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCTACGAAGCCCTCACGGTGAAGCTTGGA
GAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCAGCCGACCTGAACGAAAGATCGGCCG
GGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACAGACCTACCGGGA
GCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGG
GGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAACGCCAAATTGAGGATCTAATTCA
AGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGG
TCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACAGAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCG
ACCTGCCCAATCACCTTCGACAGTGCAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCT
GGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCACAATTGAAGAAAATCCTGACACCGTTGGTTGGGTTTTCTG
GAGAATCGGTCATCCCAGAAGGTTTTATCGACTTGCCGGTCACACTCGGGCAGAACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCC
TATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGG
AGAACAGACCGCTTCGAGAGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTGCGCCATCGAAACTCTCGCCAGTGGGGATGGGACGCTCGAGTTCGAGGCCGACC
TGCCGAGGATGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTT
GGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGACTTTTCCCTGCCTCTATTGAGATGCCTAACCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSNKGARGPALASTSAGSRSENRVTRVGIREQRGSHLG
PVEEEHAEDNESEGHTRQRGDLREHLNRKRGSSLRKRQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESSFKAPTVK
PYDGSKDPKDYVEIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHIATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAYEALTVKLG
EEAPATFAEVLQKAKKVIDGQELLRTKTSRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENRPTGSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLR
GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGQKRKELARAARREVCIIREQRP
TCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKILTPLVGFSGESVIPEGFIDLPVTLGQNQTQVTQMAEFVVIDGRSA
YNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFEADLPRMEFAAPTEELELVPLLSPEKQGQFTTRPQGAQKV
GKASSSVRGPRWSIVPTRLFPASIEMPNP