; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g38720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g38720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:29141147..29145682
RNA-Seq ExpressionMoc08g38720
SyntenyMoc08g38720
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.1e-26289.39Show/hide
Query:  QAESSHNPATPAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE-------------
        +AESS NPATPAG+ITRE+FDQLRG+L+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDGSKDPKDYVE             
Subjt:  QAESSHNPATPAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEAL
                IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYV RF+EEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVI+GQELLRTKTGRPERKIGRG S KDIE AD KSKDKGSF S RAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKE+RKRSRTPPR TDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD ADLE+VHLPHNDALVIAPLIDHVVV RVLVDGG SANI+SLPTY ALGWTR QLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTFGQDKTQVTQMAEFV
        SVIPEG IDLPVT GQD+TQVTQMAEFV
Subjt:  SVIPEGCIDLPVTFGQDKTQVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]6.2e-20287.44Show/hide
Query:  KEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQ
        K+  LNDGDLGES FTSDVLEAPIP KFKAPTVKPYDGSKDPKDYVE                     IALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYV RF+EEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVI+GQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKI

Query:  GRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRG S KD+E+AD KSKDKGSF S RAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVH
        WELKRQIEDLIQDGYFKKFVGKP TSSAEKKE+RKRSRTPPR TDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ PTCPITFD AD E+VH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]4.3e-23569.69Show/hide
Query:  SSNQQAESSHNPATPAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------
        SSNQQAESSHNPATP G+ITRE+FDQLRGKLNAQVEALKAKCEQKEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVE         
Subjt:  SSNQQAESSHNPATPAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------

Query:  ------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLA
                    IALTGSARLW                                                     F+E+QLKVA  SDDSAMCYFLTGLA
Subjt:  ------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVI+GQELLRTKTGRPER I RG S KD EKADLKSKDKGSF S RAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKE+RK SRTP R  DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLE+VHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY ALGWTR QLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTFGQDKTQVTQMAEFVVIN--------------------------------------VRGEQTASRECYASALKGSSVCALETH
        FS ESVIPEGCIDLPVT G D+TQVTQMAEFVVI+                                      VRGEQ ASRECYASALKGSSVCALET 
Subjt:  FSGESVIPEGCIDLPVTFGQDKTQVTQMAEFVVIN--------------------------------------VRGEQTASRECYASALKGSSVCALETH

Query:  ASGKGTPEFKADLPRREFSAPTEELELVPLLSPEKQLALAYETDLARSVPVEILDNPSILEPDLMEIGAPEPL
         S  GT EFKA+LPRREF+APTEELELVPLL  +    + +E +L     +  +D+      D+   G PEPL
Subjt:  ASGKGTPEFKADLPRREFSAPTEELELVPLLSPEKQLALAYETDLARSVPVEILDNPSILEPDLMEIGAPEPL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.6e-22959.24Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAALVEGQGHDSLVTEPLRRSARITAPVLPLAHPRTSKATRGRGGTSKKGAWDPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA +VEGQGH+ L TEPL RSARIT PVLP AHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAALVEGQGHDSLVTEPLRRSARITAPVLPLAHPRTSKATRGRGGTSKKGAWDPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRVTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSFRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRVTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSFRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IA
        P G+ITRE+FDQL+ K +AQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IP KFK PT+KPYDGSKDPKDYVE                     IA
Subjt:  PAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYV RF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKG-SFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLK
        TFAEVLQK KKVI+GQELLRTKTGRPE+ I +G + KD  KAD KS+DKG S  S R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLK
Subjt:  TFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKG-SFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKE+RKR RTPPR  DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVGFSGESVIPEGCID
        AR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANI+SL TY ALGWTR QLKKSPTPLVGFSGES+  EGCID
Subjt:  ARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVGFSGESVIPEGCID

Query:  LPVTFGQDKTQVTQMAEFVVIN--------------------------------------VRGEQTASRECYASALKGSSVCALE
        LPV+  QD TQVTQMAEFVVI+                                      VRGE   SRECYAS  K SSVCALE
Subjt:  LPVTFGQDKTQVTQMAEFVVIN--------------------------------------VRGEQTASRECYASALKGSSVCALE

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]8.7e-19685.95Show/hide
Query:  GIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IALT
        GIITRE+FDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG+KDPKDYVE                     IALT
Subjt:  GIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYV RF+EEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE
        AEVLQKAKKVI+GQELLRTKTGRPERKIGRG S KD+E+AD KSKDKGSF S RAEYRRAENGPTRSRPYERFTPTTIPI EILTNIEESGMEKLLKRPE
Subjt:  AEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP TSSAEKKE+RKRSRTPPR TDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARREVCIIREQRPTCPITFD
        ARRE+   +E R    +  D
Subjt:  ARREVCIIREQRPTCPITFD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.0e-26289.39Show/hide
Query:  QAESSHNPATPAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE-------------
        +AESS NPATPAG+ITRE+FDQLRG+L+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDGSKDPKDYVE             
Subjt:  QAESSHNPATPAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE-------------

Query:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEAL
                IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYV RF+EEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  --------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVI+GQELLRTKTGRPERKIGRG S KDIE AD KSKDKGSF S RAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKE+RKRSRTPPR TDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD ADLE+VHLPHNDALVIAPLIDHVVV RVLVDGG SANI+SLPTY ALGWTR QLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTFGQDKTQVTQMAEFV
        SVIPEG IDLPVT GQD+TQVTQMAEFV
Subjt:  SVIPEGCIDLPVTFGQDKTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188232.1e-23569.69Show/hide
Query:  SSNQQAESSHNPATPAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------
        SSNQQAESSHNPATP G+ITRE+FDQLRGKLNAQVEALKAKCEQKEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVE         
Subjt:  SSNQQAESSHNPATPAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------

Query:  ------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLA
                    IALTGSARLW                                                     F+E+QLKVA  SDDSAMCYFLTGLA
Subjt:  ------------IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVI+GQELLRTKTGRPER I RG S KD EKADLKSKDKGSF S RAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKE+RK SRTP R  DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLE+VHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY ALGWTR QLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTFGQDKTQVTQMAEFVVIN--------------------------------------VRGEQTASRECYASALKGSSVCALETH
        FS ESVIPEGCIDLPVT G D+TQVTQMAEFVVI+                                      VRGEQ ASRECYASALKGSSVCALET 
Subjt:  FSGESVIPEGCIDLPVTFGQDKTQVTQMAEFVVIN--------------------------------------VRGEQTASRECYASALKGSSVCALETH

Query:  ASGKGTPEFKADLPRREFSAPTEELELVPLLSPEKQLALAYETDLARSVPVEILDNPSILEPDLMEIGAPEPL
         S  GT EFKA+LPRREF+APTEELELVPLL  +    + +E +L     +  +D+      D+   G PEPL
Subjt:  ASGKGTPEFKADLPRREFSAPTEELELVPLLSPEKQLALAYETDLARSVPVEILDNPSILEPDLMEIGAPEPL

A0A6J1D9W7 uncharacterized protein LOC1110187083.0e-20287.44Show/hide
Query:  KEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQ
        K+  LNDGDLGES FTSDVLEAPIP KFKAPTVKPYDGSKDPKDYVE                     IALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYV RF+EEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVI+GQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKI

Query:  GRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRG S KD+E+AD KSKDKGSF S RAEYRRAE+GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVH
        WELKRQIEDLIQDGYFKKFVGKP TSSAEKKE+RKRSRTPPR TDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ PTCPITFD AD E+VH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204797.6e-23059.24Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAALVEGQGHDSLVTEPLRRSARITAPVLPLAHPRTSKATRGRGGTSKKGAWDPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA +VEGQGH+ L TEPL RSARIT PVLP AHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAALVEGQGHDSLVTEPLRRSARITAPVLPLAHPRTSKATRGRGGTSKKGAWDPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRVTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSFRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRVTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSFRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IA
        P G+ITRE+FDQL+ K +AQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IP KFK PT+KPYDGSKDPKDYVE                     IA
Subjt:  PAGIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYV RF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKG-SFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLK
        TFAEVLQK KKVI+GQELLRTKTGRPE+ I +G + KD  KAD KS+DKG S  S R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLK
Subjt:  TFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKG-SFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKE+RKR RTPPR  DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVGFSGESVIPEGCID
        AR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANI+SL TY ALGWTR QLKKSPTPLVGFSGES+  EGCID
Subjt:  ARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSPTPLVGFSGESVIPEGCID

Query:  LPVTFGQDKTQVTQMAEFVVIN--------------------------------------VRGEQTASRECYASALKGSSVCALE
        LPV+  QD TQVTQMAEFVVI+                                      VRGE   SRECYAS  K SSVCALE
Subjt:  LPVTFGQDKTQVTQMAEFVVIN--------------------------------------VRGEQTASRECYASALKGSSVCALE

A0A6J1DS95 uncharacterized protein LOC1110234214.2e-19685.95Show/hide
Query:  GIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IALT
        GIITRE+FDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG+KDPKDYVE                     IALT
Subjt:  GIITREDFDQLRGKLNAQVEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVE---------------------IALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYV RF+EEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE
        AEVLQKAKKVI+GQELLRTKTGRPERKIGRG S KD+E+AD KSKDKGSF S RAEYRRAENGPTRSRPYERFTPTTIPI EILTNIEESGMEKLLKRPE
Subjt:  AEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP TSSAEKKE+RKRSRTPPR TDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARREVCIIREQRPTCPITFD
        ARRE+   +E R    +  D
Subjt:  ARREVCIIREQRPTCPITFD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCATTGGTAGAGGGGCAAGGTCACGA
CAGCCTAGTAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACTTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCTGGGATCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCCTTCCGAAAAGGACAGTCACCATCCCGCT
CACATCGGAGCTCCAACCAGCAGGCTGAATCTTCTCACAACCCAGCAACTCCCGCAGGGATAATCACAAGGGAGGATTTCGACCAGCTAAGGGGCAAGCTCAATGCTCAG
GTTGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCTACTGAACGATGGCGACCTGGGAGAATCGCCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCTGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCCTAAGGATTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCA
GGTCGATCTCGACCTACTCTCAGCTAAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACTCATCTCGCCACCATCAGACAGAAGGAG
GGTGAGACGCTGCGAGAATATGTCAACAGGTTCAAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGA
AGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCTGAAGTGCTTCAGAAGGCGAAGAAAGTCATCAATGGACAGGAGCTCCTCCGAACCAAAACCGGCC
GACCAGAACGAAAGATCGGCCGGGGCAGTAGTAGAAAAGATATAGAAAAGGCAGATCTCAAGTCCAAGGACAAGGGATCCTTCTTTAGTCGCCGAGCTGAGTATCGAAGG
GCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTCCT
CAAACGCCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGATTGCTGGGAATTGAAGC
GCCAAATTGAGGATCTAATTCAAGATGGATACTTCAAGAAATTTGTGGGGAAGCCCGGCACCAGCTCGGCAGAAAAAAAAGAAGATAGGAAGCGTTCGCGGACGCCGCCC
CGGTGCACTGACCGACCTGCGGTCATCAACACCATTTTCGGAGGGCCAAGCGGGGGCCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTG
CATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGCGCAGACTTGGAGAAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATC
ATGTGGTGGTCAGAAGGGTGCTAGTAGACGGGGGCGCATCTGCTAATATCATGTCCCTACCGACCTACCACGCCCTGGGATGGACAAGGTTGCAGTTGAAGAAAAGCCCA
ACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAAGGTTGCATCGACTTGCCGGTCACATTTGGGCAAGACAAAACTCAGGTCACCCAAATGGCCGAGTTCGT
GGTGATTAACGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGATCATCGGTATGCGCCCTCGAAACTCACGCCAGTGGGAAGGGGACAC
CCGAGTTCAAGGCCGACCTGCCAAGAAGGGAGTTTTCCGCGCCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATTGGCGTACGAGACC
GACCTGGCCAGGTCAGTCCCCGTTGAAATCTTGGATAATCCCTCGATCTTGGAGCCAGATCTGATGGAGATTGGCGCTCCAGAGCCCTTATGGATGGACCCGATTATGGA
CTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGTGCAGAAAATTGGCAAGGCAGGCAGCTCGGTTCGTGGTCCGAGGTGGAGCGTTGTACCGGCGCGGCTTTTCCC
TGCCTCTACTGAATGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCATTGGTAGAGGGGCAAGGTCACGA
CAGCCTAGTAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACTTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCTGGGATCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCCTTCCGAAAAGGACAGTCACCATCCCGCT
CACATCGGAGCTCCAACCAGCAGGCTGAATCTTCTCACAACCCAGCAACTCCCGCAGGGATAATCACAAGGGAGGATTTCGACCAGCTAAGGGGCAAGCTCAATGCTCAG
GTTGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCTACTGAACGATGGCGACCTGGGAGAATCGCCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCTGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCCTAAGGATTATGTTGAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCA
GGTCGATCTCGACCTACTCTCAGCTAAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACTCATCTCGCCACCATCAGACAGAAGGAG
GGTGAGACGCTGCGAGAATATGTCAACAGGTTCAAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGA
AGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCTGAAGTGCTTCAGAAGGCGAAGAAAGTCATCAATGGACAGGAGCTCCTCCGAACCAAAACCGGCC
GACCAGAACGAAAGATCGGCCGGGGCAGTAGTAGAAAAGATATAGAAAAGGCAGATCTCAAGTCCAAGGACAAGGGATCCTTCTTTAGTCGCCGAGCTGAGTATCGAAGG
GCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTCCT
CAAACGCCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGATTGCTGGGAATTGAAGC
GCCAAATTGAGGATCTAATTCAAGATGGATACTTCAAGAAATTTGTGGGGAAGCCCGGCACCAGCTCGGCAGAAAAAAAAGAAGATAGGAAGCGTTCGCGGACGCCGCCC
CGGTGCACTGACCGACCTGCGGTCATCAACACCATTTTCGGAGGGCCAAGCGGGGGCCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTG
CATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGCGCAGACTTGGAGAAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATC
ATGTGGTGGTCAGAAGGGTGCTAGTAGACGGGGGCGCATCTGCTAATATCATGTCCCTACCGACCTACCACGCCCTGGGATGGACAAGGTTGCAGTTGAAGAAAAGCCCA
ACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAAGGTTGCATCGACTTGCCGGTCACATTTGGGCAAGACAAAACTCAGGTCACCCAAATGGCCGAGTTCGT
GGTGATTAACGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGATCATCGGTATGCGCCCTCGAAACTCACGCCAGTGGGAAGGGGACAC
CCGAGTTCAAGGCCGACCTGCCAAGAAGGGAGTTTTCCGCGCCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATTGGCGTACGAGACC
GACCTGGCCAGGTCAGTCCCCGTTGAAATCTTGGATAATCCCTCGATCTTGGAGCCAGATCTGATGGAGATTGGCGCTCCAGAGCCCTTATGGATGGACCCGATTATGGA
CTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGTGCAGAAAATTGGCAAGGCAGGCAGCTCGGTTCGTGGTCCGAGGTGGAGCGTTGTACCGGCGCGGCTTTTCCC
TGCCTCTACTGAATGTCTAA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAALVEGQGHDSLVTEPLRRSARITAPVLPLAHPRTSKATRGRGGTSKKGAWDPAPAPTSENFDALQREMEAMRTQMRSMEEMY
NEMILAAGAGSRSENRVTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSFRKGQSPSRSHRSSNQQAESSHNPATPAGIITREDFDQLRGKLNAQ
VEALKAKCEQKEGLLNDGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKE
GETLREYVNRFKEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVINGQELLRTKTGRPERKIGRGSSRKDIEKADLKSKDKGSFFSRRAEYRR
AENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEDRKRSRTPP
RCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEKVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYHALGWTRLQLKKSP
TPLVGFSGESVIPEGCIDLPVTFGQDKTQVTQMAEFVVINVRGEQTASRECYASALKGSSVCALETHASGKGTPEFKADLPRREFSAPTEELELVPLLSPEKQLALAYET
DLARSVPVEILDNPSILEPDLMEIGAPEPLWMDPIMDFIRGNSPQDPKECRKLARQAARFVVRGGALYRRGFSLPLLNV