; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g18800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g18800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:13521442..13527013
RNA-Seq ExpressionMoc07g18800
SyntenyMoc07g18800
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.6e-25387.31Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAF
        AIKCRAF+IALTGSA+LWYRRLPA SISTYSQLRREFLA FSSR+YDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEA 
Subjt:  AIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAF

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT-----------
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKD GSFSSGRAEYRRAENGPTRSRPYERFTPTT           
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT-----------

Query:  ----------------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
                              KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ----------------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELAR ARREVCIIREQ  TCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDKTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDKTRVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]1.0e-19986.97Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSA+LWYRRLPA SISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQ

Query:  FSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEA TVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT---------------------------------KDKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKD GSFSSGRAEYRRAE+GPT+SRPYERFTPTT                                 KDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT---------------------------------KDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR ARREVCIIREQG TCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]8.7e-22375.31Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSA+LW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT--------
        DEA TVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKD GSFSSGRAE+RRA NGPTRSRPYERFTPTT        
Subjt:  DEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT--------

Query:  -------------------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGP
                                 KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  -------------------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELAR ARREVCIIREQ  TCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP
        S ESVIPEGCIDLPVTLG D+T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP
Subjt:  SGESVIPEGCIDLPVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.1e-23762.02Show/hide
Query:  MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPLHRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPASAPTSENFDALQREMEAIR
        M QPANSTNT DRR LAA++ HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPLHRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPASAPTSENFDALQREMEAIR

Query:  TQMRSMEAMYNEMVLTAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLTAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPAT
        TGSA+LWYRRLPA  ISTYSQLR+EF++QFSSR+YD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE  TVKL EEAPAT
Subjt:  TGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDNG-SFSSGRAEYRRAENGPTRSRPYERFTPTT---------------------
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+D G S SS R +YRR+ +   +SRPYE +TPTT                     
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDNG-SFSSGRAEYRRAENGPTRSRPYERFTPTT---------------------

Query:  ------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
                     DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++SVEKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  ------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  REARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL
        REARREVCIIREQ  T  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  REARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYST
        PV++ QD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST
Subjt:  PVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYST

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]7.2e-20162.58Show/hide
Query:  EQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E   Y+ +  DLR+HL  K+  +  + +   S SR   +SN +A+S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQ
        K+   +D DLGESPFTSD++EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSA+LW RRLPA SISTYSQLR+EF+ Q
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQ

Query:  FSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS R+YD+KTATHLATIRQKE ETL                                   TVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGR-SGKDERADPKSKDNGSFSSG-RAEYRRAENGPTRSRPYERFTPTTKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKE
         + R S K  + D KSKD GS SSG R EYRR+E+GP+RSRPYER                      CWELKRQIEDLIQD YFKKFVGKPR++SVEKKE
Subjt:  GRGR-SGKDERADPKSKDNGSFSSG-RAEYRRAENGPTRSRPYERFTPTTKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKE

Query:  ERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIL
        ERKRSRTPPRR DRPAVINTIFGGPSGGQ  +KRKELA EARR+V IIREQ  TC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANIL
Subjt:  ERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIL

Query:  SLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP--MAWARS
        SLPTYLAL  TRSQLK+SPTPLVGFS ESV PEGCIDLPVT+GQD T+VTQMAEFVV+DGR AYNAIF RPIIHSF+A+PS LHQVLKYSTP  +   R 
Subjt:  SLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP--MAWARS

Query:  EENRPLRGNAMPPHSKAHRSAPSKLPGMGRSSSRPTCRGRSLPHPPRSSSLFLCLVPRSR
        E+       A    S   RS+   L         P     S PH  RSSSLF CL  +++
Subjt:  EENRPLRGNAMPPHSKAHRSAPSKLPGMGRSSSRPTCRGRSLPHPPRSSSLFLCLVPRSR

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.8e-25387.31Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAF
        AIKCRAF+IALTGSA+LWYRRLPA SISTYSQLRREFLA FSSR+YDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEA 
Subjt:  AIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAF

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT-----------
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKD GSFSSGRAEYRRAENGPTRSRPYERFTPTT           
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT-----------

Query:  ----------------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
                              KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ----------------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELAR ARREVCIIREQ  TCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDKTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDKTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188234.2e-22375.31Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSA+LW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT--------
        DEA TVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKD GSFSSGRAE+RRA NGPTRSRPYERFTPTT        
Subjt:  DEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT--------

Query:  -------------------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGP
                                 KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  -------------------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELAR ARREVCIIREQ  TCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP
        S ESVIPEGCIDLPVTLG D+T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP
Subjt:  SGESVIPEGCIDLPVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP

A0A6J1D9W7 uncharacterized protein LOC1110187085.0e-20086.97Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSA+LWYRRLPA SISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQ

Query:  FSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEA TVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT---------------------------------KDKYCRFHREHGHNTSDC
        GRGRSGKD ERADPKSKD GSFSSGRAEYRRAE+GPT+SRPYERFTPTT                                 KDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKD-ERADPKSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTT---------------------------------KDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR ARREVCIIREQG TCPITFDGAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204791.0e-23762.02Show/hide
Query:  MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPLHRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPASAPTSENFDALQREMEAIR
        M QPANSTNT DRR LAA++ HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPLHRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPASAPTSENFDALQREMEAIR

Query:  TQMRSMEAMYNEMVLTAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLTAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPAT
        TGSA+LWYRRLPA  ISTYSQLR+EF++QFSSR+YD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE  TVKL EEAPAT
Subjt:  TGSAQLWYRRLPAGSISTYSQLRREFLAQFSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDNG-SFSSGRAEYRRAENGPTRSRPYERFTPTT---------------------
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+D G S SS R +YRR+ +   +SRPYE +TPTT                     
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDNG-SFSSGRAEYRRAENGPTRSRPYERFTPTT---------------------

Query:  ------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
                     DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++SVEKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  ------------KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  REARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL
        REARREVCIIREQ  T  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  REARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYST
        PV++ QD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST
Subjt:  PVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYST

A0A6J1DPC9 uncharacterized protein LOC1110222803.5e-20162.58Show/hide
Query:  EQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E   Y+ +  DLR+HL  K+  +  + +   S SR   +SN +A+S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQ
        K+   +D DLGESPFTSD++EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSA+LW RRLPA SISTYSQLR+EF+ Q
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQ

Query:  FSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS R+YD+KTATHLATIRQKE ETL                                   TVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGR-SGKDERADPKSKDNGSFSSG-RAEYRRAENGPTRSRPYERFTPTTKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKE
         + R S K  + D KSKD GS SSG R EYRR+E+GP+RSRPYER                      CWELKRQIEDLIQD YFKKFVGKPR++SVEKKE
Subjt:  GRGR-SGKDERADPKSKDNGSFSSG-RAEYRRAENGPTRSRPYERFTPTTKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKE

Query:  ERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIL
        ERKRSRTPPRR DRPAVINTIFGGPSGGQ  +KRKELA EARR+V IIREQ  TC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANIL
Subjt:  ERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIL

Query:  SLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP--MAWARS
        SLPTYLAL  TRSQLK+SPTPLVGFS ESV PEGCIDLPVT+GQD T+VTQMAEFVV+DGR AYNAIF RPIIHSF+A+PS LHQVLKYSTP  +   R 
Subjt:  SLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP--MAWARS

Query:  EENRPLRGNAMPPHSKAHRSAPSKLPGMGRSSSRPTCRGRSLPHPPRSSSLFLCLVPRSR
        E+       A    S   RS+   L         P     S PH  RSSSLF CL  +++
Subjt:  EENRPLRGNAMPPHSKAHRSAPSKLPGMGRSSSRPTCRGRSLPHPPRSSSLFLCLVPRSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAAGGGCAAGGTCACGA
CGGCCTACCAACGGAACCCCTCCACAGGTCGGCACGGATCACCGCGCCCGCCCTACCGCCCGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGAGGTCCAGCCTCGGCTCCAACAAGTGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATTCGAACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAACTGCAGGCGCAGGGTCTCGATCAGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCAATTGTGGTACCGGAGACTGCCAGCCGGGTCGATCTCGACCTACTCTCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGTACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGCGAGACGCTGCGGGAGTATGTCACCAGATTCCAAGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCTTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAGGACAATGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGGCCTTACGAGCGCTTCACCCCAACCACCAAGGA
CAAGTATTGCCGCTTCCATCGGGAGCATGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGG
GAAAGCCCAGGACCAGCTCAGTAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCA
AGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGAAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGTCGACCTGCCCAATCACCTTCGACGGTGC
AGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACA
TCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGC
ATCGACTTGCCAGTCACGCTGGGGCAGGACAAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCAT
CATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCAATGGCGTGGGCACGATCCGAGGAGAACAGACCGCTTCGAGGGAATGCTA
TGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCCCAGGGATGGGACGCTCGAGCTCAAGGCCGACCTGCCGAGGAAGAAGTTTGCCGCACCCACCGAGG
AGCTCGAGCTTGTTCCTCTGCTTAGTTCCGAGAAGCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCCATCTTAGAGCCAGAACTGATGGAGATCGGCGCTCCAGA
ATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAGGACCCCAAGGCGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGAGGGTCCAAACGCATG
TGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGG
AACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAAGGGCAAGGTCACGA
CGGCCTACCAACGGAACCCCTCCACAGGTCGGCACGGATCACCGCGCCCGCCCTACCGCCCGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGAGGTCCAGCCTCGGCTCCAACAAGTGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATTCGAACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAACTGCAGGCGCAGGGTCTCGATCAGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCAATTGTGGTACCGGAGACTGCCAGCCGGGTCGATCTCGACCTACTCTCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGTACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGCGAGACGCTGCGGGAGTATGTCACCAGATTCCAAGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCTTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAGGACAATGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGGCCTTACGAGCGCTTCACCCCAACCACCAAGGA
CAAGTATTGCCGCTTCCATCGGGAGCATGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGG
GAAAGCCCAGGACCAGCTCAGTAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCA
AGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGAAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGTCGACCTGCCCAATCACCTTCGACGGTGC
AGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACA
TCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGC
ATCGACTTGCCAGTCACGCTGGGGCAGGACAAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCAT
CATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCAATGGCGTGGGCACGATCCGAGGAGAACAGACCGCTTCGAGGGAATGCTA
TGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCCCAGGGATGGGACGCTCGAGCTCAAGGCCGACCTGCCGAGGAAGAAGTTTGCCGCACCCACCGAGG
AGCTCGAGCTTGTTCCTCTGCTTAGTTCCGAGAAGCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCCATCTTAGAGCCAGAACTGATGGAGATCGGCGCTCCAGA
ATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAGGACCCCAAGGCGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGAGGGTCCAAACGCATG
TGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGG
AACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPLHRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPASAPTSENFDALQREMEAIRTQMRSMEAMY
NEMVLTAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPAGSISTYSQLRREFLAQFSS
RYYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADP
KSKDNGSFSSGRAEYRRAENGPTRSRPYERFTPTTKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKKEERKRSRTPPRRTDRPAVINTIFGGP
SGGQSGHKRKELAREARREVCIIREQGSTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGC
IDLPVTLGQDKTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPMAWARSEENRPLRGNAMPPHSKAHRSAPSKLPGMGRSSSRPTCRGRSLPHPPR
SSSLFLCLVPRSRSVPVEILDNPSILEPELMEIGAPESSWMDPIADFIRGNSPQDPKARRKLARRAARRVQTHVGALDPAWEGPFEIKGIVRPGTYILADLKGDVLAHPW
NAEHLKRYYP