; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g21560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g21560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:15691063..15696654
RNA-Seq ExpressionMoc04g21560
SyntenyMoc04g21560
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.8e-25287.12Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEG LND DLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  TIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
         IKCRAF+IALT SARLWYRRLPA SISTYSQLRREFLA FSSR+YDKKTA HLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  TIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSSSRAE--------------------------------
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSG DIE ADPKSKDKGSFSS RAE                                
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSSSRAE--------------------------------

Query:  ---MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
           MEKLLKRPEKLRGAPERR+KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ---MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFFGE
        QSG KRKELAR ARREVCIIREQRPTCPITFD  DLEEVHLPHNDAL+IAPLIDHVVV RVL+DGG SANILSLPTYLALGWTRSQL KSPTPLVGF GE
Subjt:  QSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFFGE

Query:  SVVPEGCIDLPVTLGQDQTQVTQMAEFV
        SV+PEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVVPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.2e-25373.68Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEG LND DLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASD IKCRAFQIALT SARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSSSRAE----------------------------
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSG D EKAD KSKDKGSFSS RAE                            
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSSSRAE----------------------------

Query:  -------MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG
               MEKLLKRPEKLRGAPERRNKDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGG
Subjt:  -------MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVG
        PSGGQSGHKRKELAR ARREVCIIREQRPTCPITFDS DLEEVHLPHNDAL+IAPLIDHVVVRRVL+D G SANI+SL TYLALGWTRSQL KS TPLVG
Subjt:  PSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        F  ESV+PEGCIDLPVTLG DQTQVTQMAEFVVIDGRS YNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDETLEFEADLPRREFAAPTEELKLVPLLSPEKQTDLARPVPVEILDNPSISEPDLMEIGAPE
          RD TLEF+A+LPRREFAAPTEEL+LVPLL  +   ++     ++   + +  + D+   G PE
Subjt:  AGRDETLEFEADLPRREFAAPTEELKLVPLLSPEKQTDLARPVPVEILDNPSISEPDLMEIGAPE

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.7e-25663.89Show/hide
Query:  MVQPANSTNTADRRTLAANDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAAN  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAANDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGHSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGHSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDTIKCRAFQIA
        P GVITR EFDQL+ K DAQVEALKA+CE+KE S +D DLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D IKC AFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDTIKCRAFQIA

Query:  LTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LT SARLWYRRLPAR ISTYSQLR+EF++QFSSR+YD+KT  HLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKG-SFSSSRAE-----------------------------------MEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+G D  KAD KS+DKG S SSSR +                                   MEKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKG-SFSSSRAE-----------------------------------MEKLLK

Query:  RPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+RN DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFFGESVVPEGCID
        AR ARREVCIIREQRPT  I F+  DLE VHLPHNDAL+IAPLID V+VRR+L+DGGASANILSL TYLALGWTRSQL KSPTPLVGF GES+  EGCID
Subjt:  ARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFFGESVVPEGCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDE
        LPV++ QD TQVTQMAEFVVIDGRS YNAIFGRPIIHSFRA+PSTLHQ+LKYST NGVGTVRGE   SRECYAS  K SSVCALE    RDE
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDE

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]1.8e-20765.31Show/hide
Query:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSL---RKGHSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DLR+HL  K+  +        S SR   +SN +A+S + P  P  VI R EFD ++ + D QVEALKA+CE+
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSL---RKGHSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQ

Query:  KEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQ
        KE   +DDDLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+D IKC AFQIALT SARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS R+YD+KTA HLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGNDIEKADPKSKDKGSFSS-SRAEMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEER
         + R      K D KSKDKGS SS SR E  +    P + R  P  R                  CWELKRQIEDLIQD YFKKFVGKPR++S EKKEER
Subjt:  GRGRSGNDIEKADPKSKDKGSFSS-SRAEMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEER

Query:  KRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSL
        KRSRTPPRR DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ+PTC ITF  TDLE VHLPHNDAL+IAPLIDHV+VRRVL+DGGASANILSL
Subjt:  KRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSL

Query:  PTYLALGWTRSQLTKSPTPLVGFFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQ
        PTYLAL  TRSQL KSPTPLVGF  ESV PEGCIDLPVT+GQD TQVTQMAEFVVIDGR  YNAIF RPIIHSF+A+PS LHQ+LKYSTPNGVGTVRGEQ
Subjt:  PTYLALGWTRSQLTKSPTPLVGFFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQ

Query:  TASRECYASALKGSSVCALETLAGRDETLEFEADLPR
          SRECYASALK SSVCALE    +D       DLPR
Subjt:  TASRECYASALKGSSVCALETLAGRDETLEFEADLPR

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]3.6e-20070.33Show/hide
Query:  MDFQAASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+D IKCRAFQIALT SARLWYRRLPARSISTYSQLR+EF++QFSS +YD+KTA HLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSS-SRAE-----------------------
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +   +  KAD KS+DKGS SS SR E                       
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSS-SRAE-----------------------

Query:  ------------MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
                    MEKLLKRPEKLRG  E+RNK+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ------------MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDAL+IA LIDH +VRRVLIDG                          
Subjt:  TIFGGPSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGRS YNAIFGRPIIHSFRA+PSTLHQ+LKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDETLEFEADLP---RREFAAPTEELKLVPLLSPEKQTD
        ALE    R +  E EADLP   +R+F  PTEEL+LVPLLSPE+Q +
Subjt:  ALETLAGRDETLEFEADLP---RREFAAPTEELKLVPLLSPEKQTD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088138.9e-25387.12Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEG LND DLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  TIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
         IKCRAF+IALT SARLWYRRLPA SISTYSQLRREFLA FSSR+YDKKTA HLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  TIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSSSRAE--------------------------------
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSG DIE ADPKSKDKGSFSS RAE                                
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSSSRAE--------------------------------

Query:  ---MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
           MEKLLKRPEKLRGAPERR+KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ---MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFFGE
        QSG KRKELAR ARREVCIIREQRPTCPITFD  DLEEVHLPHNDAL+IAPLIDHVVV RVL+DGG SANILSLPTYLALGWTRSQL KSPTPLVGF GE
Subjt:  QSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFFGE

Query:  SVVPEGCIDLPVTLGQDQTQVTQMAEFV
        SV+PEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVVPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.1e-25373.68Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEG LND DLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASD IKCRAFQIALT SARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSSSRAE----------------------------
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSG D EKAD KSKDKGSFSS RAE                            
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSSSRAE----------------------------

Query:  -------MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG
               MEKLLKRPEKLRGAPERRNKDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGG
Subjt:  -------MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVG
        PSGGQSGHKRKELAR ARREVCIIREQRPTCPITFDS DLEEVHLPHNDAL+IAPLIDHVVVRRVL+D G SANI+SL TYLALGWTRSQL KS TPLVG
Subjt:  PSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        F  ESV+PEGCIDLPVTLG DQTQVTQMAEFVVIDGRS YNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDETLEFEADLPRREFAAPTEELKLVPLLSPEKQTDLARPVPVEILDNPSISEPDLMEIGAPE
          RD TLEF+A+LPRREFAAPTEEL+LVPLL  +   ++     ++   + +  + D+   G PE
Subjt:  AGRDETLEFEADLPRREFAAPTEELKLVPLLSPEKQTDLARPVPVEILDNPSISEPDLMEIGAPE

A0A6J1DHB3 uncharacterized protein LOC1110204791.3e-25663.89Show/hide
Query:  MVQPANSTNTADRRTLAANDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAAN  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAANDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGHSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGHSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDTIKCRAFQIA
        P GVITR EFDQL+ K DAQVEALKA+CE+KE S +D DLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D IKC AFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDTIKCRAFQIA

Query:  LTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LT SARLWYRRLPAR ISTYSQLR+EF++QFSSR+YD+KT  HLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKG-SFSSSRAE-----------------------------------MEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+G D  KAD KS+DKG S SSSR +                                   MEKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKG-SFSSSRAE-----------------------------------MEKLLK

Query:  RPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+RN DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFFGESVVPEGCID
        AR ARREVCIIREQRPT  I F+  DLE VHLPHNDAL+IAPLID V+VRR+L+DGGASANILSL TYLALGWTRSQL KSPTPLVGF GES+  EGCID
Subjt:  ARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFFGESVVPEGCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDE
        LPV++ QD TQVTQMAEFVVIDGRS YNAIFGRPIIHSFRA+PSTLHQ+LKYST NGVGTVRGE   SRECYAS  K SSVCALE    RDE
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDE

A0A6J1DPC9 uncharacterized protein LOC1110222808.7e-20865.31Show/hide
Query:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSL---RKGHSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DLR+HL  K+  +        S SR   +SN +A+S + P  P  VI R EFD ++ + D QVEALKA+CE+
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSL---RKGHSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQ

Query:  KEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQ
        KE   +DDDLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+D IKC AFQIALT SARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS R+YD+KTA HLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGNDIEKADPKSKDKGSFSS-SRAEMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEER
         + R      K D KSKDKGS SS SR E  +    P + R  P  R                  CWELKRQIEDLIQD YFKKFVGKPR++S EKKEER
Subjt:  GRGRSGNDIEKADPKSKDKGSFSS-SRAEMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEER

Query:  KRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSL
        KRSRTPPRR DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ+PTC ITF  TDLE VHLPHNDAL+IAPLIDHV+VRRVL+DGGASANILSL
Subjt:  KRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSL

Query:  PTYLALGWTRSQLTKSPTPLVGFFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQ
        PTYLAL  TRSQL KSPTPLVGF  ESV PEGCIDLPVT+GQD TQVTQMAEFVVIDGR  YNAIF RPIIHSF+A+PS LHQ+LKYSTPNGVGTVRGEQ
Subjt:  PTYLALGWTRSQLTKSPTPLVGFFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQ

Query:  TASRECYASALKGSSVCALETLAGRDETLEFEADLPR
          SRECYASALK SSVCALE    +D       DLPR
Subjt:  TASRECYASALKGSSVCALETLAGRDETLEFEADLPR

A0A6J1DZB9 uncharacterized protein LOC1110249041.8e-20070.33Show/hide
Query:  MDFQAASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+D IKCRAFQIALT SARLWYRRLPARSISTYSQLR+EF++QFSS +YD+KTA HLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSS-SRAE-----------------------
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +   +  KAD KS+DKGS SS SR E                       
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIEKADPKSKDKGSFSS-SRAE-----------------------

Query:  ------------MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
                    MEKLLKRPEKLRG  E+RNK+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ------------MEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDAL+IA LIDH +VRRVLIDG                          
Subjt:  TIFGGPSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGRS YNAIFGRPIIHSFRA+PSTLHQ+LKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFFGESVVPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDETLEFEADLP---RREFAAPTEELKLVPLLSPEKQTD
        ALE    R +  E EADLP   +R+F  PTEEL+LVPLLSPE+Q +
Subjt:  ALETLAGRDETLEFEADLP---RREFAAPTEELKLVPLLSPEKQTD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAACGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACATTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGTAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGACGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACACAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTAGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGAATTATGACAAAAAGACAGCGATCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATT
GAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTTCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAACGATATAGAA
AAGGCAGATCCCAAGTCTAAGGACAAGGGATCCTTTTCCAGCAGCCGAGCTGAAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAAAGGCGCAA
CAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAAT
TTGTGGGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGACGCACTGATCGACCTGCGGTCATCAATACCATTTTCGGA
GGGCCCAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGCTAGCTCGTGTAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGA
CAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTATGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATCTG
CTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCACAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTTTGGAGAATCGGTCGTCCCAGAG
GGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGACCTATAACGCCATCTTTGGGAG
ACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACTGCTTCAAGGG
AGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGAGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCC
GCACCCACTGAGGAGCTCAAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAACCGACCTGGCCAGGCCGGTCCCCGTCGAGATCTTAGATAATCCATCGATCTCAGAGCC
AGATCTGATGGAGATCGGTGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGAC
GAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCT
GGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAACGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACATTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGTAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGACGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACACAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTAGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGAATTATGACAAAAAGACAGCGATCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATT
GAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTTCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAACGATATAGAA
AAGGCAGATCCCAAGTCTAAGGACAAGGGATCCTTTTCCAGCAGCCGAGCTGAAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAAAGGCGCAA
CAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAAT
TTGTGGGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGACGCACTGATCGACCTGCGGTCATCAATACCATTTTCGGA
GGGCCCAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGCTAGCTCGTGTAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGA
CAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACTTATGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATCTG
CTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCACAATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTTTGGAGAATCGGTCGTCCCAGAG
GGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGACCTATAACGCCATCTTTGGGAG
ACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACTGCTTCAAGGG
AGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGAGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCC
GCACCCACTGAGGAGCTCAAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAACCGACCTGGCCAGGCCGGTCCCCGTCGAGATCTTAGATAATCCATCGATCTCAGAGCC
AGATCTGATGGAGATCGGTGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGAC
GAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCT
GGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAANDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMRTQMRSMEEMY
NEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGHSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQ
VEALKAKCEQKEGSLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDTIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQ
FSSRNYDKKTAIHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGNDIE
KADPKSKDKGSFSSSRAEMEKLLKRPEKLRGAPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFG
GPSGGQSGHKRKELARVARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALMIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFFGESVVPE
GCIDLPVTLGQDQTQVTQMAEFVVIDGRSTYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDETLEFEADLPRREFA
APTEELKLVPLLSPEKQTDLARPVPVEILDNPSISEPDLMEIGAPESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRGGALRVQTHVGALDPTWEGPFEVKGIVRP
GTYILADLKGDVLAHPWNAEHLKRYYP