; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:902513..908778
RNA-Seq ExpressionMoc01g01200
SyntenyMoc01g01200
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.4e-24186.27Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS
        VITR EFDQLRG+LDAQVEALKAKCEQKEGPLN+GDLGESPFTSDVLEAPIP KFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALT 
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTA-AELSIERRRTDLPGADLTNAS----------------------------------TTIPIFEILTNIEESGMEKLLKR
        EVLQKAKKVIDGQELLRTKT   E  I R R+   G D+ NA                                   TTIPI EILTNIEESGMEKLLKR
Subjt:  EVLQKAKKVIDGQELLRTKTA-AELSIERRRTDLPGADLTNAS----------------------------------TTIPIFEILTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRGAPERRSKDKYCRFHREH HNTSD WELKRQIE+LIQD YFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDL
        RAARREVCIIREQRPTCPITFDGADLE VHLPHNDALVIAPLIDH+VV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG IDL
Subjt:  RAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTQVTQMAEF
        PVTLGQDQTQVTQMAEF
Subjt:  PVTLGQDQTQVTQMAEF

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.2e-24476.67Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS
        VITR EFDQLRGKL+AQVEALKAKCEQKEGPLN+GDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT 
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLW                                                     F E+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFA
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTA-AELSIERRRT------DLPGADLTNAS------------------------TTIPIFEILTNIEESGMEKLLKRPEKL
        EVLQKAKKVIDGQELLRTKT   E  I+R R+      DL   D  + S                        TTIPI EILTNIEESGMEKLLKRPEKL
Subjt:  EVLQKAKKVIDGQELLRTKTA-AELSIERRRT------DLPGADLTNAS------------------------TTIPIFEILTNIEESGMEKLLKRPEKL

Query:  RGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR
        RGAPERR+KDKYCRFHREHDHNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAAR
Subjt:  RGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR

Query:  REVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTL
        REVCIIREQRPTCPITFD ADLE VHLPHNDALVIAPLIDH+VVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTL
Subjt:  REVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTL

Query:  GQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDETLEFEADLPRREF
        G DQTQVTQMAEF VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL SRD TLEF+A+LPRREF
Subjt:  GQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDETLEFEADLPRREF

Query:  TAPTEELELVPLL
         APTEELELVPLL
Subjt:  TAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.2e-25969.76Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRLARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSAGSRSENRMTRIDI
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL R ARIT PVLPPAHP+ SKA                  P + G            
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRLARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSAGSRSENRMTRIDI

Query:  REQRGSHLGPVEEEHPEDNESEGHTRRKGDLRVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKD
                                        VITR EFDQL+ K DAQVEALKA+CE+KE   ++GDLGE  F+SD+LEA IP KFK PT+KPYDGSKD
Subjt:  REQRGSHLGPVEEEHPEDNESEGHTRRKGDLRVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKD

Query:  PKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAH
        PKDYVEVFE LMDFQAA+DAIKC AFQIALT SARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH
Subjt:  PKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAH

Query:  CSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTA-AELSIERRRT--DLPGAD------------------LTNAS-----
        CSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKT   E +I++ R   D   AD                   +N+S     
Subjt:  CSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTA-AELSIERRRT--DLPGAD------------------LTNAS-----

Query:  -------TTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTP
               TTIPIFEILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+H HNTS+ WELKRQIEDLIQD YFKKFVGKPR++S EKKEERKR RTP
Subjt:  -------TTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTP

Query:  PRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLAL
        PRR DRPAVIN             K+KELAR ARREVCIIREQRPT  I F+ ADLEGVHLPHNDALVIAPLID ++VRR+LVDGGASANILSL TYLAL
Subjt:  PRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLAL

Query:  GWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASREC
        GWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++ QD TQVTQMAEF VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SREC
Subjt:  GWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASREC

Query:  YASALKGSSVCALETLASRDE
        YAS  K SSVCALE    RDE
Subjt:  YASALKGSSVCALETLASRDE

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]1.4e-19968.29Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS
        VI R EFD ++ + D QVEALKA+CE+KE P ++ DLGESPFTSD++EAPIP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQIALT 
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLW RRLPARSISTYSQLR+EF+ QFS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAPATFA
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKT--------AAELSIERRRTDLPGADLTNASTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT
        EVLQ AKKVIDGQELLRTKT           LS ++R+ D    D  ++S              SG     +R E   G    R  ++            
Subjt:  EVLQKAKKVIDGQELLRTKT--------AAELSIERRRTDLPGADLTNASTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT

Query:  SDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLE
          CWELKRQIEDLIQD YFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ+PTC ITF   DLE
Subjt:  SDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLE

Query:  GVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYN
        GVHLPHNDALVIAPLIDH++VRRVLVDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEGCIDLPVT+GQD TQVTQMAEF VIDGR AYN
Subjt:  GVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYN

Query:  AIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDETLEFEADLPR
        AIF RPIIHSF+A+PS LHQVLKYSTPNGVGTVRGEQ  SRECYASALK SSVCALE   S+D       DLPR
Subjt:  AIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDETLEFEADLPR

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]8.1e-20371.51Show/hide
Query:  MDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALT SARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRF EEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTA--------AELSIERRRTDLPGADLTNA-------------------------STTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKT          +LS E+R+ D    D  ++                         S+TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTA--------AELSIERRRTDLPGADLTNA-------------------------STTIP

Query:  IFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
        I EILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+H HNT+ CWELKRQIEDLIQD YFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  IFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF  ADLEGVHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEF VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLASRDETLEFEADLP---RREFTAPTEELELVPLLSPEKQ
        ALE   +R +  E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLASRDETLEFEADLP---RREFTAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.1e-24186.27Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS
        VITR EFDQLRG+LDAQVEALKAKCEQKEGPLN+GDLGESPFTSDVLEAPIP KFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALT 
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTA-AELSIERRRTDLPGADLTNAS----------------------------------TTIPIFEILTNIEESGMEKLLKR
        EVLQKAKKVIDGQELLRTKT   E  I R R+   G D+ NA                                   TTIPI EILTNIEESGMEKLLKR
Subjt:  EVLQKAKKVIDGQELLRTKTA-AELSIERRRTDLPGADLTNAS----------------------------------TTIPIFEILTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRGAPERRSKDKYCRFHREH HNTSD WELKRQIE+LIQD YFKKFVGKPRTSS EKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELA
Subjt:  PEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDL
        RAARREVCIIREQRPTCPITFDGADLE VHLPHNDALVIAPLIDH+VV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG IDL
Subjt:  RAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTQVTQMAEF
        PVTLGQDQTQVTQMAEF
Subjt:  PVTLGQDQTQVTQMAEF

A0A6J1D9E1 uncharacterized protein LOC1110188231.6e-24476.67Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS
        VITR EFDQLRGKL+AQVEALKAKCEQKEGPLN+GDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT 
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLW                                                     F E+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFA
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTA-AELSIERRRT------DLPGADLTNAS------------------------TTIPIFEILTNIEESGMEKLLKRPEKL
        EVLQKAKKVIDGQELLRTKT   E  I+R R+      DL   D  + S                        TTIPI EILTNIEESGMEKLLKRPEKL
Subjt:  EVLQKAKKVIDGQELLRTKTA-AELSIERRRT------DLPGADLTNAS------------------------TTIPIFEILTNIEESGMEKLLKRPEKL

Query:  RGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR
        RGAPERR+KDKYCRFHREHDHNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAAR
Subjt:  RGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR

Query:  REVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTL
        REVCIIREQRPTCPITFD ADLE VHLPHNDALVIAPLIDH+VVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTL
Subjt:  REVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTL

Query:  GQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDETLEFEADLPRREF
        G DQTQVTQMAEF VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL SRD TLEF+A+LPRREF
Subjt:  GQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDETLEFEADLPRREF

Query:  TAPTEELELVPLL
         APTEELELVPLL
Subjt:  TAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204795.9e-26069.76Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRLARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSAGSRSENRMTRIDI
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL R ARIT PVLPPAHP+ SKA                  P + G            
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRLARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSAGSRSENRMTRIDI

Query:  REQRGSHLGPVEEEHPEDNESEGHTRRKGDLRVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKD
                                        VITR EFDQL+ K DAQVEALKA+CE+KE   ++GDLGE  F+SD+LEA IP KFK PT+KPYDGSKD
Subjt:  REQRGSHLGPVEEEHPEDNESEGHTRRKGDLRVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKD

Query:  PKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAH
        PKDYVEVFE LMDFQAA+DAIKC AFQIALT SARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH
Subjt:  PKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAH

Query:  CSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTA-AELSIERRRT--DLPGAD------------------LTNAS-----
        CSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKT   E +I++ R   D   AD                   +N+S     
Subjt:  CSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTA-AELSIERRRT--DLPGAD------------------LTNAS-----

Query:  -------TTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTP
               TTIPIFEILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+H HNTS+ WELKRQIEDLIQD YFKKFVGKPR++S EKKEERKR RTP
Subjt:  -------TTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTP

Query:  PRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLAL
        PRR DRPAVIN             K+KELAR ARREVCIIREQRPT  I F+ ADLEGVHLPHNDALVIAPLID ++VRR+LVDGGASANILSL TYLAL
Subjt:  PRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLAL

Query:  GWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASREC
        GWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++ QD TQVTQMAEF VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SREC
Subjt:  GWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASREC

Query:  YASALKGSSVCALETLASRDE
        YAS  K SSVCALE    RDE
Subjt:  YASALKGSSVCALETLASRDE

A0A6J1DPC9 uncharacterized protein LOC1110222806.9e-20068.29Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS
        VI R EFD ++ + D QVEALKA+CE+KE P ++ DLGESPFTSD++EAPIP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQIALT 
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTS

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLW RRLPARSISTYSQLR+EF+ QFS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAPATFA
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKT--------AAELSIERRRTDLPGADLTNASTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT
        EVLQ AKKVIDGQELLRTKT           LS ++R+ D    D  ++S              SG     +R E   G    R  ++            
Subjt:  EVLQKAKKVIDGQELLRTKT--------AAELSIERRRTDLPGADLTNASTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT

Query:  SDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLE
          CWELKRQIEDLIQD YFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ+PTC ITF   DLE
Subjt:  SDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLE

Query:  GVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYN
        GVHLPHNDALVIAPLIDH++VRRVLVDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEGCIDLPVT+GQD TQVTQMAEF VIDGR AYN
Subjt:  GVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYN

Query:  AIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDETLEFEADLPR
        AIF RPIIHSF+A+PS LHQVLKYSTPNGVGTVRGEQ  SRECYASALK SSVCALE   S+D       DLPR
Subjt:  AIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDETLEFEADLPR

A0A6J1DZB9 uncharacterized protein LOC1110249043.9e-20371.51Show/hide
Query:  MDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALT SARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRF EEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTA--------AELSIERRRTDLPGADLTNA-------------------------STTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKT          +LS E+R+ D    D  ++                         S+TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTA--------AELSIERRRTDLPGADLTNA-------------------------STTIP

Query:  IFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN
        I EILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+H HNT+ CWELKRQIEDLIQD YFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  IFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDVYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF  ADLEGVHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEF VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLASRDETLEFEADLP---RREFTAPTEELELVPLLSPEKQ
        ALE   +R +  E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLASRDETLEFEADLP---RREFTAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGACGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTTGGCACGAATTACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCTGGGGTCCAGCCCCGGCTCCAACAAGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCA
GTCGAGGAGGAACATCCCGAAGACAACGAAAGCGAGGGACACACTCGCCGGAAGGGAGACCTCCGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGA
CGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAATAATGGCGACTTGGGGGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCC
CTCTGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCA
ATCAAATGCCGCGCCTTTCAGATCGCGCTTACCAGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCT
CGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTTTGGGAGG
AGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTTTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACC
TTCGCCGAAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGCGGCCGAGCTGAGTATCGAAAGGCGGAGAACGGACCTACCAGG
AGCCGACCTTACGAACGCTTCGACCACGATTCCAATTTTCGAAATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAG
CCCCGGAGAGGCGCAGCAAGGACAAGTATTGTCGCTTCCATCGGGAGCACGACCATAACACGTCGGACTGTTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGAT
GTCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCAT
CAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCT
GCCCAATCACCTTCGACGGAGCAGACTTGGAGGGAGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCACATGGTGGTCAGGAGGGTGCTGGTA
GACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGA
ATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAGACTCAGGTCACCCAAATGGCCGAGTTCGGGGTAATTGACGGTAGATCGGCCTATA
ACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAA
CAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTTAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGAGACGCTCGAGTTCGAGGCCGACCTGCC
GAGGAGGGAGTTTACCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAACTGAGCGAGTCCGACCTACTGTGGACAATCTATGTTGACGGAT
CCTCCAATGAGAAGGGGTGCGGGGCCGGGGTCCTCTTGCTTGGACCAGGAGGCGAGCGATTTGAGTATGCCTTGCGGTCGGTTCCCGTCGAGATCTTAGATAATCCCTCG
ATCTCAGAGCCAGATCTGATGGAGATCGACGCTCCAGAGTCTTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAGACCCAAAGGAGCGCAGAAA
GTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTAGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAATGGCCA
GACACTACAACGTCCACGTTCGACCTCGAACCTTCCAGGTCGGACATCTGGTCCTAAGGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCCGTTT
GAAGTCAAGGGAATAGTCCGACCTGGGACGTACGTGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCATGGAACGCGGAGCACCTGAAGCGTTATTATCCTTTAAA
TGTCGAAATGGTTTTCAATGGATCTGTAAAAATTGTTTCAAAAGAATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGACGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTTGGCACGAATTACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCTGGGGTCCAGCCCCGGCTCCAACAAGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCA
GTCGAGGAGGAACATCCCGAAGACAACGAAAGCGAGGGACACACTCGCCGGAAGGGAGACCTCCGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGA
CGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAATAATGGCGACTTGGGGGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCC
CTCTGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCA
ATCAAATGCCGCGCCTTTCAGATCGCGCTTACCAGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCT
CGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTTTGGGAGG
AGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTTTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACC
TTCGCCGAAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGCGGCCGAGCTGAGTATCGAAAGGCGGAGAACGGACCTACCAGG
AGCCGACCTTACGAACGCTTCGACCACGATTCCAATTTTCGAAATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAG
CCCCGGAGAGGCGCAGCAAGGACAAGTATTGTCGCTTCCATCGGGAGCACGACCATAACACGTCGGACTGTTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGAT
GTCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGACAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCAT
CAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCT
GCCCAATCACCTTCGACGGAGCAGACTTGGAGGGAGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCACATGGTGGTCAGGAGGGTGCTGGTA
GACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGA
ATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAGACTCAGGTCACCCAAATGGCCGAGTTCGGGGTAATTGACGGTAGATCGGCCTATA
ACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAA
CAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTTAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGAGACGCTCGAGTTCGAGGCCGACCTGCC
GAGGAGGGAGTTTACCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAACTGAGCGAGTCCGACCTACTGTGGACAATCTATGTTGACGGAT
CCTCCAATGAGAAGGGGTGCGGGGCCGGGGTCCTCTTGCTTGGACCAGGAGGCGAGCGATTTGAGTATGCCTTGCGGTCGGTTCCCGTCGAGATCTTAGATAATCCCTCG
ATCTCAGAGCCAGATCTGATGGAGATCGACGCTCCAGAGTCTTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAGACCCAAAGGAGCGCAGAAA
GTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTAGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAATGGCCA
GACACTACAACGTCCACGTTCGACCTCGAACCTTCCAGGTCGGACATCTGGTCCTAAGGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCCGTTT
GAAGTCAAGGGAATAGTCCGACCTGGGACGTACGTGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCATGGAACGCGGAGCACCTGAAGCGTTATTATCCTTTAAA
TGTCGAAATGGTTTTCAATGGATCTGTAAAAATTGTTTCAAAAGAATTATGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRLARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPAPAPTSAGSRSENRMTRIDIREQRGSHLGP
VEEEHPEDNESEGHTRRKGDLRVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNNGDLGESPFTSDVLEAPIPLKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDA
IKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFWEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
FAEVLQKAKKVIDGQELLRTKTAAELSIERRRTDLPGADLTNASTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQD
VYFKKFVGKPRTSSTEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLEGVHLPHNDALVIAPLIDHMVVRRVLV
DGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFGVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGE
QTASRECYASALKGSSVCALETLASRDETLEFEADLPRREFTAPTEELELVPLLSPEKQLSESDLLWTIYVDGSSNEKGCGAGVLLLGPGGERFEYALRSVPVEILDNPS
ISEPDLMEIDAPESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRGRALYRRGFSLPLLRCLTPEEGLMARHYNVHVRPRTFQVGHLVLRKVQTHVGALDPTWEGPF
EVKGIVRPGTYVLADLKGDVLAHPWNAEHLKRYYPLNVEMVFNGSVKIVSKEL