CuGenDBv2

Moc04g14700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g14700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:11225935..11230242
RNA-Seq Expression: Moc04g14700
Synteny: Moc04g14700
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
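
The genome location in the record above uses a `chromosome:start..end` string. A minimal sketch of splitting it into its parts and computing the span follows. The function name `parse_location` is hypothetical, and the span assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re

# Pattern for strings such as "chr4:11225935..11230242".
_LOCATION_RE = re.compile(r"(\w+):(\d+)\.\.(\d+)")


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Return (chromosome, start, end, span) for a 'chrom:start..end' string.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. Adjust if the database uses another convention.
    """
    match = _LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr4:11225935..11230242"))
```

Under the inclusive-coordinate assumption, the location of Moc04g14700 spans 4308 bp.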


Homology
GenBank top hits (e value, % identity, alignment):
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 2.6e-253; % identity: 88.07)
Query:  QAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKE  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILMNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRG+SGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL NIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILMNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNT                                +K   +      RRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGGPSGG

Query:  QSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG K KELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG S NILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCFDLPVTLGQDQTQVTQMAEFV
        SVIPEG  DLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCFDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 4.0e-254; % identity: 77.34)
Query:  SSNQQAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKE  LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RG+SGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  MNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGG
         NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREHDHNT                                +K   +      RR DRPAVINTIFGG
Subjt:  MNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGG

Query:  PSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSGHK KELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G S NI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGC DLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  ASRDGTLEFEADLLRREFAAPTEELELVPLL
         SRDGTLEF+A+L RREFAAPTEELELVPLL
Subjt:  ASRDGTLEFEADLLRREFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] (e value: 5.3e-198; % identity: 82.77)
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+G+SGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRP
        TTIPISEIL NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNT                                +K   +      RRTDRP
Subjt:  TTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRP

Query:  AVINTIFGGPSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHK K+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGAS NILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGC DLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQGQ
        +SVCALETL SRDGTLEFEADL  REFAAP EELELVPLLS EKQ Q
Subjt:  SSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQGQ

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 9.6e-248; % identity: 62.75)
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITTPVLPPAHPPRTSKATHGRGGTSKKGARGPAPAPISENLDALQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARITTPVLPPAHP                                        
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITTPVLPPAHPPRTSKATHGRGGTSKKGARGPAPAPISENLDALQREMEAM

Query:  RTKMWSMEEMYNEMILAAGVGSRSENRVTRADIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQAESSSNPA
                                                                                         PS+        AESS NP 
Subjt:  RTKMWSMEEMYNEMILAAGVGSRSENRVTRADIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQAESSSNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQVEALKA+CE+KE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAP
        ALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH SDDSAMCYFLTGLADE LTVKL EEAP
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLL
        ATFAEVLQK KKVIDGQELLRTKTGRPE+ I +G++GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EIL NIEE+GMEKLL
Subjt:  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLL

Query:  KRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGGPSGGQSGHKIKE
        KRPEKLRG PE+R+ DKYCRFHR+H HNT                                +K   +      RR DRPAVIN             K KE
Subjt:  KRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGGPSGGQSGHKIKE

Query:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCF
        LAR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGAS NILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGC 
Subjt:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCF

Query:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD
        DLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia] (e value: 2.1e-194; % identity: 62.96)
Query:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLHEHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DL +HL  K+  +  + +   S SR   +SN +A+S   P  P  VI REEFD ++ + D QVEALKA+CE+
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLHEHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQ

Query:  KEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KE   +D DLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGKSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTEK
         + +  +   K D KSKDKGS SSG R EYRR+E+GP+RSRPYER       I ++   I++S  +K + +P     + E++ + K  R           
Subjt:  GRGKSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTEK

Query:  GRAEAFEDAARRTDRPAVINTIFGGPSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNI
                  RR DRPAVINTIFGGPSGGQ  +K KELA  ARR+V IIREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGAS NI
Subjt:  GRAEAFEDAARRTDRPAVINTIFGGPSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNI

Query:  LSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVR
        LSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEGC DLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYSTPNGVGTVR
Subjt:  LSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVR

Query:  GEQTASRECYASALKGSSVCALETLASRD
        GEQ  SRECYASALK SSVCALE   S+D
Subjt:  GEQTASRECYASALKGSSVCALETLASRD

TrEMBL top hits (e value, % identity, alignment):
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 1.3e-253; % identity: 88.07)
Query:  QAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKE  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILMNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRG+SGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL NIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILMNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNT                                +K   +      RRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGGPSGG

Query:  QSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG K KELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG S NILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCFDLPVTLGQDQTQVTQMAEFV
        SVIPEG  DLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCFDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 1.9e-254; % identity: 77.34)
Query:  SSNQQAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCEQKE  LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RG+SGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  MNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGG
         NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREHDHNT                                +K   +      RR DRPAVINTIFGG
Subjt:  MNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGG

Query:  PSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSGHK KELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G S NI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGC DLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  ASRDGTLEFEADLLRREFAAPTEELELVPLL
         SRDGTLEF+A+L RREFAAPTEELELVPLL
Subjt:  ASRDGTLEFEADLLRREFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC111019899 (e value: 2.6e-198; % identity: 82.77)
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+G+SGKD+E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRP
        TTIPISEIL NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREH HNT                                +K   +      RRTDRP
Subjt:  TTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRP

Query:  AVINTIFGGPSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHK K+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGAS NILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGC DLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQGQ
        +SVCALETL SRDGTLEFEADL  REFAAP EELELVPLLS EKQ Q
Subjt:  SSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQGQ

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 4.6e-248; % identity: 62.75)
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITTPVLPPAHPPRTSKATHGRGGTSKKGARGPAPAPISENLDALQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARITTPVLPPAHP                                        
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITTPVLPPAHPPRTSKATHGRGGTSKKGARGPAPAPISENLDALQREMEAM

Query:  RTKMWSMEEMYNEMILAAGVGSRSENRVTRADIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQAESSSNPA
                                                                                         PS+        AESS NP 
Subjt:  RTKMWSMEEMYNEMILAAGVGSRSENRVTRADIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQAESSSNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
        TP GVITREEFDQL+ + DAQVEALKA+CE+KE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQI
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAP
        ALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAH SDDSAMCYFLTGLADE LTVKL EEAP
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLL
        ATFAEVLQK KKVIDGQELLRTKTGRPE+ I +G++GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EIL NIEE+GMEKLL
Subjt:  ATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLL

Query:  KRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGGPSGGQSGHKIKE
        KRPEKLRG PE+R+ DKYCRFHR+H HNT                                +K   +      RR DRPAVIN             K KE
Subjt:  KRPEKLRGAPERRSKDKYCRFHREHDHNT--------------------------------EKGRAEAFEDAARRTDRPAVINTIFGGPSGGQSGHKIKE

Query:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCF
        LAR ARREVCIIREQRPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGAS NILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGC 
Subjt:  LARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCF

Query:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD
        DLPV++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  DLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRD

A0A6J1DPC9 uncharacterized protein LOC111022280 (e value: 1.0e-194; % identity: 62.96)
Query:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLHEHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DL +HL  K+  +  + +   S SR   +SN +A+S   P  P  VI REEFD ++ + D QVEALKA+CE+
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLHEHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSSNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQ

Query:  KEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KE   +D DLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGKSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTEK
         + +  +   K D KSKDKGS SSG R EYRR+E+GP+RSRPYER       I ++   I++S  +K + +P     + E++ + K  R           
Subjt:  GRGKSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTEK

Query:  GRAEAFEDAARRTDRPAVINTIFGGPSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNI
                  RR DRPAVINTIFGGPSGGQ  +K KELA  ARR+V IIREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+VRRVLVDGGAS NI
Subjt:  GRAEAFEDAARRTDRPAVINTIFGGPSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNI

Query:  LSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVR
        LSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEGC DLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYSTPNGVGTVR
Subjt:  LSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVR

Query:  GEQTASRECYASALKGSSVCALETLASRD
        GEQ  SRECYASALK SSVCALE   S+D
Subjt:  GEQTASRECYASALKGSSVCALETLASRD

SwissProt top hits (e value, % identity, alignment):
No hits found
Arabidopsis top hits (e value, % identity, alignment):
No hits found
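
The alignments above appear to follow the usual BLAST layout, in which the unlabeled middle line carries a residue letter where query and subject are identical, a `+` for a conservative substitution, and a blank for a mismatch or gap. Assuming that convention, the sketch below counts identical columns in one alignment block. The function name `midline_identity` is hypothetical, and the value for a single block will generally differ from the whole-alignment % identity reported by the page.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns marked identical on a BLAST-style midline.

    Assumption: a letter on the midline means an identical residue, while
    '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    """
    if not query:
        raise ValueError("query line is empty")
    # Trailing spaces on the midline are often stripped; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    # Final block of the first GenBank hit (28 columns, 26 identical).
    query = "SVIPEGCFDLPVTLGQDQTQVTQMAEFV"
    midline = "SVIPEG  DLPVTLGQDQTQVTQMAEFV"
    print(round(midline_identity(query, midline), 2))
```

To approximate the identity of a full hit, the identical-column counts and block lengths would need to be summed across all blocks before dividing.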

Sequences
CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACAGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCACGCCTGTTCTACCGCCTGCGCACCCCCCAAGGACATCCAAGGCCACCCATGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGATAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGTGGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGTGTAGGGTCCCGATCTGAGAACCGAGTGACGCGCGCTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAAGA
GCATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCATGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCAACCAGCAAGCTGAATCCTCTAGCAATCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGTTTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCAGACGTTTTGGAAGCACCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCAGCATCAGACGCAATCA
AATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGCGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAAGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCA
ATTGAAGGTCGCACACTACTCCGATGACTCGGCCATGTGCTATTTTCTCACTGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCG
CCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGAGGCAAAAGTGGAAAAGATATA
GAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCAC
CCCGACCACGATTCCAATTTCCGAGATCCTAATGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCA
AGGACAAGTATTGCCGCTTCCATCGGGAGCACGACCATAACACAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCGGCGCACTGACCGACCTGCGGTCATCAAT
ACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAATAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCC
AATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCTTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACG
GAGGCGCATCTGTTAACATCCTGTCCTTACCGACCTACCTCGCCCTAGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCG
GTCATCCCAGAGGGTTGCTTCGACTTGCCGGTCACGCTTGGGCAAGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGC
CATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGAGTGGGCACGGTCCGAGGAGAACAGA
CCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCTGAGG
AGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAGGTTGGCAAGGC
AAGCAGCTCGGTTCATGGTCCGAAGTGGAGCGTTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAAATGCCTAACCCCTGA
mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACAGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCACGCCTGTTCTACCGCCTGCGCACCCCCCAAGGACATCCAAGGCCACCCATGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGATAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGTGGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGTGTAGGGTCCCGATCTGAGAACCGAGTGACGCGCGCTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAAGA
GCATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCATGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCAACCAGCAAGCTGAATCCTCTAGCAATCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGTTTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCAGACGTTTTGGAAGCACCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCAGCATCAGACGCAATCA
AATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGCGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAAGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCA
ATTGAAGGTCGCACACTACTCCGATGACTCGGCCATGTGCTATTTTCTCACTGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCG
CCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGAGGCAAAAGTGGAAAAGATATA
GAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCAC
CCCGACCACGATTCCAATTTCCGAGATCCTAATGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCA
AGGACAAGTATTGCCGCTTCCATCGGGAGCACGACCATAACACAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCGGCGCACTGACCGACCTGCGGTCATCAAT
ACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAATAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCC
AATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCTTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACG
GAGGCGCATCTGTTAACATCCTGTCCTTACCGACCTACCTCGCCCTAGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCG
GTCATCCCAGAGGGTTGCTTCGACTTGCCGGTCACGCTTGGGCAAGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGC
CATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGAGTGGGCACGGTCCGAGGAGAACAGA
CCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCTGAGG
AGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAGGTTGGCAAGGC
AAGCAGCTCGGTTCATGGTCCGAAGTGGAGCGTTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAAATGCCTAACCCCTGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITTPVLPPAHPPRTSKATHGRGGTSKKGARGPAPAPISENLDALQREMEAMRTKMWSMEEM
YNEMILAAGVGSRSENRVTRADIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLHEHLNRKRGSSLRKGQSPSRSHRSSNQQAESSSNPATPAGVITREEFDQLRGQLDA
QVEALKAKCEQKEVSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLA
QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHYSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGKSGKDI
EKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHDHNTEKGRAEAFEDAARRTDRPAVIN
TIFGGPSGGQSGHKIKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASVNILSLPTYLALGWTRSQLKKSPTPLVGFSGES
VIPEGCFDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLASRDGTLEFEADLLR
REFAAPTEELELVPLLSPEKQGQFTTRPQGAQKVGKASSSVHGPKWSVVPTRLFPASIEMPNP
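
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that uses the standard genetic code and only the Python standard library. The names `CODON_TABLE` and `translate` are illustrative, and the code assumes an unambiguous A/C/G/T sequence that starts in frame.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    lines can be pasted in as a single multi-line string.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 39 nucleotides of the CDS listed above.
    print(translate("ATGGTTCAACCCGCAAACTCGACCAATACAGCAGATCGA"))
```

The first 39 nucleotides of the CDS translate to `MVQPANSTNTADR`, which matches the start of the protein sequence, and the final codons `CCT AAC CCC TGA` give `PNP` followed by a stop, which matches its end.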