CuGenDBv2

Moc03g01400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g01400
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr3:1058452..1064513
RNA-Seq Expression: Moc03g01400
Synteny: Moc03g01400
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain; IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]; e value: 1.3e-240; % identity: 85.23
Query:  QAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASD
        +AESS NP TPAGVITR EFDQLRG+LDAQVE LKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVLLSHRP-------------------------SRRTL
        AIKCRAF+IALTGSARLWYRRLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKE   L  +                           +   L
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVLLSHRP-------------------------SRRTL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKD-EKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPER+IGR RS KD E ADPKSKDKGSFSSGR E+RRA NGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKD-EKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKY RFHREHGHNTSD WELKRQIE+LIQD YFKKFVGKPRTSSAEKKEERK       RTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  +LEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQL KS TPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]; e value: 7.0e-266; % identity: 83.47
Query:  SSNQQAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNP TP GVITR EFDQLRGKL+AQVE LKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDG KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVLLSHRPSRRTLTVKLGEEAPATFAEVLQKAKK
        AASDAIKCRAFQIALTGSARLW++           QL+   +AQ S    D     +  T    E             LTVKLG+EAPATFAEVLQKAKK
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVLLSHRPSRRTLTVKLGEEAPATFAEVLQKAKK

Query:  VIDGQELLRTKTNRPERRIGRDRSRKDEKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRS
        VIDGQELLRTKT RPER I R RS KDEKAD KSKDKGSFSSGR EFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+
Subjt:  VIDGQELLRTKTNRPERRIGRDRSRKDEKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRS

Query:  KDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE
        KDKY RFHREH HNTSD WELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK       R DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE
Subjt:  KDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE

Query:  QRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVT
        QRPTCPITFDS +LEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQL KS TPLVGFS ESVIPEGCIDLPVTLG DQTQVT
Subjt:  QRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVT

Query:  QMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDGTLEFEADLPRREFAAPTEELE
        QMAEFVVIDGRSAY+AIFGRPIIHSFR IPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETLV RDGTLEF+A+LPRREFAAPTEELE
Subjt:  QMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDGTLEFEADLPRREFAAPTEELE

Query:  LVPLL
        LVPLL
Subjt:  LVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]; e value: 1.9e-207; % identity: 88.53
Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKD-EKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNI
        LTVKL EEAPATFAEVLQKAKKVIDGQELLRTK       IG+ RS KD E  DPKSKDKGSFS+GR E+RRA NGPTRSRPYERFTPTTIPISEILTNI
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKD-EKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSG
        EESGMEKLLKRPEKLRGAPERRSKDKY RFHREHGHNTSD WELK QIEDLIQD YFKKFVGKPRTSSAEKKEERK       RTDRPAVINTIFGGPSG
Subjt:  EESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSG

Query:  GQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSG
        GQSGHKRK+LARAARREVCIIREQRPTCPITFD  +L EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL KS TPLVGFSG
Subjt:  GQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSG

Query:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGR
        ESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAY+AIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG+SVCALETL  R
Subjt:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGR

Query:  DGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDL
        DGTLEFEADLP REFAAP EELELVPLLS EKQ  L
Subjt:  DGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]; e value: 1.5e-223; % identity: 71.88
Query:  QAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASD
        +AESS+NP TP GVITR EFDQL+ K DAQVE LKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG KDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVL---LSHRPSRR----------------------TL
        AIKC AFQIALTGSARLWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKE   L   ++  P  +                      TL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVL---LSHRPSRR----------------------TL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKDE-KADPKSKDKG-SFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNI
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKT RPE+ I + R+ KD+ KAD KS+DKG S SS R ++RR+ +   +SRPYE +TPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKDE-KADPKSKDKG-SFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSG
        EE+GMEKLLKRPEKLRG PE+R+ DKY RFHR+HGHNTS+ WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK       R DRPAVIN        
Subjt:  EESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSG

Query:  GQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSG
             K+KELAR ARREVCIIREQRPT  I F+  +LE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQL KS TPLVGFSG
Subjt:  GQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSG

Query:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGR
        ES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRSAY+AIFGRPIIHSFR +PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    R
Subjt:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGR

Query:  D
        D
Subjt:  D

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]; e value: 2.7e-209; % identity: 65.07
Query:  EQRGSHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQ
        E+    + P   E   ++E   ++ R  DLR+HL  K+  +  + +   S SR   +SN +A+S + P  P  VI R EFD ++ + D QVE LKA+CE+
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ
        KE P +D DLGESPFTSD++EAPIPPKFK PT+KPYDG KDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTY+QLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEDHVLLSHRPSRRTLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDR-SRKDEKADPKSKDKGSFSSG
        FS RHYD+KTATHLATIRQKED           TLTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT+RPE++I + R S+K  K D KSKDKGS SSG
Subjt:  FSSRHYDKKTATHLATIRQKEDHVLLSHRPSRRTLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDR-SRKDEKADPKSKDKGSFSSG

Query:  -RDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPR
         R E+RR+ +GP+RSRPYER                                                       CWELKRQIEDLIQD YFKKFVGKPR
Subjt:  -RDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPR

Query:  TSSAEKKEERKRT-------DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVD
        ++S EKKEERKR+       DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ+PTC ITF  T+LE VHLPHNDALVIAPLIDHV+VRRVLVD
Subjt:  TSSAEKKEERKRT-------DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVD

Query:  GGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTP
        GGASANILSLPTYLAL  TRSQL KS TPLVGFS ESV PEGCIDLPVT+GQD TQVTQMAEFVVIDGR AY+AIF RPIIHSF+ +PS LHQVLKYSTP
Subjt:  GGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTP

Query:  NGVGTVRGEQTASRECYASALKGSSVCALETLVGRDGTLEFEADLPR
        NGVGTVRGEQ  SRECYASALK SSVCALE    +D       DLPR
Subjt:  NGVGTVRGEQTASRECYASALKGSSVCALETLVGRDGTLEFEADLPR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813; e value: 6.4e-241; % identity: 85.23
Query:  QAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASD
        +AESS NP TPAGVITR EFDQLRG+LDAQVE LKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVLLSHRP-------------------------SRRTL
        AIKCRAF+IALTGSARLWYRRLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKE   L  +                           +   L
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVLLSHRP-------------------------SRRTL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKD-EKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPER+IGR RS KD E ADPKSKDKGSFSSGR E+RRA NGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKD-EKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKY RFHREHGHNTSD WELKRQIE+LIQD YFKKFVGKPRTSSAEKKEERK       RTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  +LEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQL KS TPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823; e value: 3.4e-266; % identity: 83.47
Query:  SSNQQAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNP TP GVITR EFDQLRGKL+AQVE LKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDG KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVLLSHRPSRRTLTVKLGEEAPATFAEVLQKAKK
        AASDAIKCRAFQIALTGSARLW++           QL+   +AQ S    D     +  T    E             LTVKLG+EAPATFAEVLQKAKK
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVLLSHRPSRRTLTVKLGEEAPATFAEVLQKAKK

Query:  VIDGQELLRTKTNRPERRIGRDRSRKDEKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRS
        VIDGQELLRTKT RPER I R RS KDEKAD KSKDKGSFSSGR EFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+
Subjt:  VIDGQELLRTKTNRPERRIGRDRSRKDEKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRS

Query:  KDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE
        KDKY RFHREH HNTSD WELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK       R DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE
Subjt:  KDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE

Query:  QRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVT
        QRPTCPITFDS +LEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQL KS TPLVGFS ESVIPEGCIDLPVTLG DQTQVT
Subjt:  QRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVT

Query:  QMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDGTLEFEADLPRREFAAPTEELE
        QMAEFVVIDGRSAY+AIFGRPIIHSFR IPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALETLV RDGTLEF+A+LPRREFAAPTEELE
Subjt:  QMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDGTLEFEADLPRREFAAPTEELE

Query:  LVPLL
        LVPLL
Subjt:  LVPLL

A0A6J1DD03 uncharacterized protein LOC111019899; e value: 9.4e-208; % identity: 88.53
Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKD-EKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNI
        LTVKL EEAPATFAEVLQKAKKVIDGQELLRTK       IG+ RS KD E  DPKSKDKGSFS+GR E+RRA NGPTRSRPYERFTPTTIPISEILTNI
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKD-EKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSG
        EESGMEKLLKRPEKLRGAPERRSKDKY RFHREHGHNTSD WELK QIEDLIQD YFKKFVGKPRTSSAEKKEERK       RTDRPAVINTIFGGPSG
Subjt:  EESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSG

Query:  GQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSG
        GQSGHKRK+LARAARREVCIIREQRPTCPITFD  +L EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL KS TPLVGFSG
Subjt:  GQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSG

Query:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGR
        ESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAY+AIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG+SVCALETL  R
Subjt:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGR

Query:  DGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDL
        DGTLEFEADLP REFAAP EELELVPLLS EKQ  L
Subjt:  DGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDL

A0A6J1DHB3 uncharacterized protein LOC111020479; e value: 7.1e-224; % identity: 71.88
Query:  QAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASD
        +AESS+NP TP GVITR EFDQL+ K DAQVE LKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG KDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVL---LSHRPSRR----------------------TL
        AIKC AFQIALTGSARLWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKE   L   ++  P  +                      TL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVL---LSHRPSRR----------------------TL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKDE-KADPKSKDKG-SFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNI
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKT RPE+ I + R+ KD+ KAD KS+DKG S SS R ++RR+ +   +SRPYE +TPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKDE-KADPKSKDKG-SFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSG
        EE+GMEKLLKRPEKLRG PE+R+ DKY RFHR+HGHNTS+ WELKRQIEDLIQD YFKKFVGKPR++S EKKEERK       R DRPAVIN        
Subjt:  EESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERK-------RTDRPAVINTIFGGPSG

Query:  GQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSG
             K+KELAR ARREVCIIREQRPT  I F+  +LE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQL KS TPLVGFSG
Subjt:  GQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSG

Query:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGR
        ES+  EGCIDLPV++ QD TQVTQMAEFVVIDGRSAY+AIFGRPIIHSFR +PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    R
Subjt:  ESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGR

Query:  D
        D
Subjt:  D

A0A6J1DPC9 uncharacterized protein LOC111022280; e value: 1.3e-209; % identity: 65.07
Query:  EQRGSHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQ
        E+    + P   E   ++E   ++ R  DLR+HL  K+  +  + +   S SR   +SN +A+S + P  P  VI R EFD ++ + D QVE LKA+CE+
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ
        KE P +D DLGESPFTSD++EAPIPPKFK PT+KPYDG KDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTY+QLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGLKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEDHVLLSHRPSRRTLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDR-SRKDEKADPKSKDKGSFSSG
        FS RHYD+KTATHLATIRQKED           TLTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT+RPE++I + R S+K  K D KSKDKGS SSG
Subjt:  FSSRHYDKKTATHLATIRQKEDHVLLSHRPSRRTLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERRIGRDR-SRKDEKADPKSKDKGSFSSG

Query:  -RDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPR
         R E+RR+ +GP+RSRPYER                                                       CWELKRQIEDLIQD YFKKFVGKPR
Subjt:  -RDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPR

Query:  TSSAEKKEERKRT-------DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVD
        ++S EKKEERKR+       DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ+PTC ITF  T+LE VHLPHNDALVIAPLIDHV+VRRVLVD
Subjt:  TSSAEKKEERKRT-------DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVD

Query:  GGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTP
        GGASANILSLPTYLAL  TRSQL KS TPLVGFS ESV PEGCIDLPVT+GQD TQVTQMAEFVVIDGR AY+AIF RPIIHSF+ +PS LHQVLKYSTP
Subjt:  GGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFRVIPSTLHQVLKYSTP

Query:  NGVGTVRGEQTASRECYASALKGSSVCALETLVGRDGTLEFEADLPR
        NGVGTVRGEQ  SRECYASALK SSVCALE    +D       DLPR
Subjt:  NGVGTVRGEQTASRECYASALKGSSVCALETLVGRDGTLEFEADLPR

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCTAAGTATGAGGGCCGAGATGAACCTGGCCGAGGTCCGACCTACCGGGAAGCTCGGTGGGGACCGATGTGAGTTACTGTCAGCAGGCGTGATTATCGGGGGTAACCG
CCGATCCCGAGATTACCGGGTCCGCCCAAGTATTCAGATCGGTCCGGAGGCTGAGTTCGAGCTGCAATCTGAAATACACTGTTGTGCATATCCTTGCATAAACATTTGGC
GCCGTCTATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCTCCGCAGG
TCGGCACGAATAACCGAGCCTGTCCTACCACTTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGC
TCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTATAACGAGATGATACTAGCTGCAGGCGCAG
GGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAAGAACATCCCGAAGACAACGAGAGCGAGGGACAC
ACTCGCCGGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGA
ATCCTCTCACAACCCAGGAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGACTTAAAGGCCAAATGTGAGC
AGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTAT
GATGGGTTGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACCGG
TAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGA
CAGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGACCATGTGCTACTTTCTCACCGGCCTAGCCGACGAACCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACC
TTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACCAAAACCAATCGACCAGAACGAAGGATCGGCCGGGATAGAAGCAGAAAAGA
TGAAAAGGCGGATCCCAAGTCCAAGGACAAGGGATCTTTCTCCAGTGGCCGAGATGAGTTTCGAAGGGCGGTGAACGGACCCACCAGGAGCCGACCTTACGAACGCTTCA
CCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCAGAGAGGCGCAGC
AAGGACAAGTATTACCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGACTACTTCAAGAAATT
TGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCG
GACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTACAAACTTAGAGGAGGTC
CACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTTCTGGTAGACGGAGGCGCATCCGCTAACATCCTGTCCTTACCGAC
CTACCTCGCCCTGGGATGGACGAGGTCGCAATTGACGAAAAGCCTGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCA
CACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAGCGCCATCTTTGGGAGACCCATCATCCACTCATTTCGG
GTCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAA
AGGCTCATCGGTCTGCGCCCTCGAAACTCTCGTCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGC
TTGTTCCTCTGCTTAGTCCCGAGAAGCAAACCGACCTGACCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAAAGCCAGATCTGATGGAGATCGGCGCT
CCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGAGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGATTCGTGGTCCG
AGGTGGAGCATTGAAGGTCCAAACCCATGTGGGTGCCCTTGACTCGACCTGGGAGGGGCCGTTTGAAGTCAAGGGAATAGTCCGACCTGAGACGTACATGTTGGCCGATC
TGAAAGGAGACGTCCTCGCGCACCCATGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGCTAAGTATGAGGGCCGAGATGAACCTGGCCGAGGTCCGACCTACCGGGAAGCTCGGTGGGGACCGATGTGAGTTACTGTCAGCAGGCGTGATTATCGGGGGTAACCG
CCGATCCCGAGATTACCGGGTCCGCCCAAGTATTCAGATCGGTCCGGAGGCTGAGTTCGAGCTGCAATCTGAAATACACTGTTGTGCATATCCTTGCATAAACATTTGGC
GCCGTCTATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCTCCGCAGG
TCGGCACGAATAACCGAGCCTGTCCTACCACTTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGC
TCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTATAACGAGATGATACTAGCTGCAGGCGCAG
GGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAAGAACATCCCGAAGACAACGAGAGCGAGGGACAC
ACTCGCCGGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGA
ATCCTCTCACAACCCAGGAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGACTTAAAGGCCAAATGTGAGC
AGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTAT
GATGGGTTGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACCGG
TAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGA
CAGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGACCATGTGCTACTTTCTCACCGGCCTAGCCGACGAACCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACC
TTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACCAAAACCAATCGACCAGAACGAAGGATCGGCCGGGATAGAAGCAGAAAAGA
TGAAAAGGCGGATCCCAAGTCCAAGGACAAGGGATCTTTCTCCAGTGGCCGAGATGAGTTTCGAAGGGCGGTGAACGGACCCACCAGGAGCCGACCTTACGAACGCTTCA
CCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCAGAGAGGCGCAGC
AAGGACAAGTATTACCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGACTACTTCAAGAAATT
TGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCG
GACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTACAAACTTAGAGGAGGTC
CACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTTCTGGTAGACGGAGGCGCATCCGCTAACATCCTGTCCTTACCGAC
CTACCTCGCCCTGGGATGGACGAGGTCGCAATTGACGAAAAGCCTGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCA
CACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAGCGCCATCTTTGGGAGACCCATCATCCACTCATTTCGG
GTCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAA
AGGCTCATCGGTCTGCGCCCTCGAAACTCTCGTCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGC
TTGTTCCTCTGCTTAGTCCCGAGAAGCAAACCGACCTGACCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAAAGCCAGATCTGATGGAGATCGGCGCT
CCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGAGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGATTCGTGGTCCG
AGGTGGAGCATTGAAGGTCCAAACCCATGTGGGTGCCCTTGACTCGACCTGGGAGGGGCCGTTTGAAGTCAAGGGAATAGTCCGACCTGAGACGTACATGTTGGCCGATC
TGAAAGGAGACGTCCTCGCGCACCCATGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequence
MLSMRAEMNLAEVRPTGKLGGDRCELLSAGVIIGGNRRSRDYRVRPSIQIGPEAEFELQSEIHCCAYPCINIWRRLSKDSSCQRCPPEGGRSSSGRGARSRRPSNRTLRR
SARITEPVLPLAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGH
TRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPGTPAGVITRAEFDQLRGKLDAQVEDLKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPY
DGLKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEDHVLLSHRPSRRTLTVKLGEEAPAT
FAEVLQKAKKVIDGQELLRTKTNRPERRIGRDRSRKDEKADPKSKDKGSFSSGRDEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRS
KDKYYRFHREHGHNTSDCWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERKRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTNLEEV
HLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSLTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYSAIFGRPIIHSFR
VIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLVGRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQTDLTRSVPVEILDNPSISKPDLMEIGA
PESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRGGALKVQTHVGALDSTWEGPFEVKGIVRPETYMLADLKGDVLAHPWNAEHLKRYYP