; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010108 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010108
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPlastid envelope DNA binding protein
Genome locationscaffold779:709103..712414
RNA-Seq ExpressionMS010108
SyntenyMS010108
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602280.1 hypothetical protein SDJN03_07513, partial [Cucurbita argyrosperma subsp. sororia]3.8e-15570.97Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        L+ENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSD EQ TS NIQ S NG IINGSLVDVSDKDSD+ I++EL VNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQT+ELES V LF +E +N TK S    EKA       LS   SD +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES

Query:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE
        +NG+++K+G + EVEGP LE+ TDTP +  FEQ QK+SE + APNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+
Subjt:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE

Query:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        SW   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

KAG7032960.1 hypothetical protein SDJN02_07011 [Cucurbita argyrosperma subsp. argyrosperma]3.8e-15570.97Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        L+ENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSD EQ TS NIQ S NG IINGSLVDVSDKDSD+ I++EL VNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQT+ELES V LF +E +N TK S    EKA       LS   SD +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES

Query:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE
        +NG+++K+G + EVEGP LE+ TDTP +  FEQ QK+SE + APNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+
Subjt:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE

Query:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        SW   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

XP_022134343.1 uncharacterized protein LOC111006625 isoform X1 [Momordica charantia]2.9e-23299.08Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAK DSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVESS

Query:  NGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        NGSILKEGGV EVEGPQLEVRTDTPTAAAFEQCQKTSEK  APNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
Subjt:  NGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

XP_022134345.1 uncharacterized protein LOC111006625 isoform X2 [Momordica charantia]2.7e-20188.91Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSS                                            VEKAEENFVSSLSVAK DSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVESS

Query:  NGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        NGSILKEGGV EVEGPQLEVRTDTPTAAAFEQCQKTSEK  APNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
Subjt:  NGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

XP_038886590.1 uncharacterized protein LOC120076760 [Benincasa hispida]6.0e-16174.19Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLL-LEEHNFDH
        MHAI+GGWTG PLALA++NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKLL  EEH  DH
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLL-LEEHNFDH

Query:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        SLEENPLHSIAIEPQS L LS++E  F + Y+Q INEEPI VSDEQCT+TNIQ S NGPIINGSLVD++DK+  + IESELLVNE K+VEEVVKEESGMP
Subjt:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES
        I HVTPLA DVVVETFPL   S   N S  RSE  IST  SEKQVSQT+ELES VGLF        K S  VVEKAEENF   LS  +SD +E A IVE+
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES

Query:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE
        SNGS +KEG + EV GP+LEV +DTP +  FEQ QK+SE + APNASPS  +N NKTFSNG DQASKIKEET++ENKVDA Q  GSQK++IPTLNRINLE
Subjt:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE

Query:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        SWE MSKN S  E+NP+LEI KAF++AFVKFWSE
Subjt:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A1S3C473 uncharacterized protein LOC103496473 isoform X13.2e-15268.1Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLL-EEHNFDH
        MHAI+GGWTGRPLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFY VREIVRDIIQENR+LGPGKLLL EEHN DH
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLL-EEHNFDH

Query:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKE------------
        SL++NPLHSIAIEPQS L LSS+E  F + YN+ INEEPI VSDEQCT+TNIQ S N  IINGSLVDVS++DSD+ I+SELLVNE KE            
Subjt:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKE------------

Query:  ------------------VEEVVKEESGMPITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSD
                          VEEVVKEESGMPI HVTPLA DVVVETFPL P+    N S  RSE  IST  SEKQVSQ++ELES VGL     SN T  SD
Subjt:  ------------------VEEVVKEESGMPITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSD

Query:  HVVEKAEENFVSSLSVAKSDSMEVAPIVESSNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKE
         VVEKA ENF   LS  KSD +EVA IVE SNGS +KEG + EV GP+LEV +DTP +  FEQ QK+S+       SP  ++N NKTFSN  DQASKI  
Subjt:  HVVEKAEENFVSSLSVAKSDSMEVAPIVESSNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKE

Query:  ETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
          +IENKVD  Q  GSQK+++PTLNRINLESWE MSKN S PE+NPLLEI+K+F++AFVKFWSE
Subjt:  ETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1BY28 uncharacterized protein LOC111006625 isoform X21.3e-20188.91Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSS                                            VEKAEENFVSSLSVAK DSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVESS

Query:  NGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        NGSILKEGGV EVEGPQLEVRTDTPTAAAFEQCQKTSEK  APNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
Subjt:  NGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1C1R0 uncharacterized protein LOC111006625 isoform X11.4e-23299.08Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAK DSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVESS

Query:  NGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        NGSILKEGGV EVEGPQLEVRTDTPTAAAFEQCQKTSEK  APNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
Subjt:  NGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1E1K4 uncharacterized protein LOC1114297502.4e-15570.97Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        LEENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSD EQ TS NIQ S NG IINGSLVD SDKDSD++I++ELLVNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQ +ELES V LF +E +N TK S    EKA       LS   SD +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES

Query:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE
        +NG+++K+G + EVEGP LE+ TDTP +  FEQ QK+SE + APNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+
Subjt:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE

Query:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        SW   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1JPX5 uncharacterized protein LOC1114872472.9e-15370.28Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHA++GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        LEENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSD EQ TS NIQ S NG IINGSLVD SDKDSD+ I++EL VNE K++E+V+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQT+ELES V LF +E +N TK S    EKA        S   SD +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVES

Query:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE
        +NG+I+K+G + EVEGP LE+ TDTP +  FEQ QK+SE + APNAS SGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+
Subjt:  SNGSILKEGGVREVEGPQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLE

Query:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        SW   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  SWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding3.2e-4333.59Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E
        MH+++    G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFY +REIVR+IIQENRVLGPG LLLE      +
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E

Query:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-
         +   S+  +P+  +++ P          L  SSE  +  V  +Q C++           +E I +  +   ST+I ++         N    N  L + 
Subjt:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-

Query:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC
               V  K  DK ++ +   N+D+  EE+   ES               G  +T +        ++A+ VVETFPL  ++   +S   +       C
Subjt:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC

Query:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKSDSMEVAPIVES--SNGSILKEGGVREVEG
        +  K     VE +      +  G   + TS  V+E                     K  E  V+S SV     +E A   E+   NG I   G V E + 
Subjt:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKSDSMEVAPIVES--SNGSILKEGGVREVEG

Query:  PQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNP
           E    T TA   EQ   TS   S    +     +   +++ G + AS  K+ T  + K+DA   S SQK+   TLNRI  ESW+  S N    E+NP
Subjt:  PQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNP

Query:  LLEILKAFVSAFVKFWSE
        LL +LK+FV+AFVKFWSE
Subjt:  LLEILKAFVSAFVKFWSE

AT3G52170.2 DNA binding3.2e-4333.59Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E
        MH+++    G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFY +REIVR+IIQENRVLGPG LLLE      +
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E

Query:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-
         +   S+  +P+  +++ P          L  SSE  +  V  +Q C++           +E I +  +   ST+I ++         N    N  L + 
Subjt:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-

Query:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC
               V  K  DK ++ +   N+D+  EE+   ES               G  +T +        ++A+ VVETFPL  ++   +S   +       C
Subjt:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC

Query:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKSDSMEVAPIVES--SNGSILKEGGVREVEG
        +  K     VE +      +  G   + TS  V+E                     K  E  V+S SV     +E A   E+   NG I   G V E + 
Subjt:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKSDSMEVAPIVES--SNGSILKEGGVREVEG

Query:  PQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNP
           E    T TA   EQ   TS   S    +     +   +++ G + AS  K+ T  + K+DA   S SQK+   TLNRI  ESW+  S N    E+NP
Subjt:  PQLEVRTDTPTAAAFEQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNP

Query:  LLEILKAFVSAFVKFWSE
        LL +LK+FV+AFVKFWSE
Subjt:  LLEILKAFVSAFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein1.0e-0952Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y+VR+I +++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI

AT5G58210.2 hydroxyproline-rich glycoprotein family protein1.0e-0952Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y+VR+I +++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI

AT5G58210.3 hydroxyproline-rich glycoprotein family protein1.0e-0952Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y+VR+I +++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGCGATTAGGGGTGGGTGGACAGGGCGTCCTCTTGCCCTAGCCAAACACAATGAGTCTGAAGGGAGAAAGACTAGAATTCGGCGTTCAAAAGAGGAAAGGAAGGC
AATGGTCGAAGTCTTCATAAAAAAGTATCAGGAATCCAATAATGGAAGTTTCCCCTCTCTCAATCTTACCCACAAGGAGGTTGGTGGATCTTTCTACATGGTGCGGGAGA
TTGTACGTGATATAATTCAAGAAAATCGAGTGCTTGGCCCTGGAAAGTTGTTACTAGAAGAGCACAACTTTGATCATTCACTTGAAGAGAATCCACTCCACTCAATTGCC
ATAGAACCTCAGTCCCATTTACCGTTATCATCAGAAGAATTTGATTTTAGAGTTAAGTACAACCAATGTATAAATGAAGAACCCATCCTTGTCTCAGATGAACAATGCAC
TTCAACAAATATTCAGATATCACACAATGGGCCGATAATCAATGGTAGCCTGGTGGATGTGAGTGACAAGGACTCTGACAAACTTATCGAGTCAGAGTTGCTAGTGAATG
AAGACAAGGAAGTAGAGGAAGTGGTCAAAGAGGAATCAGGAATGCCAATTACTCATGTGACACCTTTGGCGGCAGATGTGGTAGTAGAGACATTTCCATTGAGTCCAATT
TCTGGTGCTGCTAATAGTTCAGGTGAAAGATCCGAAACGTCGATTTCAACTTGTGATTCAGAAAAGCAAGTTAGTCAAACTGTCGAGTTAGAGTCAGGGGTTGGTTTGTT
TATCACTGAAGGTAGTAATTTCACAAAAACTTCTGATCATGTAGTTGAGAAAGCAGAGGAAAACTTTGTAAGTTCATTATCAGTAGCAAAGTCTGATTCGATGGAAGTAG
CACCAATTGTTGAAAGCTCTAATGGATCCATTCTCAAAGAAGGTGGCGTTCGTGAAGTTGAGGGTCCTCAGCTGGAAGTTCGTACTGATACTCCAACAGCTGCGGCCTTT
GAACAATGCCAAAAAACTAGTGAGAAGGTGAGTGCTCCAAATGCTTCTCCAAGTGGTACCCAGAATCACAACAAGACATTCAGCAATGGCATTGATCAGGCCTCAAAAAT
CAAAGAGGAGACACAGATTGAAAATAAAGTAGATGCTCAACAGGACAGTGGCTCCCAGAAACAAACCATTCCAACATTAAATAGAATCAATCTCGAATCATGGGAAGAGA
TGTCCAAAAATCCCTCAAACCCCGAAAGCAACCCGCTTCTGGAAATCCTCAAGGCATTCGTTTCCGCCTTCGTGAAGTTTTGGTCCGAG
mRNA sequenceShow/hide mRNA sequence
ATGCACGCGATTAGGGGTGGGTGGACAGGGCGTCCTCTTGCCCTAGCCAAACACAATGAGTCTGAAGGGAGAAAGACTAGAATTCGGCGTTCAAAAGAGGAAAGGAAGGC
AATGGTCGAAGTCTTCATAAAAAAGTATCAGGAATCCAATAATGGAAGTTTCCCCTCTCTCAATCTTACCCACAAGGAGGTTGGTGGATCTTTCTACATGGTGCGGGAGA
TTGTACGTGATATAATTCAAGAAAATCGAGTGCTTGGCCCTGGAAAGTTGTTACTAGAAGAGCACAACTTTGATCATTCACTTGAAGAGAATCCACTCCACTCAATTGCC
ATAGAACCTCAGTCCCATTTACCGTTATCATCAGAAGAATTTGATTTTAGAGTTAAGTACAACCAATGTATAAATGAAGAACCCATCCTTGTCTCAGATGAACAATGCAC
TTCAACAAATATTCAGATATCACACAATGGGCCGATAATCAATGGTAGCCTGGTGGATGTGAGTGACAAGGACTCTGACAAACTTATCGAGTCAGAGTTGCTAGTGAATG
AAGACAAGGAAGTAGAGGAAGTGGTCAAAGAGGAATCAGGAATGCCAATTACTCATGTGACACCTTTGGCGGCAGATGTGGTAGTAGAGACATTTCCATTGAGTCCAATT
TCTGGTGCTGCTAATAGTTCAGGTGAAAGATCCGAAACGTCGATTTCAACTTGTGATTCAGAAAAGCAAGTTAGTCAAACTGTCGAGTTAGAGTCAGGGGTTGGTTTGTT
TATCACTGAAGGTAGTAATTTCACAAAAACTTCTGATCATGTAGTTGAGAAAGCAGAGGAAAACTTTGTAAGTTCATTATCAGTAGCAAAGTCTGATTCGATGGAAGTAG
CACCAATTGTTGAAAGCTCTAATGGATCCATTCTCAAAGAAGGTGGCGTTCGTGAAGTTGAGGGTCCTCAGCTGGAAGTTCGTACTGATACTCCAACAGCTGCGGCCTTT
GAACAATGCCAAAAAACTAGTGAGAAGGTGAGTGCTCCAAATGCTTCTCCAAGTGGTACCCAGAATCACAACAAGACATTCAGCAATGGCATTGATCAGGCCTCAAAAAT
CAAAGAGGAGACACAGATTGAAAATAAAGTAGATGCTCAACAGGACAGTGGCTCCCAGAAACAAACCATTCCAACATTAAATAGAATCAATCTCGAATCATGGGAAGAGA
TGTCCAAAAATCCCTCAAACCCCGAAAGCAACCCGCTTCTGGAAATCCTCAAGGCATTCGTTTCCGCCTTCGTGAAGTTTTGGTCCGAG
Protein sequenceShow/hide protein sequence
MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHSLEENPLHSIA
IEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPITHVTPLAADVVVETFPLSPI
SGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKSDSMEVAPIVESSNGSILKEGGVREVEGPQLEVRTDTPTAAAF
EQCQKTSEKVSAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE