; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1450 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1450
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPlastid envelope DNA binding protein
Genome locationMC04:22427278..22432163
RNA-Seq ExpressionMC04g1450
SyntenyMC04g1450
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602280.1 hypothetical protein SDJN03_07513, partial [Cucurbita argyrosperma subsp. sororia]2.60e-19671.36Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDE-QCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        L+ENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSDE Q TS NIQ S NG IINGSLVDVSDKDSD+ I++EL VNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDE-QCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQT+ELES V LF +E +N TK S    EKA       LS    D +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        +NG+++K+G +HEVEGP LE+ TDTP +  FEQ QK+SE KAPNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+S
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        W   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

KAG7032960.1 hypothetical protein SDJN02_07011 [Cucurbita argyrosperma subsp. argyrosperma]2.60e-19671.36Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDE-QCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        L+ENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSDE Q TS NIQ S NG IINGSLVDVSDKDSD+ I++EL VNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDE-QCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQT+ELES V LF +E +N TK S    EKA       LS    D +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        +NG+++K+G +HEVEGP LE+ TDTP +  FEQ QK+SE KAPNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+S
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        W   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

XP_022134343.1 uncharacterized protein LOC111006625 isoform X1 [Momordica charantia]2.82e-301100Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS

Query:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
        NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
Subjt:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE

Query:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE

XP_022134345.1 uncharacterized protein LOC111006625 isoform X2 [Momordica charantia]1.82e-26189.79Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSS                                            VEKAEENFVSSLSVAKFDSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS

Query:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
        NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
Subjt:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE

Query:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE

XP_038886590.1 uncharacterized protein LOC120076760 [Benincasa hispida]1.80e-20374.36Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEE-HNFDH
        MHAI+GGWTG PLALA++NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKLL EE H  DH
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEE-HNFDH

Query:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        SLEENPLHSIAIEPQS L LS++E  F + Y+Q INEEPI VSDEQCT+TNIQ S NGPIINGSLVD++DK+  + IESELLVNE K+VEEVVKEESGMP
Subjt:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DVVVETFPL   S   N S  RSE  IST  SEKQVSQT+ELES VGLF        K S  VVEKAEENF   LS +  D +E A IVE+
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        SNGS +KEG ++EV GP+LEV +DTP +  FEQ QK+SE KAPNASPS  +N NKTFSNG DQASKIKEET++ENKVDA Q  GSQK++IPTLNRINLES
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        WE MSKN S  E+NP+LEI KAF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A1S3C473 uncharacterized protein LOC103496473 isoform X13.63e-19268.4Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEE-HNFDH
        MHAI+GGWTGRPLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFY VREIVRDIIQENR+LGPGKLLLEE HN DH
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEE-HNFDH

Query:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVV-------
        SL++NPLHSIAIEPQS L LSS+E  F + YN+ INEEPI VSDEQCT+TNIQ S N  IINGSLVDVS++DSD+ I+SELLVNE KEVEEVV       
Subjt:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVV-------

Query:  -----------------------KEESGMPITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSD
                               KEESGMPI HVTPLA DVVVETFPL P+    N S  RSE  IST  SEKQVSQ++ELES VGL     SN T  SD
Subjt:  -----------------------KEESGMPITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSD

Query:  HVVEKAEENFVSSLSVAKFDSMEVAPIVESSNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEET
         VVEKA ENF   LS  K D +EVA IVE SNGS +KEG +HEV GP+LEV +DTP +  FEQ QK+S+     SP  ++N NKTFSN  DQASKI    
Subjt:  HVVEKAEENFVSSLSVAKFDSMEVAPIVESSNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEET

Query:  QIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        +IENKVD  Q  GSQK+++PTLNRINLESWE MSKN S PE+NPLLEI+K+F++AFVKFWSE
Subjt:  QIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1BY28 uncharacterized protein LOC111006625 isoform X28.83e-26289.79Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSS                                            VEKAEENFVSSLSVAKFDSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS

Query:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
        NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
Subjt:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE

Query:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1C1R0 uncharacterized protein LOC111006625 isoform X11.37e-301100Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS

Query:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
        NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
Subjt:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE

Query:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1E1K4 uncharacterized protein LOC1114297501.79e-19671.36Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDE-QCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        LEENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSDE Q TS NIQ S NG IINGSLVD SDKDSD++I++ELLVNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDE-QCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQ +ELES V LF +E +N TK S    EKA       LS    D +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        +NG+++K+G +HEVEGP LE+ TDTP +  FEQ QK+SE KAPNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+S
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        W   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1JPX5 uncharacterized protein LOC1114872479.72e-19470.67Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHA++GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDE-QCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        LEENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSDE Q TS NIQ S NG IINGSLVD SDKDSD+ I++EL VNE K++E+V+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDE-QCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQT+ELES V LF +E +N TK S    EKA    +S       D +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        +NG+I+K+G +HEVEGP LE+ TDTP +  FEQ QK+SE KAPNAS SGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+S
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        W   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding2.9e-4434.1Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E
        MH+++    G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFY +REIVR+IIQENRVLGPG LLLE      +
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E

Query:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-
         +   S+  +P+  +++ P          L  SSE  +  V  +Q C++           +E I +  +   ST+I ++         N    N  L + 
Subjt:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-

Query:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC
               V  K  DK ++ +   N+D+  EE+   ES               G  +T +        ++A+ VVETFPL  ++   +S   +       C
Subjt:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC

Query:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKFDSMEVAPIVES--SNGSILKEGGVHEVEG
        +  K     VE +      +  G   + TS  V+E                     K  E  V+S SV     +E A   E+   NG I   G VHE + 
Subjt:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKFDSMEVAPIVES--SNGSILKEGGVHEVEG

Query:  PQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNK----TFSN--GIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNP
           E    T TA   EQ   TS     ++ SG++ +++    T S+  G + AS  K+ T  + K+DA   S SQK+   TLNRI  ESW+  S N    
Subjt:  PQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNK----TFSN--GIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNP

Query:  ESNPLLEILKAFVSAFVKFWSE
        E+NPLL +LK+FV+AFVKFWSE
Subjt:  ESNPLLEILKAFVSAFVKFWSE

AT3G52170.2 DNA binding2.9e-4434.1Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E
        MH+++    G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFY +REIVR+IIQENRVLGPG LLLE      +
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E

Query:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-
         +   S+  +P+  +++ P          L  SSE  +  V  +Q C++           +E I +  +   ST+I ++         N    N  L + 
Subjt:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-

Query:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC
               V  K  DK ++ +   N+D+  EE+   ES               G  +T +        ++A+ VVETFPL  ++   +S   +       C
Subjt:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC

Query:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKFDSMEVAPIVES--SNGSILKEGGVHEVEG
        +  K     VE +      +  G   + TS  V+E                     K  E  V+S SV     +E A   E+   NG I   G VHE + 
Subjt:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKFDSMEVAPIVES--SNGSILKEGGVHEVEG

Query:  PQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNK----TFSN--GIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNP
           E    T TA   EQ   TS     ++ SG++ +++    T S+  G + AS  K+ T  + K+DA   S SQK+   TLNRI  ESW+  S N    
Subjt:  PQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNK----TFSN--GIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNP

Query:  ESNPLLEILKAFVSAFVKFWSE
        E+NPLL +LK+FV+AFVKFWSE
Subjt:  ESNPLLEILKAFVSAFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein1.0e-0952Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y+VR+I +++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI

AT5G58210.2 hydroxyproline-rich glycoprotein family protein1.0e-0952Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y+VR+I +++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI

AT5G58210.3 hydroxyproline-rich glycoprotein family protein1.0e-0952Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y+VR+I +++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGCGATTAGGGGTGGGTGGACAGGGCGTCCTCTTGCCCTAGCCAAACACAATGAGTCTGAAGGGAGAAAGACTAGAATTCGGCGTTCAAAAGAGGAAAGGAAGGC
AATGGTCGAAGTCTTCATAAAAAAGTATCAGGAATCCAATAATGGAAGTTTCCCCTCTCTCAATCTTACCCACAAGGAGGTTGGTGGATCTTTCTACATGGTGCGGGAGA
TTGTACGTGATATAATTCAAGAAAATCGAGTGCTTGGCCCTGGAAAGTTGTTACTAGAAGAGCACAACTTTGATCATTCACTTGAAGAGAATCCACTCCACTCAATTGCC
ATAGAACCTCAGTCCCATTTACCGTTATCATCAGAAGAATTTGATTTTAGAGTTAAGTACAACCAATGTATAAATGAAGAACCCATCCTTGTCTCAGATGAACAATGCAC
TTCAACAAATATTCAGATATCACACAATGGGCCGATAATCAATGGTAGCCTGGTGGATGTGAGTGACAAGGACTCTGACAAACTTATCGAGTCAGAGTTGCTAGTGAATG
AAGACAAGGAAGTAGAGGAAGTGGTCAAAGAGGAATCAGGAATGCCAATTACTCATGTGACACCTTTGGCGGCAGATGTGGTAGTAGAGACATTTCCATTGAGTCCAATT
TCTGGTGCTGCTAATAGTTCAGGTGAAAGATCCGAAACGTCGATTTCAACTTGTGATTCAGAAAAGCAAGTTAGTCAAACTGTCGAGTTAGAGTCAGGGGTTGGTTTGTT
TATCACTGAAGGTAGTAATTTCACAAAAACTTCTGATCATGTAGTCGAGAAAGCAGAGGAAAACTTTGTAAGTTCATTATCAGTAGCAAAGTTTGATTCGATGGAAGTAG
CACCAATTGTTGAAAGCTCTAATGGATCCATTCTCAAAGAAGGTGGCGTTCATGAAGTTGAGGGTCCTCAGCTGGAAGTTCGTACTGATACTCCAACAGCTGCGGCCTTT
GAACAATGCCAAAAAACTAGTGAGAAGGCTCCAAATGCTTCTCCAAGTGGTACCCAGAATCACAACAAGACATTCAGCAATGGCATTGATCAGGCCTCAAAAATCAAAGA
GGAGACACAGATTGAAAATAAAGTAGATGCTCAACAGGACAGTGGCTCCCAGAAACAAACCATTCCAACATTAAATAGAATCAATCTCGAATCATGGGAAGAGATGTCCA
AAAATCCCTCAAACCCCGAAAGCAACCCGCTTCTGGAAATCCTCAAGGCATTCGTTTCCGCCTTCGTGAAGTTTTGGTCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
AGGGAAGACTAATAATTACAAATGGTATGGAGGACTATTTCTACATAACTATAAGATATATGTATACACATATAAGATATATTCACATATATAATATATACCTATATATA
CGAATATAATCAGGAATACTAAGACTGCGAGACGTTTAGAAATTTCATTTTCTGTATACATAGGCCGACACCAAGAGGCGCATTAAACTGGGGCGGCCCAGTCCATCAAA
TCACGGTAACAATCGGACATAGATTTATTTCAACACAGTAGAATTAGAATTACATGACATATTCACCTCCCGCTCTCACCATTCCCATTCCCCACTGCCCATTTTTGCGT
TGGAAGTTTGCAGAATGCCACTGGCCCGCTCTCTGTGACCCCTCAGTAGGGTTTTACACTTCTCTATCTCCATTTCTCTCTCTTGGCTCCTCCCCCGCTCTCTCCCGGAG
CACAGGTTTGTCTCCTTAGGGTTTAACTTTGTGGATTTCATGCACGCGATTAGGGGTGGGTGGACAGGGCGTCCTCTTGCCCTAGCCAAACACAATGAGTCTGAAGGGAG
AAAGACTAGAATTCGGCGTTCAAAAGAGGAAAGGAAGGCAATGGTCGAAGTCTTCATAAAAAAGTATCAGGAATCCAATAATGGAAGTTTCCCCTCTCTCAATCTTACCC
ACAAGGAGGTTGGTGGATCTTTCTACATGGTGCGGGAGATTGTACGTGATATAATTCAAGAAAATCGAGTGCTTGGCCCTGGAAAGTTGTTACTAGAAGAGCACAACTTT
GATCATTCACTTGAAGAGAATCCACTCCACTCAATTGCCATAGAACCTCAGTCCCATTTACCGTTATCATCAGAAGAATTTGATTTTAGAGTTAAGTACAACCAATGTAT
AAATGAAGAACCCATCCTTGTCTCAGATGAACAATGCACTTCAACAAATATTCAGATATCACACAATGGGCCGATAATCAATGGTAGCCTGGTGGATGTGAGTGACAAGG
ACTCTGACAAACTTATCGAGTCAGAGTTGCTAGTGAATGAAGACAAGGAAGTAGAGGAAGTGGTCAAAGAGGAATCAGGAATGCCAATTACTCATGTGACACCTTTGGCG
GCAGATGTGGTAGTAGAGACATTTCCATTGAGTCCAATTTCTGGTGCTGCTAATAGTTCAGGTGAAAGATCCGAAACGTCGATTTCAACTTGTGATTCAGAAAAGCAAGT
TAGTCAAACTGTCGAGTTAGAGTCAGGGGTTGGTTTGTTTATCACTGAAGGTAGTAATTTCACAAAAACTTCTGATCATGTAGTCGAGAAAGCAGAGGAAAACTTTGTAA
GTTCATTATCAGTAGCAAAGTTTGATTCGATGGAAGTAGCACCAATTGTTGAAAGCTCTAATGGATCCATTCTCAAAGAAGGTGGCGTTCATGAAGTTGAGGGTCCTCAG
CTGGAAGTTCGTACTGATACTCCAACAGCTGCGGCCTTTGAACAATGCCAAAAAACTAGTGAGAAGGCTCCAAATGCTTCTCCAAGTGGTACCCAGAATCACAACAAGAC
ATTCAGCAATGGCATTGATCAGGCCTCAAAAATCAAAGAGGAGACACAGATTGAAAATAAAGTAGATGCTCAACAGGACAGTGGCTCCCAGAAACAAACCATTCCAACAT
TAAATAGAATCAATCTCGAATCATGGGAAGAGATGTCCAAAAATCCCTCAAACCCCGAAAGCAACCCGCTTCTGGAAATCCTCAAGGCATTCGTTTCCGCCTTCGTGAAG
TTTTGGTCCGAGTAAGTAGCTCTATGATTGTCAAATAGATGAATAGAGAGTAATTAGAGATTAGTTTTTTGCCTGCCTGTCTCTGTACCCCCACCCGGTTTACTCCGGTC
CCATCGTCGGTACTTCTGAAGAAACTGGGATGTGGGTTGGCATTTCTACCCTCAGTAGTTACAAGGTAAGCAAAATCTTGTAGAAGGGATATGTTTATTTTTTTGCTTTT
TTACTGTAACCATGAGATGTGTCACCCCATGATTATTTTCTTCCAGATATTACGGTTGATTATATTCTTTTTCCATTACCACATGATAAAAAAGAGAGAGTAACGTGAGA
AGGAGCCTTTTGTGTGTTATAAGAAGGAACGACAGGGAAACAGGAGAGAAAGGGATATACAGATTGGAGTTTTAATAGCCATTTTCTTTCTCCTTTTTGCTCTGACCCAC
CCGACA
Protein sequenceShow/hide protein sequence
MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHSLEENPLHSIA
IEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPITHVTPLAADVVVETFPLSPI
SGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESSNGSILKEGGVHEVEGPQLEVRTDTPTAAAF
EQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE