; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g33490 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g33490
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPlastid envelope DNA binding protein
Genome locationchr4:25229795..25233116
RNA-Seq ExpressionMoc04g33490
SyntenyMoc04g33490
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602280.1 hypothetical protein SDJN03_07513, partial [Cucurbita argyrosperma subsp. sororia]5.8e-15671.36Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        L+ENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSD EQ TS NIQ S NG IINGSLVDVSDKDSD+ I++EL VNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQT+ELES V LF +E +N TK S    EKA       LS    D +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        +NG+++K+G +HEVEGP LE+ TDTP +  FEQ QK+SE KAPNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+S
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        W   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

KAG7032960.1 hypothetical protein SDJN02_07011 [Cucurbita argyrosperma subsp. argyrosperma]5.8e-15671.36Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        L+ENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSD EQ TS NIQ S NG IINGSLVDVSDKDSD+ I++EL VNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQT+ELES V LF +E +N TK S    EKA       LS    D +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        +NG+++K+G +HEVEGP LE+ TDTP +  FEQ QK+SE KAPNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+S
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        W   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

XP_022134343.1 uncharacterized protein LOC111006625 isoform X1 [Momordica charantia]5.7e-236100Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS

Query:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
        NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
Subjt:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE

Query:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE

XP_022134345.1 uncharacterized protein LOC111006625 isoform X2 [Momordica charantia]5.2e-20589.79Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSS                                            VEKAEENFVSSLSVAKFDSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS

Query:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
        NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
Subjt:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE

Query:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE

XP_038886590.1 uncharacterized protein LOC120076760 [Benincasa hispida]1.6e-16174.36Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLL-LEEHNFDH
        MHAI+GGWTG PLALA++NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKLL  EEH  DH
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLL-LEEHNFDH

Query:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        SLEENPLHSIAIEPQS L LS++E  F + Y+Q INEEPI VSDEQCT+TNIQ S NGPIINGSLVD++DK+  + IESELLVNE K+VEEVVKEESGMP
Subjt:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DVVVETFPL   S   N S  RSE  IST  SEKQVSQT+ELES VGLF        K S  VVEKAEENF   LS  + D +E A IVE+
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        SNGS +KEG ++EV GP+LEV +DTP +  FEQ QK+SE KAPNASPS  +N NKTFSNG DQASKIKEET++ENKVDA Q  GSQK++IPTLNRINLES
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        WE MSKN S  E+NP+LEI KAF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A1S3C473 uncharacterized protein LOC103496473 isoform X15.0e-15368.4Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLL-EEHNFDH
        MHAI+GGWTGRPLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFY VREIVRDIIQENR+LGPGKLLL EEHN DH
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLL-EEHNFDH

Query:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKE------------
        SL++NPLHSIAIEPQS L LSS+E  F + YN+ INEEPI VSDEQCT+TNIQ S N  IINGSLVDVS++DSD+ I+SELLVNE KE            
Subjt:  SLEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKE------------

Query:  ------------------VEEVVKEESGMPITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSD
                          VEEVVKEESGMPI HVTPLA DVVVETFPL P+    N S  RSE  IST  SEKQVSQ++ELES VGL     SN T  SD
Subjt:  ------------------VEEVVKEESGMPITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSD

Query:  HVVEKAEENFVSSLSVAKFDSMEVAPIVESSNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEET
         VVEKA ENF   LS  K D +EVA IVE SNGS +KEG +HEV GP+LEV +DTP +  FEQ QK+S+     SP  ++N NKTFSN  DQASKI    
Subjt:  HVVEKAEENFVSSLSVAKFDSMEVAPIVESSNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEET

Query:  QIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        +IENKVD  Q  GSQK+++PTLNRINLESWE MSKN S PE+NPLLEI+K+F++AFVKFWSE
Subjt:  QIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1BY28 uncharacterized protein LOC111006625 isoform X22.5e-20589.79Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSS                                            VEKAEENFVSSLSVAKFDSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS

Query:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
        NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
Subjt:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE

Query:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1C1R0 uncharacterized protein LOC111006625 isoform X12.8e-236100Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
        LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPI

Query:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
        THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS
Subjt:  THVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESS

Query:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
        NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE
Subjt:  NGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWE

Query:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        EMSKNPSNPESNPLLEILKAFVSAFVKFWSE
Subjt:  EMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1E1K4 uncharacterized protein LOC1114297503.7e-15671.36Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHAI+GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        LEENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSD EQ TS NIQ S NG IINGSLVD SDKDSD++I++ELLVNE K++EEV+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQ +ELES V LF +E +N TK S    EKA       LS    D +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        +NG+++K+G +HEVEGP LE+ TDTP +  FEQ QK+SE KAPNASPSGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+S
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        W   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

A0A6J1JPX5 uncharacterized protein LOC1114872474.5e-15470.9Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS
        MHA++GGW G PLALAK+NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFY VREIVRDIIQENRVLGPGKL LEEH+ DH 
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHS

Query:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP
        LEENPLHSIAIEPQS L   SEEFDF + +N CINEEPI+VSD EQ TS NIQ S NG IINGSLVD SDKDSD+ I++EL VNE K++E+V+KEESGMP
Subjt:  LEENPLHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSD-EQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMP

Query:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES
        I HVTPLA DV V TFPL   S AAN S   SET IST  SEK+VSQT+ELES V LF +E +N TK S    EKA   F  ++S    D +EVA IVE 
Subjt:  ITHVTPLAADVVVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVES

Query:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES
        +NG+I+K+G +HEVEGP LE+ TDTP +  FEQ QK+SE KAPNAS SGT+N N + +NGIDQASKIKEET+++NKV+A+Q  GSQK++IPTLNR+NL+S
Subjt:  SNGSILKEGGVHEVEGPQLEVRTDTPTAAAFEQCQKTSE-KAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLES

Query:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE
        W   SK+ S PE+NPLLEIL AF++AFVKFWSE
Subjt:  WEEMSKNPSNPESNPLLEILKAFVSAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding2.9e-4434.1Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E
        MH+++    G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFY +REIVR+IIQENRVLGPG LLLE      +
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E

Query:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-
         +   S+  +P+  +++ P          L  SSE  +  V  +Q C++           +E I +  +   ST+I ++         N    N  L + 
Subjt:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-

Query:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC
               V  K  DK ++ +   N+D+  EE+   ES               G  +T +        ++A+ VVETFPL  ++   +S   +       C
Subjt:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC

Query:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKFDSMEVAPIVES--SNGSILKEGGVHEVEG
        +  K     VE +      +  G   + TS  V+E                     K  E  V+S SV     +E A   E+   NG I   G VHE + 
Subjt:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKFDSMEVAPIVES--SNGSILKEGGVHEVEG

Query:  PQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNK----TFSN--GIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNP
           E    T TA   EQ   TS     ++ SG++ +++    T S+  G + AS  K+ T  + K+DA   S SQK+   TLNRI  ESW+  S N    
Subjt:  PQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNK----TFSN--GIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNP

Query:  ESNPLLEILKAFVSAFVKFWSE
        E+NPLL +LK+FV+AFVKFWSE
Subjt:  ESNPLLEILKAFVSAFVKFWSE

AT3G52170.2 DNA binding2.9e-4434.1Show/hide
Query:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E
        MH+++    G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFY +REIVR+IIQENRVLGPG LLLE      +
Subjt:  MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLE------E

Query:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-
         +   S+  +P+  +++ P          L  SSE  +  V  +Q C++           +E I +  +   ST+I ++         N    N  L + 
Subjt:  HNFDHSLEENPLHSIAIEPQ-------SHLPLSSEEFDFRVKYNQ-CIN-----------EEPILVSDEQCTSTNIQISH--------NGPIINGSLVD-

Query:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC
               V  K  DK ++ +   N+D+  EE+   ES               G  +T +        ++A+ VVETFPL  ++   +S   +       C
Subjt:  -------VSDKDSDKLIESELLVNEDKEVEEVVKEES---------------GMPITHV------TPLAADVVVETFPLSPISGAANSSGERSETSISTC

Query:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKFDSMEVAPIVES--SNGSILKEGGVHEVEG
        +  K     VE +      +  G   + TS  V+E                     K  E  V+S SV     +E A   E+   NG I   G VHE + 
Subjt:  DSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVE---------------------KAEENFVSSLSVAKFDSMEVAPIVES--SNGSILKEGGVHEVEG

Query:  PQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNK----TFSN--GIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNP
           E    T TA   EQ   TS     ++ SG++ +++    T S+  G + AS  K+ T  + K+DA   S SQK+   TLNRI  ESW+  S N    
Subjt:  PQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNK----TFSN--GIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNP

Query:  ESNPLLEILKAFVSAFVKFWSE
        E+NPLL +LK+FV+AFVKFWSE
Subjt:  ESNPLLEILKAFVSAFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein1.0e-0952Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y+VR+I +++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI

AT5G58210.2 hydroxyproline-rich glycoprotein family protein1.0e-0952Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y+VR+I +++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI

AT5G58210.3 hydroxyproline-rich glycoprotein family protein1.0e-0952Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y+VR+I +++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGCGATTAGGGGTGGGTGGACAGGGCGTCCTCTTGCCCTAGCCAAACACAATGAGTCTGAAGGGAGAAAGACTAGAATTCGGCGTTCAAAAGAGGAAAGG
AAGGCAATGGTCGAAGTCTTCATAAAAAAGTATCAGGAATCCAATAATGGAAGTTTCCCCTCTCTCAATCTTACCCACAAGGAGGTTGGTGGATCTTTCTACATG
GTGCGGGAGATTGTACGTGATATAATTCAAGAAAATCGAGTGCTTGGCCCTGGAAAGTTGTTACTAGAAGAGCACAACTTTGATCATTCACTTGAAGAGAATCCA
CTCCACTCAATTGCCATAGAACCTCAGTCCCATTTACCGTTATCATCAGAAGAATTTGATTTTAGAGTTAAGTACAACCAATGTATAAATGAAGAACCCATCCTT
GTCTCAGATGAACAATGCACTTCAACAAATATTCAGATATCACACAATGGGCCGATAATCAATGGTAGCCTGGTGGATGTGAGTGACAAGGACTCTGACAAACTT
ATCGAGTCAGAGTTGCTAGTGAATGAAGACAAGGAAGTAGAGGAAGTGGTCAAAGAGGAATCAGGAATGCCAATTACTCATGTGACACCTTTGGCGGCAGATGTG
GTAGTAGAGACATTTCCATTGAGTCCAATTTCTGGTGCTGCTAATAGTTCAGGTGAAAGATCCGAAACGTCGATTTCAACTTGTGATTCAGAAAAGCAAGTTAGT
CAAACTGTCGAGTTAGAGTCAGGGGTTGGTTTGTTTATCACTGAAGGTAGTAATTTCACAAAAACTTCTGATCATGTAGTCGAGAAAGCAGAGGAAAACTTTGTA
AGTTCATTATCAGTAGCAAAGTTTGATTCGATGGAAGTAGCACCAATTGTTGAAAGCTCTAATGGATCCATTCTCAAAGAAGGTGGCGTTCATGAAGTTGAGGGT
CCTCAGCTGGAAGTTCGTACTGATACTCCAACAGCTGCGGCCTTTGAACAATGCCAAAAAACTAGTGAGAAGGCTCCAAATGCTTCTCCAAGTGGTACCCAGAAT
CACAACAAGACATTCAGCAATGGCATTGATCAGGCCTCAAAAATCAAAGAGGAGACACAGATTGAAAATAAAGTAGATGCTCAACAGGACAGTGGCTCCCAGAAA
CAAACCATTCCAACATTAAATAGAATCAATCTCGAATCATGGGAAGAGATGTCCAAAAATCCCTCAAACCCCGAAAGCAACCCGCTTCTGGAAATCCTCAAGGCA
TTCGTTTCCGCCTTCGTGAAGTTTTGGTCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACGCGATTAGGGGTGGGTGGACAGGGCGTCCTCTTGCCCTAGCCAAACACAATGAGTCTGAAGGGAGAAAGACTAGAATTCGGCGTTCAAAAGAGGAAAGG
AAGGCAATGGTCGAAGTCTTCATAAAAAAGTATCAGGAATCCAATAATGGAAGTTTCCCCTCTCTCAATCTTACCCACAAGGAGGTTGGTGGATCTTTCTACATG
GTGCGGGAGATTGTACGTGATATAATTCAAGAAAATCGAGTGCTTGGCCCTGGAAAGTTGTTACTAGAAGAGCACAACTTTGATCATTCACTTGAAGAGAATCCA
CTCCACTCAATTGCCATAGAACCTCAGTCCCATTTACCGTTATCATCAGAAGAATTTGATTTTAGAGTTAAGTACAACCAATGTATAAATGAAGAACCCATCCTT
GTCTCAGATGAACAATGCACTTCAACAAATATTCAGATATCACACAATGGGCCGATAATCAATGGTAGCCTGGTGGATGTGAGTGACAAGGACTCTGACAAACTT
ATCGAGTCAGAGTTGCTAGTGAATGAAGACAAGGAAGTAGAGGAAGTGGTCAAAGAGGAATCAGGAATGCCAATTACTCATGTGACACCTTTGGCGGCAGATGTG
GTAGTAGAGACATTTCCATTGAGTCCAATTTCTGGTGCTGCTAATAGTTCAGGTGAAAGATCCGAAACGTCGATTTCAACTTGTGATTCAGAAAAGCAAGTTAGT
CAAACTGTCGAGTTAGAGTCAGGGGTTGGTTTGTTTATCACTGAAGGTAGTAATTTCACAAAAACTTCTGATCATGTAGTCGAGAAAGCAGAGGAAAACTTTGTA
AGTTCATTATCAGTAGCAAAGTTTGATTCGATGGAAGTAGCACCAATTGTTGAAAGCTCTAATGGATCCATTCTCAAAGAAGGTGGCGTTCATGAAGTTGAGGGT
CCTCAGCTGGAAGTTCGTACTGATACTCCAACAGCTGCGGCCTTTGAACAATGCCAAAAAACTAGTGAGAAGGCTCCAAATGCTTCTCCAAGTGGTACCCAGAAT
CACAACAAGACATTCAGCAATGGCATTGATCAGGCCTCAAAAATCAAAGAGGAGACACAGATTGAAAATAAAGTAGATGCTCAACAGGACAGTGGCTCCCAGAAA
CAAACCATTCCAACATTAAATAGAATCAATCTCGAATCATGGGAAGAGATGTCCAAAAATCCCTCAAACCCCGAAAGCAACCCGCTTCTGGAAATCCTCAAGGCA
TTCGTTTCCGCCTTCGTGAAGTTTTGGTCCGAGTAA
Protein sequenceShow/hide protein sequence
MHAIRGGWTGRPLALAKHNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYMVREIVRDIIQENRVLGPGKLLLEEHNFDHSLEENP
LHSIAIEPQSHLPLSSEEFDFRVKYNQCINEEPILVSDEQCTSTNIQISHNGPIINGSLVDVSDKDSDKLIESELLVNEDKEVEEVVKEESGMPITHVTPLAADV
VVETFPLSPISGAANSSGERSETSISTCDSEKQVSQTVELESGVGLFITEGSNFTKTSDHVVEKAEENFVSSLSVAKFDSMEVAPIVESSNGSILKEGGVHEVEG
PQLEVRTDTPTAAAFEQCQKTSEKAPNASPSGTQNHNKTFSNGIDQASKIKEETQIENKVDAQQDSGSQKQTIPTLNRINLESWEEMSKNPSNPESNPLLEILKA
FVSAFVKFWSE