; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g41600 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g41600
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionFolate_rec domain-containing protein
Genome locationchr8:31850713..31859312
RNA-Seq ExpressionMoc08g41600
SyntenyMoc08g41600
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008406 - Dormancy/auxin associated protein
IPR018143 - Folate receptor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1204414.1 hypothetical protein CJ030_MR8G028469 [Morella rubra]1.8e-13563.83Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGRESFRFRR
        MGLLD LWDDTVAGPRPE+GLGKLRKHSTF  R  SGKE D GNARSY D++SE  MR+TRSIMIVKPPGYQ  SPPISPAGS  P SPFSG  +     
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGRESFRFRR

Query:  RSTSDGYEKATEGGPGSSSSPHNIYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLAS
                                                  VC+SQGGRFPPFS EGKPPKKV+K  +DLTLCRVFR+ TCC VAQTHPALLSVRRLAS
Subjt:  RSTSDGYEKATEGGPGSSSSPHNIYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLAS

Query:  TGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDE-----
        TGE +QECL LWELLECSICDP+VGVQPGPPLIC SFCDRV+ ACS+AYFS+DAK QVLAPCGVNDFVCGRAS+WV NGTELC  AGF+V  SDE     
Subjt:  TGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDE-----

Query:  -ETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAK-TSQNS
         +TSCYG K  LDS+A SW+ S S V  R    LGVL+DFQQWVR M   E+VSW +G MV+TAGL F SKRKSH+ RQK AAIQR  +K+E K  +Q  
Subjt:  -ETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAK-TSQNS

Query:  IGTQAIRRGSRR
          +Q I   SRR
Subjt:  IGTQAIRRGSRR

KAG6586243.1 Dormancy-associated protein-like 3, partial [Cucurbita argyrosperma subsp. sororia]2.5e-14868.47Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSG-KELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGRES----
        MGLLD LWDDTVAGPRP+SGLGKLRKHSTF GR+SSG KELDGG+ARSYG+ESS+S +RITRSIMI++PPGYQ  SPPISPAGS+SPASPFSG  S    
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSG-KELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGRES----

Query:  ----------------------------------FRFRRRST-SDGYEKATEGGPGSSSSPHNI---YMLVNFCLGFEL-SGESNDVCISQGGRFPPFSM
                                          F +  +ST S   ++ +       +   NI   +M++NF L     SGESNDVC+S+GGRF PF+M
Subjt:  ----------------------------------FRFRRRST-SDGYEKATEGGPGSSSSPHNI---YMLVNFCLGFEL-SGESNDVCISQGGRFPPFSM

Query:  EGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQ
        EGKPPKKV+KAQDLTLCRVFRKKTCCGVAQT+PALLSVRRLASTGEGNQECLQLWELLECSICDP VG+QPGPPLIC SFCDRVF ACSDAYFSVDAKTQ
Subjt:  EGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQ

Query:  VLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVIT
        VLAPCGVNDFVCGRAS+WVSNGT+LCS AGFSVK+ ++E+SCYGSK RL+S+A+SWK+S S V S+ TG+LGVLEDFQQWV+EMS  EQVSWLI SMV++
Subjt:  VLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVIT

Query:  AGLLFS
        AGLL +
Subjt:  AGLLFS

XP_022150833.1 uncharacterized protein LOC111018882 isoform X1 [Momordica charantia]9.6e-15397.16Show/hide
Query:  IYMLVNFCLGFE-LSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ
        ++ML+NF L    LSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ
Subjt:  IYMLVNFCLGFE-LSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ

Query:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSR
        VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSR
Subjt:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSR

Query:  GTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        GTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
Subjt:  GTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

XP_022150835.1 uncharacterized protein LOC111018882 isoform X3 [Momordica charantia]2.3e-138100Show/hide
Query:  MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT
        MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT
Subjt:  MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT

Query:  QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI
        QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI
Subjt:  QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI

Query:  TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
Subjt:  TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

XP_038877718.1 uncharacterized protein LOC120069948 [Benincasa hispida]1.4e-13584.81Show/hide
Query:  NIYMLVNF-CLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDP
        +++ML+NF  L    SGESNDVCISQGGRFPPFS+EGKPPKKV+KAQDLTLCRVFRK+TCCGVAQTHPALLS+R+LASTGE NQ+CLQLWELLECSICDP
Subjt:  NIYMLVNF-CLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDP

Query:  QVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSS
        QVGVQPGPPLICASFCDRVFKACS AYFSVDAKTQVLAPCGVNDFVCGRAS+WVSNGTELCS AGFSVK+S+EETSCYGSK RLDSIA+SWKTS S VSS
Subjt:  QVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSS

Query:  RGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        + TGY+G+LEDFQQWV+EMSF E+VSWLIGSMV++AGLLF+SKR+SH+QRQKYAAIQRATKK+EA  SQNSIGTQ IR+GSRR
Subjt:  RGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

TrEMBL top hitse value%identityAlignment
A0A0A0LIV4 Folate_rec domain-containing protein4.1e-13382.33Show/hide
Query:  NIYMLVNFCLGFEL-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDP
        +++ML+NF L F   SGESNDVCIS+GGRF PFS+EGKPP KV+K QDLTLCRVFRK+TCCGVAQTHPALLSVRRLASTGE N ECLQLWELLECSICDP
Subjt:  NIYMLVNFCLGFEL-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDP

Query:  QVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSS
        QVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRAS+WVSNGT+LC+ AGF++K+SDEE+SCYGSK RLDSIA+SWKTS S +SS
Subjt:  QVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSS

Query:  RGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        + TGYLG+LEDFQQWV+EMSF EQVSWLIGSMV++AGLLF+SKR+SH+QRQKYAAIQRATKK+E   +QNS+ TQ IR+GSRR
Subjt:  RGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

A0A1S3BYL2 uncharacterized protein LOC1034946071.8e-13383.69Show/hide
Query:  IYMLVNFCLGFEL-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ
        ++ML+NF L     SGESNDVCISQGGRF PFS EGKPP KV+KAQDLTLCRVFRK+TCCGVAQTHPALLSVR+LAS GE N ECLQLWELLECSICDPQ
Subjt:  IYMLVNFCLGFEL-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ

Query:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSR
        VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRAS+WVSNGT+LC+ AGFSVK+SDEETSCYGSK RLDSIA+SWKTS S VSS+
Subjt:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSR

Query:  GTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
         TGYLG+LEDFQQWV+EMSF EQVSWLIGSMV++AGLLF+SKR+SH+QRQKYAAI RATK++EA  +QNS+GTQ IR+GSRR
Subjt:  GTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

A0A6A1UVF4 Folate_rec domain-containing protein8.8e-13663.83Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGRESFRFRR
        MGLLD LWDDTVAGPRPE+GLGKLRKHSTF  R  SGKE D GNARSY D++SE  MR+TRSIMIVKPPGYQ  SPPISPAGS  P SPFSG  +     
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGRESFRFRR

Query:  RSTSDGYEKATEGGPGSSSSPHNIYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLAS
                                                  VC+SQGGRFPPFS EGKPPKKV+K  +DLTLCRVFR+ TCC VAQTHPALLSVRRLAS
Subjt:  RSTSDGYEKATEGGPGSSSSPHNIYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLAS

Query:  TGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDE-----
        TGE +QECL LWELLECSICDP+VGVQPGPPLIC SFCDRV+ ACS+AYFS+DAK QVLAPCGVNDFVCGRAS+WV NGTELC  AGF+V  SDE     
Subjt:  TGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDE-----

Query:  -ETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAK-TSQNS
         +TSCYG K  LDS+A SW+ S S V  R    LGVL+DFQQWVR M   E+VSW +G MV+TAGL F SKRKSH+ RQK AAIQR  +K+E K  +Q  
Subjt:  -ETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAK-TSQNS

Query:  IGTQAIRRGSRR
          +Q I   SRR
Subjt:  IGTQAIRRGSRR

A0A6J1D9K7 uncharacterized protein LOC111018882 isoform X31.1e-138100Show/hide
Query:  MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT
        MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT
Subjt:  MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT

Query:  QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI
        QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI
Subjt:  QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI

Query:  TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
Subjt:  TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

A0A6J1DAH8 uncharacterized protein LOC111018882 isoform X14.7e-15397.16Show/hide
Query:  IYMLVNFCLGFE-LSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ
        ++ML+NF L    LSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ
Subjt:  IYMLVNFCLGFE-LSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ

Query:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSR
        VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSR
Subjt:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSR

Query:  GTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        GTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
Subjt:  GTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

SwissProt top hitse value%identityAlignment
F4HV65 Dormancy-associated protein homolog 41.8e-0537.11Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPG-----YQFSSPPISP-AGSNSPASPFS
        MG L  LWD+TVAGP P++GLGKLRKH +     SS   L              S  ++TRSIM+ K         +    P SP   S++P +P +
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPG-----YQFSSPPISP-AGSNSPASPFS

Q8LD26 Dormancy-associated protein homolog 31.7e-3564.39Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSGK-ELDGGNARSYGDES-SESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFS-------
        MGLLDHLWDDTVAGPRPE+GLGKLRKH TF  R SSG  + + G+ARSYG++S  E  +++TRSIMI+KPPGYQ SS P SPAGS  P SPFS       
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSGK-ELDGGNARSYGDES-SESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFS-------

Query:  -GRESFRFRRRSTSDGYEKATEGG-PGSSSSP
         G+E FRFRRRSTSD +EKA  G   G  SSP
Subjt:  -GRESFRFRRRSTSDGYEKATEGG-PGSSSSP

Arabidopsis top hitse value%identityAlignment
AT5G27830.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).1.9e-7448.41Show/hide
Query:  IYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ
        +++L++   G  +  +   VC+S+GGRFPP+ +EGKPPK V + ++DLTLCRVFRKKTCC   QT+PA ++VR LA+ GE +QECL+L+ELLECSIC+P 
Subjt:  IYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ

Query:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWKTSSSPV
        VG+QPGPP ICASFCDRVF+AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSK  L+S+  SW   S   
Subjt:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWKTSSSPV

Query:  SSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
        +   T  L   +D  QWVREM+  +++S  +G   + AG+    +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  SSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS

AT5G27830.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143); Has 58 Blast hits to 58 proteins in 23 species: Archae - 0; Bacteria - 0; Metazoa - 8; Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes - 13 (source: NCBI BLink).5.1e-5942.2Show/hide
Query:  IYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQV
        +++L++   G  +  +   VC+S+GGRFPP+ +EGKPPK V                            +VR LA+ GE +QECL+L+ELLECSIC+P V
Subjt:  IYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQV

Query:  GVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWKTSSSPVS
        G+QPGPP ICASFCDRVF+AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSK  L+S+  SW   S   +
Subjt:  GVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWKTSSSPVS

Query:  SRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
           T  L   +D  QWVREM+  +++S  +G   + AG+    +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  SRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS

AT5G27830.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143); Has 79 Blast hits to 79 proteins in 35 species: Archae - 0; Bacteria - 0; Metazoa - 18; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 22 (source: NCBI BLink).7.3e-7451.14Show/hide
Query:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF
        VC+S+GGRFPP+ +EGKPPK V + ++DLTLCRVFRKKTCC   QT+PA ++VR LA+ GE +QECL+L+ELLECSIC+P VG+QPGPP ICASFCDRVF
Subjt:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF

Query:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR
        +AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSK  L+S+  SW   S   +   T  L   +D  QWVR
Subjt:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR

Query:  EMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
        EM+  +++S  +G   + AG+    +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  EMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS

AT5G27830.4 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143).6.2e-7347.95Show/hide
Query:  SSSPHNIYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLEC
        SSSP     +V       +  +   VC+S+GGRFPP+ +EGKPPK V + ++DLTLCRVFRKKTCC   QT+PA ++VR LA+ GE +QECL+L+ELLEC
Subjt:  SSSPHNIYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLEC

Query:  SICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWK
        SIC+P VG+QPGPP ICASFCDRVF+AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSK  L+S+  SW 
Subjt:  SICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWK

Query:  TSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLL---FSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
          S   +   T  L   +D  QWVREM+  +++S  +G   + AG+    +  +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  TSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLL---FSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS

AT5G27830.5 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143).3.6e-7347.9Show/hide
Query:  IYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ
        +++L++   G  +  +   VC+S+GGRFPP+ +EGKPPK V + ++DLTLCRVFRKKTCC   QT+PA ++VR LA+ GE +QECL+L+ELLECSIC+P 
Subjt:  IYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQ

Query:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWKTSSSPV
        VG+QPGPP ICASFCDRVF+AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSK  L+S+  SW   S   
Subjt:  VGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKTRLDSIASSWKTSSSPV

Query:  SSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLL---FSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
        +   T  L   +D  QWVREM+  +++S  +G   + AG+    +  +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  SSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLL---FSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTTACTCGACCATCTCTGGGACGATACCGTCGCCGGACCGAGGCCGGAAAGTGGCCTCGGCAAACTTCGCAAACACTCCACGTTTCCCGGTCGATCTAGCTCCGG
CAAGGAACTGGATGGCGGGAACGCGAGATCGTACGGCGACGAATCTTCAGAGTCGCCGATGAGGATCACGCGGAGTATTATGATCGTAAAACCTCCAGGTTACCAGTTCA
GTTCGCCTCCCATTTCACCGGCCGGATCCAATTCTCCGGCGTCTCCCTTTTCTGGTAGAGAATCCTTCCGGTTTCGAAGAAGGTCGACATCAGATGGTTACGAGAAGGCA
ACCGAAGGTGGACCCGGGAGCTCTTCTTCTCCTCACAACATCTATATGCTTGTGAATTTCTGTCTCGGGTTCGAACTTTCTGGTGAGTCAAATGATGTCTGCATTTCGCA
AGGTGGTCGTTTCCCACCCTTCTCAATGGAAGGAAAACCTCCAAAAAAGGTTAACAAAGCACAAGATTTGACGCTTTGCCGAGTCTTCCGAAAGAAGACTTGCTGTGGTG
TAGCCCAAACGCACCCGGCTTTGCTTTCTGTTAGGAGGCTGGCTTCAACAGGGGAAGGCAACCAGGAATGCTTGCAATTATGGGAACTTCTGGAGTGCTCGATCTGTGAT
CCACAAGTTGGTGTTCAACCTGGACCTCCTCTAATATGTGCCTCTTTCTGTGACAGAGTCTTCAAAGCTTGCTCTGATGCTTACTTCTCTGTCGATGCTAAAACACAGGT
TCTAGCACCATGCGGAGTGAATGACTTTGTGTGTGGCAGGGCTTCTCAATGGGTCTCGAATGGCACGGAGCTTTGCAGTACTGCAGGTTTTTCAGTTAAGATGTCAGATG
AAGAAACCTCTTGTTATGGTAGTAAAACTAGACTAGACTCCATTGCTAGTTCGTGGAAGACTTCATCATCTCCAGTGTCGTCCCGAGGAACAGGGTACTTGGGTGTTTTA
GAGGATTTCCAGCAATGGGTAAGAGAAATGAGTTTCGGGGAACAGGTTTCTTGGCTGATTGGTTCCATGGTGATTACGGCAGGCCTTCTATTTTCCAGCAAAAGGAAAAG
TCATAACCAGCGCCAGAAATACGCTGCTATTCAACGAGCAACAAAGAAAATGGAAGCAAAGACGAGCCAGAATTCAATTGGTACTCAAGCAATCAGGAGAGGAAGTAGAA
GATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTTACTCGACCATCTCTGGGACGATACCGTCGCCGGACCGAGGCCGGAAAGTGGCCTCGGCAAACTTCGCAAACACTCCACGTTTCCCGGTCGATCTAGCTCCGG
CAAGGAACTGGATGGCGGGAACGCGAGATCGTACGGCGACGAATCTTCAGAGTCGCCGATGAGGATCACGCGGAGTATTATGATCGTAAAACCTCCAGGTTACCAGTTCA
GTTCGCCTCCCATTTCACCGGCCGGATCCAATTCTCCGGCGTCTCCCTTTTCTGGTAGAGAATCCTTCCGGTTTCGAAGAAGGTCGACATCAGATGGTTACGAGAAGGCA
ACCGAAGGTGGACCCGGGAGCTCTTCTTCTCCTCACAACATCTATATGCTTGTGAATTTCTGTCTCGGGTTCGAACTTTCTGGTGAGTCAAATGATGTCTGCATTTCGCA
AGGTGGTCGTTTCCCACCCTTCTCAATGGAAGGAAAACCTCCAAAAAAGGTTAACAAAGCACAAGATTTGACGCTTTGCCGAGTCTTCCGAAAGAAGACTTGCTGTGGTG
TAGCCCAAACGCACCCGGCTTTGCTTTCTGTTAGGAGGCTGGCTTCAACAGGGGAAGGCAACCAGGAATGCTTGCAATTATGGGAACTTCTGGAGTGCTCGATCTGTGAT
CCACAAGTTGGTGTTCAACCTGGACCTCCTCTAATATGTGCCTCTTTCTGTGACAGAGTCTTCAAAGCTTGCTCTGATGCTTACTTCTCTGTCGATGCTAAAACACAGGT
TCTAGCACCATGCGGAGTGAATGACTTTGTGTGTGGCAGGGCTTCTCAATGGGTCTCGAATGGCACGGAGCTTTGCAGTACTGCAGGTTTTTCAGTTAAGATGTCAGATG
AAGAAACCTCTTGTTATGGTAGTAAAACTAGACTAGACTCCATTGCTAGTTCGTGGAAGACTTCATCATCTCCAGTGTCGTCCCGAGGAACAGGGTACTTGGGTGTTTTA
GAGGATTTCCAGCAATGGGTAAGAGAAATGAGTTTCGGGGAACAGGTTTCTTGGCTGATTGGTTCCATGGTGATTACGGCAGGCCTTCTATTTTCCAGCAAAAGGAAAAG
TCATAACCAGCGCCAGAAATACGCTGCTATTCAACGAGCAACAAAGAAAATGGAAGCAAAGACGAGCCAGAATTCAATTGGTACTCAAGCAATCAGGAGAGGAAGTAGAA
GATGA
Protein sequenceShow/hide protein sequence
MGLLDHLWDDTVAGPRPESGLGKLRKHSTFPGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGRESFRFRRRSTSDGYEKA
TEGGPGSSSSPHNIYMLVNFCLGFELSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICD
PQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKTRLDSIASSWKTSSSPVSSRGTGYLGVL
EDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR