; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS007294 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS007294
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionFolate_rec domain-containing protein
Genome locationscaffold25:2067784..2076167
RNA-Seq ExpressionMS007294
SyntenyMS007294
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008406 - Dormancy/auxin associated protein
IPR018143 - Folate receptor-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1204414.1 hypothetical protein CJ030_MR8G028469 [Morella rubra]2.6e-14172.4Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGESNDVCIS
        MGLLD LWDDTVAGPRPE+GLGKLRKHSTF  R  SGKE D GNARSY D++SE  MR+TRSIMIVKPPGYQ  SPPISPAGS  P SPFSGE   VC+S
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGESNDVCIS

Query:  QGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACS
        QGGRFPPFS EGKPPKKV+K  +DLTLCRVFR+ TCC VAQTHPALLSVRRLASTGE +QECL LWELLECSICDP+VGVQPGPPLIC SFCDRV+ ACS
Subjt:  QGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACS

Query:  DAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDE------ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVRE
        +AYFS+DAK QVLAPCGVNDFVCGRAS+WV NGTELC  AGF+V  SDE      +TSCYG KA LDS+A SW+ S S V  R    LGVL+DFQQWVR 
Subjt:  DAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDE------ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVRE

Query:  MSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAK-TSQNSIGTQAIRRGSRR
        M   E+VSW +G MV+TAGL F SKRKSH+ RQK AAIQR  +K+E K  +Q    +Q I   SRR
Subjt:  MSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAK-TSQNSIGTQAIRRGSRR

KAG6586243.1 Dormancy-associated protein-like 3, partial [Cucurbita argyrosperma subsp. sororia]5.2e-14265.52Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSG-KELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPF---------
        MGLLD LWDDTVAGPRP+SGLGKLRKHSTF GR+SSG KELDGG+ARSYG+ESS+S +RITRSIMI++PPGYQ  SPPISPAGS+SPASPF         
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSG-KELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPF---------

Query:  --------------------------------------------------------------------------------SGESNDVCISQGGRFPPFSM
                                                                                        SGESNDVC+S+GGRF PF+M
Subjt:  --------------------------------------------------------------------------------SGESNDVCISQGGRFPPFSM

Query:  EGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQ
        EGKPPKKV+KAQDLTLCRVFRKKTCCGVAQT+PALLSVRRLASTGEGNQECLQLWELLECSICDP VG+QPGPPLIC SFCDRVF ACSDAYFSVDAKTQ
Subjt:  EGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQ

Query:  VLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVIT
        VLAPCGVNDFVCGRAS+WVSNGT+LCS AGFSVK+ ++E+SCYGSKARL+S+A+SWK+S S V S+ TG+LGVLEDFQQWV+EMS  EQVSWLI SMV++
Subjt:  VLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVIT

Query:  AGLLFS
        AGLL +
Subjt:  AGLLFS

XP_022150833.1 uncharacterized protein LOC111018882 isoform X1 [Momordica charantia]8.8e-15099.26Show/hide
Query:  PF-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC
        PF SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC
Subjt:  PF-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC

Query:  ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDF
        ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSK RLDSIASSWKTSSSPVSSRGTGYLGVLEDF
Subjt:  ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDF

Query:  QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
Subjt:  QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

XP_022150835.1 uncharacterized protein LOC111018882 isoform X3 [Momordica charantia]7.8e-13899.6Show/hide
Query:  MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT
        MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT
Subjt:  MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT

Query:  QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI
        QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSK RLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI
Subjt:  QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI

Query:  TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
Subjt:  TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

XP_038877718.1 uncharacterized protein LOC120069948 [Benincasa hispida]3.1e-13487.45Show/hide
Query:  SPFSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC
        S  SGESNDVCISQGGRFPPFS+EGKPPKKV+KAQDLTLCRVFRK+TCCGVAQTHPALLS+R+LASTGE NQ+CLQLWELLECSICDPQVGVQPGPPLIC
Subjt:  SPFSGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC

Query:  ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDF
        ASFCDRVFKACS AYFSVDAKTQVLAPCGVNDFVCGRAS+WVSNGTELCS AGFSVK+S+EETSCYGSKARLDSIA+SWKTS S VSS+ TGY+G+LEDF
Subjt:  ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDF

Query:  QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        QQWV+EMSF E+VSWLIGSMV++AGLLF+SKR+SH+QRQKYAAIQRATKK+EA  SQNSIGTQ IR+GSRR
Subjt:  QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

TrEMBL top hitse value%identityAlignment
A0A1S3BYL2 uncharacterized protein LOC1034946075.3e-13286.35Show/hide
Query:  PF-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC
        PF SGESNDVCISQGGRF PFS EGKPP KV+KAQDLTLCRVFRK+TCCGVAQTHPALLSVR+LAS GE N ECLQLWELLECSICDPQVGVQPGPPLIC
Subjt:  PF-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC

Query:  ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDF
        ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRAS+WVSNGT+LC+ AGFSVK+SDEETSCYGSKARLDSIA+SWKTS S VSS+ TGYLG+LEDF
Subjt:  ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDF

Query:  QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        QQWV+EMSF EQVSWLIGSMV++AGLLF+SKR+SH+QRQKYAAI RATK++EA  +QNS+GTQ IR+GSRR
Subjt:  QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

A0A2K2AAM1 Folate_rec domain-containing protein1.4e-13265.95Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGKELDGGNA-RSYGDESSE-SPMRITRSIMIVKPPGY---QFSSPPISPAGSNSPASPFSGESN
        M LLD LWDDTVAGP PESGLGKLRK  +   R + GKE  GG   RS+ +E++     R+TRSIMIV+PPGY     ++PP SPAGS  P SPF G++ 
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGKELDGGNA-RSYGDESSE-SPMRITRSIMIVKPPGY---QFSSPPISPAGSNSPASPFSGESN

Query:  DVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRV
         VC+S+GGRFPP++ EGKPPKKV+K A+DLTLCRVFRKKTCC VAQT+PALLSVRRLASTGE +QECLQLWELLECSICDPQ+GVQPGPPLICASFCDRV
Subjt:  DVCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRV

Query:  FKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSD------EETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQ
        ++AC++AYFS+DA  +V+APCGVNDFVCG+A++WVSNGTELC  AG++VK+SD      EE SCYG +A LDSIA SW++S S    +    L VLEDFQ
Subjt:  FKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSD------EETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQ

Query:  QWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        QWV+EM F E++SW +G +V+TAGLLF SKRKSH QRQK AAIQRA ++++ KTSQNS  +   R+G+RR
Subjt:  QWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

A0A6A1UVF4 Folate_rec domain-containing protein1.2e-14172.4Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGESNDVCIS
        MGLLD LWDDTVAGPRPE+GLGKLRKHSTF  R  SGKE D GNARSY D++SE  MR+TRSIMIVKPPGYQ  SPPISPAGS  P SPFSGE   VC+S
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGESNDVCIS

Query:  QGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACS
        QGGRFPPFS EGKPPKKV+K  +DLTLCRVFR+ TCC VAQTHPALLSVRRLASTGE +QECL LWELLECSICDP+VGVQPGPPLIC SFCDRV+ ACS
Subjt:  QGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACS

Query:  DAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDE------ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVRE
        +AYFS+DAK QVLAPCGVNDFVCGRAS+WV NGTELC  AGF+V  SDE      +TSCYG KA LDS+A SW+ S S V  R    LGVL+DFQQWVR 
Subjt:  DAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDE------ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVRE

Query:  MSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAK-TSQNSIGTQAIRRGSRR
        M   E+VSW +G MV+TAGL F SKRKSH+ RQK AAIQR  +K+E K  +Q    +Q I   SRR
Subjt:  MSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAK-TSQNSIGTQAIRRGSRR

A0A6J1D9K7 uncharacterized protein LOC111018882 isoform X33.8e-13899.6Show/hide
Query:  MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT
        MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT
Subjt:  MEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKT

Query:  QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI
        QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSK RLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI
Subjt:  QVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVI

Query:  TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
Subjt:  TAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

A0A6J1DAH8 uncharacterized protein LOC111018882 isoform X14.3e-15099.26Show/hide
Query:  PF-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC
        PF SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC
Subjt:  PF-SGESNDVCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLIC

Query:  ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDF
        ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSK RLDSIASSWKTSSSPVSSRGTGYLGVLEDF
Subjt:  ASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDFVCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDF

Query:  QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
        QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR
Subjt:  QQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGSRR

SwissProt top hitse value%identityAlignment
F4HV65 Dormancy-associated protein homolog 41.3e-0533.59Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPG-----YQFSSPPISP-AGSNSPASPFSGES
        MG L  LWD+TVAGP P++GLGKLRKH +     SS   L              S  ++TRSIM+ K         +    P SP   S++P +P +   
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPG-----YQFSSPPISP-AGSNSPASPFSGES

Query:  NDVCISQGGRFPPFSMEGKPPKKVNKAQDLT
           C + G    PF+    P    + A  LT
Subjt:  NDVCISQGGRFPPFSMEGKPPKKVNKAQDLT

Q8LD26 Dormancy-associated protein homolog 36.8e-2869.89Show/hide
Query:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGK-ELDGGNARSYGDES-SESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFS
        MGLLDHLWDDTVAGPRPE+GLGKLRKH TF  R SSG  + + G+ARSYG++S  E  +++TRSIMI+KPPGYQ SS P SPAGS  P SPFS
Subjt:  MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGK-ELDGGNARSYGDES-SESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFS

Arabidopsis top hitse value%identityAlignment
AT5G27830.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).2.2e-7451.52Show/hide
Query:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF
        VC+S+GGRFPP+ +EGKPPK V + ++DLTLCRVFRKKTCC   QT+PA ++VR LA+ GE +QECL+L+ELLECSIC+P VG+QPGPP ICASFCDRVF
Subjt:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF

Query:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR
        +AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSKA L+S+  SW   S   +   T  L   +D  QWVR
Subjt:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR

Query:  EMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
        EM+  +++S  +G   + AG+    +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  EMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS

AT5G27830.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143); Has 58 Blast hits to 58 proteins in 23 species: Archae - 0; Bacteria - 0; Metazoa - 8; Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes - 13 (source: NCBI BLink).7.7e-5944.87Show/hide
Query:  VCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFK
        VC+S+GGRFPP+ +EGKPPK V                            +VR LA+ GE +QECL+L+ELLECSIC+P VG+QPGPP ICASFCDRVF+
Subjt:  VCISQGGRFPPFSMEGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFK

Query:  ACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVRE
        AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSKA L+S+  SW   S   +   T  L   +D  QWVRE
Subjt:  ACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVRE

Query:  MSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
        M+  +++S  +G   + AG+    +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  MSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS

AT5G27830.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143); Has 79 Blast hits to 79 proteins in 35 species: Archae - 0; Bacteria - 0; Metazoa - 18; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 22 (source: NCBI BLink).2.2e-7451.52Show/hide
Query:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF
        VC+S+GGRFPP+ +EGKPPK V + ++DLTLCRVFRKKTCC   QT+PA ++VR LA+ GE +QECL+L+ELLECSIC+P VG+QPGPP ICASFCDRVF
Subjt:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF

Query:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR
        +AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSKA L+S+  SW   S   +   T  L   +D  QWVR
Subjt:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR

Query:  EMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
        EM+  +++S  +G   + AG+    +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  EMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS

AT5G27830.4 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143).4.2e-7350.94Show/hide
Query:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF
        VC+S+GGRFPP+ +EGKPPK V + ++DLTLCRVFRKKTCC   QT+PA ++VR LA+ GE +QECL+L+ELLECSIC+P VG+QPGPP ICASFCDRVF
Subjt:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF

Query:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR
        +AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSKA L+S+  SW   S   +   T  L   +D  QWVR
Subjt:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR

Query:  EMSFGEQVSWLIGSMVITAGLL---FSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
        EM+  +++S  +G   + AG+    +  +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  EMSFGEQVSWLIGSMVITAGLL---FSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS

AT5G27830.5 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate receptor, conserved region (InterPro:IPR018143).4.2e-7350.94Show/hide
Query:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF
        VC+S+GGRFPP+ +EGKPPK V + ++DLTLCRVFRKKTCC   QT+PA ++VR LA+ GE +QECL+L+ELLECSIC+P VG+QPGPP ICASFCDRVF
Subjt:  VCISQGGRFPPFSMEGKPPKKVNK-AQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVF

Query:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR
        +AC DAYF+ +A  +V+ PCGVN D +C +AS W SNGT  C  AGF+V+ +D+  E  CYGSKA L+S+  SW   S   +   T  L   +D  QWVR
Subjt:  KACSDAYFSVDAKTQVLAPCGVN-DFVCGRASQWVSNGTELCSTAGFSVKMSDE--ETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVR

Query:  EMSFGEQVSWLIGSMVITAGLL---FSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS
        EM+  +++S  +G   + AG+    +  +  + NQ+Q+ AAIQR  +++    + +S      RR S
Subjt:  EMSFGEQVSWLIGSMVITAGLL---FSSKRKSHNQRQKYAAIQRATKKMEAKTSQNSIGTQAIRRGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTTACTCGACCATCTCTGGGACGATACCGTCGCCGGACCGAGGCCGGAAAGTGGCCTCGGCAAACTTCGCAAACACTCCACGTTTTTCGGTCGATCTAGCTCCGG
CAAGGAACTGGATGGCGGGAACGCGAGATCGTACGGCGACGAATCTTCAGAGTCGCCGATGAGGATCACGCGGAGTATTATGATCGTAAAACCTCCAGGTTACCAGTTCA
GTTCGCCTCCCATTTCACCGGCCGGATCCAATTCTCCGGCGTCTCCCTTTTCTGGTGAGTCAAATGATGTCTGCATTTCGCAAGGTGGTCGTTTCCCACCCTTCTCAATG
GAAGGAAAACCTCCAAAAAAGGTTAACAAAGCACAAGATTTGACGCTTTGCCGAGTCTTCCGAAAGAAGACTTGCTGTGGTGTAGCCCAAACGCACCCGGCTTTGCTTTC
TGTTAGGAGGCTGGCTTCAACAGGGGAAGGCAACCAGGAATGCTTGCAATTATGGGAACTTCTGGAGTGCTCGATCTGTGATCCACAAGTTGGTGTTCAACCTGGACCTC
CTCTAATATGTGCCTCTTTCTGTGACAGAGTCTTCAAAGCTTGCTCTGATGCTTACTTCTCTGTCGATGCTAAAACACAGGTTCTAGCACCATGCGGAGTGAATGACTTT
GTGTGTGGCAGGGCTTCTCAATGGGTCTCGAATGGCACGGAGCTTTGCAGTACTGCAGGTTTTTCAGTTAAGATGTCAGATGAAGAAACCTCTTGTTATGGTAGTAAAGC
TAGACTAGACTCCATTGCTAGTTCGTGGAAGACTTCATCATCTCCAGTGTCGTCCCGAGGAACAGGGTACTTGGGTGTTTTAGAGGATTTCCAGCAATGGGTAAGAGAAA
TGAGTTTCGGGGAACAGGTTTCTTGGCTGATTGGTTCCATGGTGATTACGGCAGGCCTTCTATTTTCCAGCAAAAGGAAAAGTCATAACCAGCGCCAGAAATACGCTGCT
ATTCAACGAGCAACGAAGAAAATGGAAGCAAAGACGAGCCAGAATTCAATTGGTACTCAAGCAATCAGGAGAGGAAGTAGAAGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTTACTCGACCATCTCTGGGACGATACCGTCGCCGGACCGAGGCCGGAAAGTGGCCTCGGCAAACTTCGCAAACACTCCACGTTTTTCGGTCGATCTAGCTCCGG
CAAGGAACTGGATGGCGGGAACGCGAGATCGTACGGCGACGAATCTTCAGAGTCGCCGATGAGGATCACGCGGAGTATTATGATCGTAAAACCTCCAGGTTACCAGTTCA
GTTCGCCTCCCATTTCACCGGCCGGATCCAATTCTCCGGCGTCTCCCTTTTCTGGTGAGTCAAATGATGTCTGCATTTCGCAAGGTGGTCGTTTCCCACCCTTCTCAATG
GAAGGAAAACCTCCAAAAAAGGTTAACAAAGCACAAGATTTGACGCTTTGCCGAGTCTTCCGAAAGAAGACTTGCTGTGGTGTAGCCCAAACGCACCCGGCTTTGCTTTC
TGTTAGGAGGCTGGCTTCAACAGGGGAAGGCAACCAGGAATGCTTGCAATTATGGGAACTTCTGGAGTGCTCGATCTGTGATCCACAAGTTGGTGTTCAACCTGGACCTC
CTCTAATATGTGCCTCTTTCTGTGACAGAGTCTTCAAAGCTTGCTCTGATGCTTACTTCTCTGTCGATGCTAAAACACAGGTTCTAGCACCATGCGGAGTGAATGACTTT
GTGTGTGGCAGGGCTTCTCAATGGGTCTCGAATGGCACGGAGCTTTGCAGTACTGCAGGTTTTTCAGTTAAGATGTCAGATGAAGAAACCTCTTGTTATGGTAGTAAAGC
TAGACTAGACTCCATTGCTAGTTCGTGGAAGACTTCATCATCTCCAGTGTCGTCCCGAGGAACAGGGTACTTGGGTGTTTTAGAGGATTTCCAGCAATGGGTAAGAGAAA
TGAGTTTCGGGGAACAGGTTTCTTGGCTGATTGGTTCCATGGTGATTACGGCAGGCCTTCTATTTTCCAGCAAAAGGAAAAGTCATAACCAGCGCCAGAAATACGCTGCT
ATTCAACGAGCAACGAAGAAAATGGAAGCAAAGACGAGCCAGAATTCAATTGGTACTCAAGCAATCAGGAGAGGAAGTAGAAGA
Protein sequenceShow/hide protein sequence
MGLLDHLWDDTVAGPRPESGLGKLRKHSTFFGRSSSGKELDGGNARSYGDESSESPMRITRSIMIVKPPGYQFSSPPISPAGSNSPASPFSGESNDVCISQGGRFPPFSM
EGKPPKKVNKAQDLTLCRVFRKKTCCGVAQTHPALLSVRRLASTGEGNQECLQLWELLECSICDPQVGVQPGPPLICASFCDRVFKACSDAYFSVDAKTQVLAPCGVNDF
VCGRASQWVSNGTELCSTAGFSVKMSDEETSCYGSKARLDSIASSWKTSSSPVSSRGTGYLGVLEDFQQWVREMSFGEQVSWLIGSMVITAGLLFSSKRKSHNQRQKYAA
IQRATKKMEAKTSQNSIGTQAIRRGSRR