; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025697 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025697
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNuclear factor 1 A-type isoform 2
Genome locationtig00152936:1979031..1980518
RNA-Seq ExpressionSgr025697
SyntenySgr025697
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039981.1 Nuclear factor 1 A-type isoform 2 [Cucumis melo var. makuwa]4.9e-20886.32Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLRIPG +LNSTKPG+NAFSSP SCEIRLRGFP+QTSS+PLLPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSK ENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSG  DGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIFMALA
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

XP_004153015.1 uncharacterized protein LOC101204096 [Cucumis sativus]1.9e-20785.61Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLRIPG +LNSTKPG+NAFSSP SCEIRLRGFP+QTSS+PL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSK ENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSG  DGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIFMALA
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

XP_008460086.1 PREDICTED: uncharacterized protein LOC103499002 [Cucumis melo]1.1e-20786.08Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLRIPG +LNSTKPG+NAFSSP SCEIRLRGFP+QTSS+PL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSK ENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSG  DGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIFMALA
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

XP_022155757.1 uncharacterized protein LOC111022801 [Momordica charantia]1.2e-20986.56Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLR+PG +LNSTKPG+N+FSSP SCEIRLRGFPVQ+SS+P+LPSPEAIP+SH+IASSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSK ENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSGSADG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

XP_038874344.1 uncharacterized protein LOC120067041 [Benincasa hispida]3.6e-21187.26Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLRIPG +LNSTKPG+NAFSSP SCEIRLRGFPVQTSS+PL+PSPEAIPDSH+I SSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSK ENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSG  DGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIFMALA
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

TrEMBL top hitse value%identityAlignment
A0A0A0K922 Uncharacterized protein9.0e-20885.61Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLRIPG +LNSTKPG+NAFSSP SCEIRLRGFP+QTSS+PL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSK ENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQ+VQL+GSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSG  DGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIFMALA
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

A0A1S3CCZ5 uncharacterized protein LOC1034990025.3e-20886.08Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLRIPG +LNSTKPG+NAFSSP SCEIRLRGFP+QTSS+PL+PSPEAIPDSH IASSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSK ENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSG  DGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIFMALA
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

A0A5A7TE92 Nuclear factor 1 A-type isoform 22.4e-20886.32Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLRIPG +LNSTKPG+NAFSSP SCEIRLRGFP+QTSS+PLLPSPEAIPDSH IASSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDV PEWGDGKPVILFNGWIGIGKSK ENGRHGAELHLRVKLDPDPRYVFQF+DVTR SPQIVQLQGSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSG  DGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAA SPIPSPQSSGDFAALGQVVGGFVMSCRVQGEG+SSKP VQLAMRHVTC+EDAAIFMALA
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

A0A6J1DQ78 uncharacterized protein LOC1110228015.6e-21086.56Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLR+PG +LNSTKPG+N+FSSP SCEIRLRGFPVQ+SS+P+LPSPEAIP+SH+IASSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSK ENGRHGAELHL+VKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSGSADG DLE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARS+PGSWLIVRPDVCIPESWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

A0A6J1HM61 uncharacterized protein LOC1114642382.2e-20684.91Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        AFIRLSIG LGLRIPG A+NSTKPG++A SSP SCEIRLRGFPVQTSS+P++ SPEAIPDSH+IASSFYLEESDLKALLAPGCFYNT ACLEISVFSGRK
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV
        GSHCGV IKRQLIGTFKLDVGPEWGDGKPV+LFNGWIGIGKSK ENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL+GSIKQPIFSCKFSRDRV
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRDRV

Query:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------
        SQAD LSNYWSGS DGSD+E ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDV IP+SWQPWGKLEAWRE         
Subjt:  SQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWRE---------

Query:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA
          H     +  G L                IDTDKQLRAA SPIPSPQSSGDFAALGQV+GGFVMSCRVQGEGKSSKPMVQLAMRHVTC+EDAAIFMA+A
Subjt:  --HPYQRRERRGVL----------------IDTDKQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALA

Query:  AAVDLSIEACRPFRRKIRRGPRHS
        AAVDLSIEACRPFRRKIRR PRHS
Subjt:  AAVDLSIEACRPFRRKIRRGPRHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)2.4e-8038.91Show/hide
Query:  FIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLP-SPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK
        FIRL+IG L L++P AA  +T   ++  SSP  C+I+L+ FP QT+++P +P      P+   +A++F+L  SD++  LA    + ++ CL+I +++GR 
Subjt:  FIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLP-SPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSGRK

Query:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS----
        G+ CGV   R L+    + +       KP +  NGWI +GK   ++    A+ HL VK +PDPR+VFQF+     SPQ+VQ+QG+I+QP+F+CKFS    
Subjt:  GSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFS----

Query:  --RDRVSQADP----LSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWR
          R + S++ P    +S  W  S  GS+ E   +ERKGW + +HDLSGS VA A I TPFV S G D V+RSNPGSWLI+RP  C   +W+PWG+LEAWR
Subjt:  --RDRVSQADP----LSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWR

Query:  EH------------------------------------------------------PYQRRERRGVLIDTDKQLRAAASPIPSPQ-SSGDFAA---LGQV
        E                                                         + R RRG            ASP  SP+  SGD+        V
Subjt:  EH------------------------------------------------------PYQRRERRGVLIDTDKQLRAAASPIPSPQ-SSGDFAA---LGQV

Query:  VGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRR
          GFVMS  V+GEGK SKP V+++++HV+C+EDAA ++AL+AA+DLS++ACR F +++R+
Subjt:  VGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRR

AT1G50040.1 Protein of unknown function (DUF1005)8.7e-6234.57Show/hide
Query:  AFIRLSIGWLGLRIP------GAALNSTKPGMNAFSSPYS-CEIRLRGFPVQTSSVPLLPSPEAIPDSH-------NIASSFYLEESDLKALLAPGCFYN
        +F+R+ +G L +R P       ++ +S+ P ++  SS    C+I+ + FP Q  SVP+L   E+  +S         +A+ F L +S ++  L    +  
Subjt:  AFIRLSIGWLGLRIP------GAALNSTKPGMNAFSSPYS-CEIRLRGFPVQTSSVPLLPSPEAIPDSH-------NIASSFYLEESDLKALLAPGCFYN

Query:  TQACLEISVFSGRKGSHCG--VSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL
          + L + V+S R+ + CG   +   +LIG F++ +  +  + K  +  NGW+ +G   + N + G+  ELH+ V+++PD R+VFQF+     SPQ+ Q+
Subjt:  TQACLEISVFSGRKGSHCG--VSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRLSPQIVQL

Query:  QGSIKQPIFSCKF----SRDRVSQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPE
        QG+ KQ +F+CKF    S DR + +  LS+  SG    S      +ERKGW + IHDLSGS VA A + TPFVPS G + V+RS+PG+WLI+RPD     
Subjt:  QGSIKQPIFSCKF----SRDRVSQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPE

Query:  SWQPWGKLEAWRE-------------------------HPYQRRERRGVLIDTDKQLRAAASPIPSPQSSGDFAALGQVVG-------------------
        +W+PW +L+AWRE                              +     +ID        AS   S + S D ++   +                     
Subjt:  SWQPWGKLEAWRE-------------------------HPYQRRERRGVLIDTDKQLRAAASPIPSPQSSGDFAALGQVVG-------------------

Query:  ---GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIR
           GFVMS RVQG  K SKP V++ ++HVTC EDAA  +ALAAAVDLS++ACR F +K+R
Subjt:  ---GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIR

AT3G19680.1 Protein of unknown function (DUF1005)3.8e-6533.4Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTK------PGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSH--------NIASSFYLEESDLKALLAPGCFYN
        +F+R+ +G L +R P ++ +S+        G+N  +    C+IR + FP +  SVP++   E+  ++          +A+ F L ++ ++A L    F  
Subjt:  AFIRLSIGWLGLRIPGAALNSTK------PGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSH--------NIASSFYLEESDLKALLAPGCFYN

Query:  TQACLEISVFS--------GRKGSHCGVSIK-RQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRL
          + L +  +S        G  G+ CG++    +L+G F++ +  +  + K  +  NGW+ +  SK+   + G+  ELH+ V+++PDPR+VFQF+     
Subjt:  TQACLEISVFS--------GRKGSHCGVSIK-RQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGA--ELHLRVKLDPDPRYVFQFEDVTRL

Query:  SPQIVQLQGSIKQPIFSCKF--------SRDRVSQADPLSNYWSG----SADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSN
        SPQ+ Q+QG+ KQ +F+CKF         R+ +  +  +S   S     S+  S+ E   +ERKGW + +HDLSGS VA A + TPFVPS G + V RS+
Subjt:  SPQIVQLQGSIKQPIFSCKF--------SRDRVSQADPLSNYWSG----SADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSN

Query:  PGSWLIVRPDVCIPESWQPWGKLEAWREHPY-------------------------QRRERRGVLID-TDKQLRAAASPIPSPQSSGDFAALGQVVG---
        PG+WLI+RPD C   +W+PWG+LEAWRE  Y                           +     +ID T      A++P  SPQ S D  + G   G   
Subjt:  PGSWLIVRPDVCIPESWQPWGKLEAWREHPY-------------------------QRRERRGVLID-TDKQLRAAASPIPSPQSSGDFAALGQVVG---

Query:  ---------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRR
                                   GFVMS  V+G GK SKP V++ + HVTC EDAA  +ALAAAVDLS++ACR F  K+R+
Subjt:  ---------------------------GFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRR

AT4G29310.1 Protein of unknown function (DUF1005)1.6e-7138.81Show/hide
Query:  FIRLSIGWLGLRIPGAALNSTKPG-MNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAI--PDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSG
        F+RL+I  L LR+P  A N    G ++  S+P  C++R++ FP Q + +PL    +A   P+S   A  F+L+   ++ +         +  L +SV++G
Subjt:  FIRLSIGWLGLRIPGAALNSTKPG-MNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAI--PDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSG

Query:  RKGSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRD
        R G  CGV+   +L+G  ++ V       + V   NGW  +G    +  +  A LHL V  +PDPR+VFQF      SP + Q+Q ++KQP+FSCKFS D
Subjt:  RKGSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRD

Query:  RVSQADPLSNYWSGSADGSDLEV---------ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAW
        R  ++  L + ++ S+ G              + RERKGW + IHDLSGS VAAA + TPFV S G D V+RSNPG+WLI+RP      SW+PWG+LEAW
Subjt:  RVSQADPLSNYWSGSADGSDLEV---------ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAW

Query:  REH------PYQRRERRGVLIDTDKQLRAAASPIPSPQ-----------SSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMAL
        RE        Y+    R     T   +  A   + + Q             G+  A+   V GFVM   V+GEGK SKP+V +  +HVTC+ DAA+F+AL
Subjt:  REH------PYQRRERRGVLIDTDKQLRAAASPIPSPQ-----------SSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMAL

Query:  AAAVDLSIEACRPFRRKIRR
        +AAVDLS++AC+ F RK+R+
Subjt:  AAAVDLSIEACRPFRRKIRR

AT5G17640.1 Protein of unknown function (DUF1005)3.4e-16770.33Show/hide
Query:  AFIRLSIGWLGLRIPGAALNSTKPG--MNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSG
        AFIRLS+G L LRIP   +NST        FSS  SCEI+LRGFPVQT+S+PL+PS +A PD H+I++SFYLEESDL+ALL PGCFY+  A LEISVF+G
Subjt:  AFIRLSIGWLGLRIPGAALNSTKPG--MNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNTQACLEISVFSG

Query:  RKGSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRD
        +K  +CGV  KRQ IG FKL+VGPEWG+GKP+ILFNGWI IGK+K++     AELHL+VKLDPDPRYVFQFEDVT LSPQIVQL+GS+KQPIFSCKFSRD
Subjt:  RKGSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSRD

Query:  RVSQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWREHPYQ---
        RVSQ DPL+ YWS S DG++LE ERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVA+SNPG+WL+VRPD   P SWQPWGKLEAWRE   +   
Subjt:  RVSQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWREHPYQ---

Query:  -----------------------RRERRG-VLIDTDKQ-LRAAASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAI
                                 E+ G  LIDTDKQ L  AA+PIPSPQSSGDF+ LGQ V  GGFVMS RVQGEGKSSKP+VQLAMRHVTCVEDAAI
Subjt:  -----------------------RRERRG-VLIDTDKQ-LRAAASPIPSPQSSGDFAALGQVV--GGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAI

Query:  FMALAAAVDLSIEACRPFRRKIRRGPRH
        FMALAAAVDLSI AC+PFRR  RR  RH
Subjt:  FMALAAAVDLSIEACRPFRRKIRRGPRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATACTTGAGCTCGTCGAAATATAAACGATCAATCAAATGGACAAAAAATTGTAACCGAAATGGATCCTCAGCTTTTATTAGGTTGTCAATTGGTTGGTTGGGATTGAGA
ATCCCGGGAGCTGCTCTAAACTCTACAAAACCTGGAATGAATGCTTTCTCTTCTCCGTATTCGTGTGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACGTCTTCGGTCCC
CCTACTCCCTTCTCCTGAAGCCATACCCGACTCTCATAACATTGCCTCAAGCTTCTATCTTGAAGAGTCTGATCTTAAAGCATTACTGGCACCTGGCTGCTTCTACAACA
CTCAGGCCTGTCTTGAAATATCTGTCTTCTCTGGAAGGAAGGGATCCCACTGTGGTGTTAGCATCAAAAGGCAGCTGATTGGGACGTTTAAGCTGGACGTTGGTCCCGAA
TGGGGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGCAAGCAAGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTGGA
TCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTTACCAGGTTAAGCCCGCAAATCGTCCAGCTTCAAGGGTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTC
GAGACAGGGTGTCCCAGGCGGATCCATTGAGCAATTATTGGTCAGGTTCTGCTGATGGCTCCGATCTCGAGGTCGAGAGAAGAGAAAGAAAAGGCTGGAAGGTGAAGATA
CATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATCAACAGGTTGCGATTGGGTTGCCAGGTCGAACCCCGGATCCTGGCTGATTGT
TCGTCCCGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCGTGGCGCGAGCATCCATATCAACGCCGAGAAAGGCGGGGAGTTCTCATAGACACTG
ACAAACAGCTGCGAGCAGCTGCAAGTCCAATACCAAGCCCGCAGAGCAGTGGAGACTTTGCAGCATTAGGCCAAGTTGTTGGAGGTTTCGTCATGAGCTGCAGAGTACAA
GGGGAAGGAAAGAGCAGTAAGCCGATGGTTCAACTCGCAATGCGACATGTGACGTGTGTGGAGGATGCTGCCATTTTCATGGCGCTTGCAGCAGCGGTCGACCTCAGCAT
CGAAGCTTGCAGGCCGTTCCGAAGGAAGATCAGGAGAGGGCCTCGCCATTCTTAG
mRNA sequenceShow/hide mRNA sequence
GATACTTGAGCTCGTCGAAATATAAACGATCAATCAAATGGACAAAAAATTGTAACCGAAATGGATCCTCAGCTTTTATTAGGTTGTCAATTGGTTGGTTGGGATTGAGA
ATCCCGGGAGCTGCTCTAAACTCTACAAAACCTGGAATGAATGCTTTCTCTTCTCCGTATTCGTGTGAAATTCGTCTTCGGGGTTTCCCTGTGCAGACGTCTTCGGTCCC
CCTACTCCCTTCTCCTGAAGCCATACCCGACTCTCATAACATTGCCTCAAGCTTCTATCTTGAAGAGTCTGATCTTAAAGCATTACTGGCACCTGGCTGCTTCTACAACA
CTCAGGCCTGTCTTGAAATATCTGTCTTCTCTGGAAGGAAGGGATCCCACTGTGGTGTTAGCATCAAAAGGCAGCTGATTGGGACGTTTAAGCTGGACGTTGGTCCCGAA
TGGGGTGATGGGAAGCCAGTCATTCTTTTCAATGGGTGGATAGGCATTGGCAAAAGCAAGCAAGAGAATGGAAGACATGGAGCAGAGCTTCATTTGAGAGTGAAACTGGA
TCCTGATCCAAGATATGTTTTTCAGTTTGAAGATGTTACCAGGTTAAGCCCGCAAATCGTCCAGCTTCAAGGGTCGATCAAACAGCCGATCTTCAGCTGCAAATTTAGTC
GAGACAGGGTGTCCCAGGCGGATCCATTGAGCAATTATTGGTCAGGTTCTGCTGATGGCTCCGATCTCGAGGTCGAGAGAAGAGAAAGAAAAGGCTGGAAGGTGAAGATA
CATGATCTTTCTGGCTCGGCTGTTGCAGCAGCCTTCATAACTACTCCCTTTGTGCCATCAACAGGTTGCGATTGGGTTGCCAGGTCGAACCCCGGATCCTGGCTGATTGT
TCGTCCCGATGTTTGCATACCTGAAAGTTGGCAGCCATGGGGAAAGCTCGAGGCGTGGCGCGAGCATCCATATCAACGCCGAGAAAGGCGGGGAGTTCTCATAGACACTG
ACAAACAGCTGCGAGCAGCTGCAAGTCCAATACCAAGCCCGCAGAGCAGTGGAGACTTTGCAGCATTAGGCCAAGTTGTTGGAGGTTTCGTCATGAGCTGCAGAGTACAA
GGGGAAGGAAAGAGCAGTAAGCCGATGGTTCAACTCGCAATGCGACATGTGACGTGTGTGGAGGATGCTGCCATTTTCATGGCGCTTGCAGCAGCGGTCGACCTCAGCAT
CGAAGCTTGCAGGCCGTTCCGAAGGAAGATCAGGAGAGGGCCTCGCCATTCTTAG
Protein sequenceShow/hide protein sequence
YLSSSKYKRSIKWTKNCNRNGSSAFIRLSIGWLGLRIPGAALNSTKPGMNAFSSPYSCEIRLRGFPVQTSSVPLLPSPEAIPDSHNIASSFYLEESDLKALLAPGCFYNT
QACLEISVFSGRKGSHCGVSIKRQLIGTFKLDVGPEWGDGKPVILFNGWIGIGKSKQENGRHGAELHLRVKLDPDPRYVFQFEDVTRLSPQIVQLQGSIKQPIFSCKFSR
DRVSQADPLSNYWSGSADGSDLEVERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVARSNPGSWLIVRPDVCIPESWQPWGKLEAWREHPYQRRERRGVLIDTD
KQLRAAASPIPSPQSSGDFAALGQVVGGFVMSCRVQGEGKSSKPMVQLAMRHVTCVEDAAIFMALAAAVDLSIEACRPFRRKIRRGPRHS