; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005900 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005900
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold254:2313708..2315485
RNA-Seq ExpressionMS005900
SyntenyMS005900
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048879.1 uncharacterized protein E6C27_scaffold171G00940 [Cucumis melo var. makuwa]1.7e-21781.47Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADG D PSIG +P +RASSY++L PDSKWWLQTQSSYG+QKIFT E +N 
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEA NETSKSGTEKSCTSSDIH  +GS++V  +DD+SRSSLDTDHGVSA+CTKRV  I+NEDIK L +GTDSQEC  SVDMKA+FEC + DSFNSKT+SK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDE YFDPDSPWIQEDKA PWWWITD+DELAYWVAQ+SLD IENCDLPPPKKTCLN KR P+AQKQC+EH+TN +STF S H+NCGLDFC    TQR+L
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
         ES  QGNLLHGSH+S S T P+N TKT Q SE+NTSKAELMDALLHSQTRAREAE AAK+AYAEKEHIV+LFFRQA+QLFAYKQWFQLLQLESLQIKN+
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        +QPMSTLFPL+LPW SYKNM+S KR +R T +KRV+QDRH+SDISTYAVAFALGLSLVSAG LLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

XP_008437876.1 PREDICTED: uncharacterized protein LOC103483175 isoform X2 [Cucumis melo]2.9e-21781.26Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADG D PSIG +P +RASSY++L PDSKWWLQTQS YG+QKIFT E +N 
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEA NETSKSGTEKSCTSSDIH  +GS++V  +DD+SRSSLDTDHGVSA+CTKRV  I+NEDIK L +GTDSQEC  SVDMKA+FEC + DSFNSKT+SK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDE YFDPDSPWIQEDKA PWWWITD+DELAYWVAQ+SLD IENCDLPPPKKTCLN KR P+AQKQC+EH+TN +STF+S H+NCGLDFC    TQR+L
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
         ES  QGNLLHGSH+S S T P+N TKT Q SE+NTSKAELMDALLHSQTRAREAE AAK+AYAEKEHIV+LFFRQA+QLFAYKQWFQLLQLESLQIKN+
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        +QPMSTLFPL+LPW SYKNM+S KR +R T +KRV+QDRH+SDISTYAVAFALGLSLVSAG LLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

XP_011650672.1 uncharacterized protein LOC101208119 [Cucumis sativus]4.4e-22181.68Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADG D PS G +P +RASSY++L PDSKWWLQTQSSYG+QKIFT E IN 
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEAGNETSKSGTEKSCTSSDIH  +GS+TV  +DD+SRSSLDTDHGVS +CTKRV +I+NEDIK L +GTDSQEC+ SVDMKA+FEC E DSFNSKT+SK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDEFYFDPDSPWIQE+KAEPWWWITD+DELAYWVAQ+SLD IENCDLPPPKKTCL+ KR P+A+KQCYEH+TN +STF+S H+NCGLDFC    TQR+L
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
         ESI QGNLLH SH+S SCTNP++ TKT Q SE+NTSKAELMDALLHSQTRAREAE AAK+AYAEKEHIV+LF RQA+QLFAYKQWFQLLQLESLQIKN+
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        +QPMS LFPL+LPW SYKNM+S KRW+R T +KRV+QD+ +SDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

XP_022147481.1 uncharacterized protein LOC111016393 [Momordica charantia]2.1e-27199.16Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVD GPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLD IENCDLPPPKKTCLNLKRYPH QKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
        GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVK+DRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

XP_038896085.1 uncharacterized protein LOC120084287 [Benincasa hispida]7.7e-23485.89Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADG D PSIG +P +RASSY++L PD+KWWLQTQSSYGYQKIFTPE +NA
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEAGNETSKS TEKSCTSSDIH  +G +TV EID +SRSSLDTDHGVSA+CTKRVNSI+NEDIK L DGTDSQEC+ S DMKA FEC + DSFNSKT+SK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDEFYFDPDSPWIQEDKAEPWWWITD+DELAYWVAQ+SLD IENCDLPPPKKTCLN  RYP+AQKQCYEHD N +STF+S HRNCGLDFC   +TQR+L
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
        GESI QGN+L GS +S SCTNP+N TKTT ISE+NTSKAELMDALLHSQTRAREAE AAK+AYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        DQPMSTLFPL+LPWMSYKNM+S KRWQRF+R+KRVK+DRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

TrEMBL top hitse value%identityAlignment
A0A0A0L5V2 Uncharacterized protein2.1e-22181.68Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADG D PS G +P +RASSY++L PDSKWWLQTQSSYG+QKIFT E IN 
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEAGNETSKSGTEKSCTSSDIH  +GS+TV  +DD+SRSSLDTDHGVS +CTKRV +I+NEDIK L +GTDSQEC+ SVDMKA+FEC E DSFNSKT+SK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDEFYFDPDSPWIQE+KAEPWWWITD+DELAYWVAQ+SLD IENCDLPPPKKTCL+ KR P+A+KQCYEH+TN +STF+S H+NCGLDFC    TQR+L
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
         ESI QGNLLH SH+S SCTNP++ TKT Q SE+NTSKAELMDALLHSQTRAREAE AAK+AYAEKEHIV+LF RQA+QLFAYKQWFQLLQLESLQIKN+
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        +QPMS LFPL+LPW SYKNM+S KRW+R T +KRV+QD+ +SDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

A0A1S3AVN5 uncharacterized protein LOC103483175 isoform X21.4e-21781.26Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADG D PSIG +P +RASSY++L PDSKWWLQTQS YG+QKIFT E +N 
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEA NETSKSGTEKSCTSSDIH  +GS++V  +DD+SRSSLDTDHGVSA+CTKRV  I+NEDIK L +GTDSQEC  SVDMKA+FEC + DSFNSKT+SK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDE YFDPDSPWIQEDKA PWWWITD+DELAYWVAQ+SLD IENCDLPPPKKTCLN KR P+AQKQC+EH+TN +STF+S H+NCGLDFC    TQR+L
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
         ES  QGNLLHGSH+S S T P+N TKT Q SE+NTSKAELMDALLHSQTRAREAE AAK+AYAEKEHIV+LFFRQA+QLFAYKQWFQLLQLESLQIKN+
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        +QPMSTLFPL+LPW SYKNM+S KR +R T +KRV+QDRH+SDISTYAVAFALGLSLVSAG LLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

A0A5A7U5X9 Uncharacterized protein8.3e-21881.47Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADG D PSIG +P +RASSY++L PDSKWWLQTQSSYG+QKIFT E +N 
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEA NETSKSGTEKSCTSSDIH  +GS++V  +DD+SRSSLDTDHGVSA+CTKRV  I+NEDIK L +GTDSQEC  SVDMKA+FEC + DSFNSKT+SK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDE YFDPDSPWIQEDKA PWWWITD+DELAYWVAQ+SLD IENCDLPPPKKTCLN KR P+AQKQC+EH+TN +STF S H+NCGLDFC    TQR+L
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
         ES  QGNLLHGSH+S S T P+N TKT Q SE+NTSKAELMDALLHSQTRAREAE AAK+AYAEKEHIV+LFFRQA+QLFAYKQWFQLLQLESLQIKN+
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        +QPMSTLFPL+LPW SYKNM+S KR +R T +KRV+QDRH+SDISTYAVAFALGLSLVSAG LLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

A0A5D3DC19 Uncharacterized protein1.4e-21781.26Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADG D PSIG +P +RASSY++L PDSKWWLQTQSSYG+QKIFT E +N 
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEA NETSKSGTEKSCTSSDIH  +GS++V  +DD+SRSSLDTDHGVSA+CTKRV  I+NEDIK L +GTDSQEC  SVDMKA+FEC + DSFNSKT+SK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDE YFDPDSPWIQEDKA PWWWITD+DELAYWVAQ+SLD IENCDLPPPKKTCLN KR P+AQKQC+EH+TN +STF+S H+NCGLDFC    TQR+L
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
         ES  QGNLLHGSH+S S T P+N TKT Q SE+NTSKAELMDALLHSQTRAREAE AAK+AYAEKEHIV+LFFRQA+QLFAYKQWFQLLQLESLQIKN+
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        +QPMS LFPL+LPW SYKNM+S KR +R T +KRV+QDRH+SDISTYAVAFALGLSLVSAG LLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

A0A6J1D2H8 uncharacterized protein LOC1110163931.0e-27199.16Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
        MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVD GPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINA

Query:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
        LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK
Subjt:  LEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISK

Query:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
        NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLD IENCDLPPPKKTCLNLKRYPH QKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL
Subjt:  NQDEFYFDPDSPWIQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQREL

Query:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
        GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN
Subjt:  GESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNN

Query:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
        DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVK+DRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF
Subjt:  DQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G01240.1 unknown protein9.3e-3629.5Show/hide
Query:  MAAAEARAAFQRTVNRCF-VQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQIN
        M AAEARA +QRT +RCF V EDAK APRLACCQ   S+S   +    ++ + G             +S ++    D+KWWL+                 
Subjt:  MAAAEARAAFQRTVNRCF-VQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQIN

Query:  ALEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTIS
                  +G ++  T+S +  +K                                                             C ++  F      
Subjt:  ALEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTIS

Query:  KNQDEFYFDPDSPWIQEDKAEPWW-WITDRDELAYWVAQRSLD-RIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQ
        + ++++ F       + D   PWW   TD+DELA  VA +S+D  I+NCDLPPP+K   ++            H ++    F +  ++        D  +
Subjt:  KNQDEFYFDPDSPWIQEDKAEPWW-WITDRDELAYWVAQRSLD-RIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQ

Query:  RELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLES--L
        R L          +GS +S + T+P +S ++     ++ SK +L++AL HSQTRAREAE AA++A AEK+ ++ +  +QASQ+ AYKQW +LL++E+  L
Subjt:  RELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLES--L

Query:  QIKNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLP
        Q+K  ++    +          K M   KR QR  ++K+ +  R       Y +AFALG SL+ AGLLLGWTVGW+LP
Subjt:  QIKNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLP

AT1G01240.2 unknown protein9.3e-3629.5Show/hide
Query:  MAAAEARAAFQRTVNRCF-VQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQIN
        M AAEARA +QRT +RCF V EDAK APRLACCQ   S+S   +    ++ + G             +S ++    D+KWWL+                 
Subjt:  MAAAEARAAFQRTVNRCF-VQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQIN

Query:  ALEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTIS
                  +G ++  T+S +  +K                                                             C ++  F      
Subjt:  ALEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTIS

Query:  KNQDEFYFDPDSPWIQEDKAEPWW-WITDRDELAYWVAQRSLD-RIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQ
        + ++++ F       + D   PWW   TD+DELA  VA +S+D  I+NCDLPPP+K   ++            H ++    F +  ++        D  +
Subjt:  KNQDEFYFDPDSPWIQEDKAEPWW-WITDRDELAYWVAQRSLD-RIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQ

Query:  RELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLES--L
        R L          +GS +S + T+P +S ++     ++ SK +L++AL HSQTRAREAE AA++A AEK+ ++ +  +QASQ+ AYKQW +LL++E+  L
Subjt:  RELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLES--L

Query:  QIKNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLP
        Q+K  ++    +          K M   KR QR  ++K+ +  R       Y +AFALG SL+ AGLLLGWTVGW+LP
Subjt:  QIKNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLP

AT1G01240.3 unknown protein9.3e-3629.5Show/hide
Query:  MAAAEARAAFQRTVNRCF-VQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQIN
        M AAEARA +QRT +RCF V EDAK APRLACCQ   S+S   +    ++ + G             +S ++    D+KWWL+                 
Subjt:  MAAAEARAAFQRTVNRCF-VQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQIN

Query:  ALEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTIS
                  +G ++  T+S +  +K                                                             C ++  F      
Subjt:  ALEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTIS

Query:  KNQDEFYFDPDSPWIQEDKAEPWW-WITDRDELAYWVAQRSLD-RIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQ
        + ++++ F       + D   PWW   TD+DELA  VA +S+D  I+NCDLPPP+K   ++            H ++    F +  ++        D  +
Subjt:  KNQDEFYFDPDSPWIQEDKAEPWW-WITDRDELAYWVAQRSLD-RIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQ

Query:  RELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLES--L
        R L          +GS +S + T+P +S ++     ++ SK +L++AL HSQTRAREAE AA++A AEK+ ++ +  +QASQ+ AYKQW +LL++E+  L
Subjt:  RELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLES--L

Query:  QIKNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLP
        Q+K  ++    +          K M   KR QR  ++K+ +  R       Y +AFALG SL+ AGLLLGWTVGW+LP
Subjt:  QIKNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWMLP

AT2G46550.1 unknown protein1.4e-5536.53Show/hide
Query:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSS----STSKQVD---SGPANAAADGADHPSIGLIPLNRASSYASLFP-DSKWWLQTQSSYGYQKI
        MAAAEARA +QRTVNRCFVQEDAKRAP+L  CQSSS    S++KQV+   S P  +          G +PL R  ++  L P +++ W      +     
Subjt:  MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSS----STSKQVD---SGPANAAADGADHPSIGLIPLNRASSYASLFP-DSKWWLQTQSSYGYQKI

Query:  FTPEQINALEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDS
            Q+N                         K      E+++   S   ++ G      K  NS               QE +  ++ + ++  +  D 
Subjt:  FTPEQINALEAGNETSKSGTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDS

Query:  FNSKTISKNQDEFYFDPDSPW--IQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDF
         + K +S    E  FDP SPW  +  +KA PWW  TD+DELA  VAQRSLD +ENCDLP P+K    +KR  +   + ++ D              GL  
Subjt:  FNSKTISKNQDEFYFDPDSPW--IQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDF

Query:  CHLDETQRELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLL
               R+   S   G  + G+ +  SC N   ++     SE + SK+EL++AL  SQTRAREAEN AK+AYAEKEH+V +  +QA++LF YKQW QLL
Subjt:  CHLDETQRELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLL

Query:  QLESLQI---------KNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWM
        QLE+L +         KNND P  +    I  W       S+ + ++  R++R K  R + + + YAV  ALG+SLV AGLLLGWTVGWM
Subjt:  QLESLQI---------KNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWM

AT2G46550.2 unknown protein1.6e-4343.11Show/hide
Query:  KNQDEFYFDPDSPW--IQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQ
        K   E  FDP SPW  +  +KA PWW  TD+DELA  VAQRSLD +ENCDLP P+K    +KR  +   + ++ D              GL         
Subjt:  KNQDEFYFDPDSPW--IQEDKAEPWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQ

Query:  RELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQI
        R+   S   G  + G+ +  SC N   ++     SE + SK+EL++AL  SQTRAREAEN AK+AYAEKEH+V +  +QA++LF YKQW QLLQLE+L +
Subjt:  RELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQI

Query:  ---------KNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWM
                 KNND P  +    I  W       S+ + ++  R++R K  R + + + YAV  ALG+SLV AGLLLGWTVGWM
Subjt:  ---------KNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRHRSDISTYAVAFALGLSLVSAGLLLGWTVGWM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCAGCAGAAGCAAGGGCTGCCTTTCAGAGGACGGTTAATCGTTGCTTTGTCCAAGAAGATGCCAAAAGAGCTCCCAGATTAGCTTGCTGTCAATCATCATCTTC
AACATCCAAACAGGTTGATTCTGGACCTGCCAATGCAGCAGCAGATGGGGCAGATCATCCTTCCATAGGTTTGATTCCCCTTAACAGAGCTTCTTCATATGCTAGTTTAT
TTCCAGACTCAAAATGGTGGCTTCAGACTCAATCTAGCTATGGATACCAAAAGATTTTTACCCCTGAACAAATTAATGCTTTAGAAGCTGGCAATGAAACTTCTAAGTCT
GGTACGGAAAAATCATGTACTTCAAGTGATATTCACCCATCTAAAGGAAGCAGTACTGTCTATGAAATTGATGATTATAGCAGATCTTCTCTAGATACTGATCATGGTGT
CTCTGCTGTTTGTACAAAGAGAGTGAACAGCATTATGAATGAAGACATAAAGGCCTTGGACGATGGTACAGACTCTCAAGAATGTCTCTGTAGTGTAGATATGAAAGCAA
ACTTTGAATGTTCAGAGATGGATTCTTTCAACAGCAAAACAATTTCCAAAAATCAAGATGAGTTTTACTTTGATCCAGATTCTCCTTGGATTCAAGAGGACAAGGCTGAG
CCATGGTGGTGGATAACAGATAGAGATGAATTGGCCTATTGGGTTGCACAGAGATCTCTTGATCGTATTGAGAATTGTGATCTTCCACCACCTAAAAAAACATGCTTGAA
CTTAAAGAGATACCCTCATGCTCAAAAGCAATGCTACGAACATGATACAAATTGGATTTCTACCTTTGATTCGAAGCACCGAAATTGTGGACTGGACTTTTGTCATTTGG
ACGAAACACAGAGAGAATTGGGGGAATCGATTGGACAAGGGAACTTGCTGCATGGATCTCATCAATCTTTGAGCTGTACAAATCCCAACAACTCCACAAAAACAACACAA
ATTTCTGAGGAAAATACCAGCAAAGCTGAGCTAATGGATGCACTGCTACATTCTCAAACCCGTGCGAGGGAAGCCGAGAATGCAGCAAAGCAAGCTTATGCAGAGAAGGA
GCATATAGTTGACCTTTTCTTCAGACAAGCATCCCAGCTCTTTGCATATAAGCAATGGTTTCAACTCCTGCAACTAGAAAGCCTCCAGATAAAGAACAATGATCAGCCTA
TGTCCACTCTTTTTCCTTTAATTCTTCCATGGATGTCCTACAAGAACATGATATCTGATAAGAGATGGCAGAGATTCACCAGGCGGAAAAGAGTTAAGCAGGACCGGCAT
AGAAGTGATATTTCTACTTATGCCGTTGCGTTTGCTCTAGGATTGAGCTTGGTCAGTGCAGGTTTGCTTCTGGGGTGGACAGTAGGGTGGATGCTACCTTCTTTC
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCAGCAGAAGCAAGGGCTGCCTTTCAGAGGACGGTTAATCGTTGCTTTGTCCAAGAAGATGCCAAAAGAGCTCCCAGATTAGCTTGCTGTCAATCATCATCTTC
AACATCCAAACAGGTTGATTCTGGACCTGCCAATGCAGCAGCAGATGGGGCAGATCATCCTTCCATAGGTTTGATTCCCCTTAACAGAGCTTCTTCATATGCTAGTTTAT
TTCCAGACTCAAAATGGTGGCTTCAGACTCAATCTAGCTATGGATACCAAAAGATTTTTACCCCTGAACAAATTAATGCTTTAGAAGCTGGCAATGAAACTTCTAAGTCT
GGTACGGAAAAATCATGTACTTCAAGTGATATTCACCCATCTAAAGGAAGCAGTACTGTCTATGAAATTGATGATTATAGCAGATCTTCTCTAGATACTGATCATGGTGT
CTCTGCTGTTTGTACAAAGAGAGTGAACAGCATTATGAATGAAGACATAAAGGCCTTGGACGATGGTACAGACTCTCAAGAATGTCTCTGTAGTGTAGATATGAAAGCAA
ACTTTGAATGTTCAGAGATGGATTCTTTCAACAGCAAAACAATTTCCAAAAATCAAGATGAGTTTTACTTTGATCCAGATTCTCCTTGGATTCAAGAGGACAAGGCTGAG
CCATGGTGGTGGATAACAGATAGAGATGAATTGGCCTATTGGGTTGCACAGAGATCTCTTGATCGTATTGAGAATTGTGATCTTCCACCACCTAAAAAAACATGCTTGAA
CTTAAAGAGATACCCTCATGCTCAAAAGCAATGCTACGAACATGATACAAATTGGATTTCTACCTTTGATTCGAAGCACCGAAATTGTGGACTGGACTTTTGTCATTTGG
ACGAAACACAGAGAGAATTGGGGGAATCGATTGGACAAGGGAACTTGCTGCATGGATCTCATCAATCTTTGAGCTGTACAAATCCCAACAACTCCACAAAAACAACACAA
ATTTCTGAGGAAAATACCAGCAAAGCTGAGCTAATGGATGCACTGCTACATTCTCAAACCCGTGCGAGGGAAGCCGAGAATGCAGCAAAGCAAGCTTATGCAGAGAAGGA
GCATATAGTTGACCTTTTCTTCAGACAAGCATCCCAGCTCTTTGCATATAAGCAATGGTTTCAACTCCTGCAACTAGAAAGCCTCCAGATAAAGAACAATGATCAGCCTA
TGTCCACTCTTTTTCCTTTAATTCTTCCATGGATGTCCTACAAGAACATGATATCTGATAAGAGATGGCAGAGATTCACCAGGCGGAAAAGAGTTAAGCAGGACCGGCAT
AGAAGTGATATTTCTACTTATGCCGTTGCGTTTGCTCTAGGATTGAGCTTGGTCAGTGCAGGTTTGCTTCTGGGGTGGACAGTAGGGTGGATGCTACCTTCTTTC
Protein sequenceShow/hide protein sequence
MAAAEARAAFQRTVNRCFVQEDAKRAPRLACCQSSSSTSKQVDSGPANAAADGADHPSIGLIPLNRASSYASLFPDSKWWLQTQSSYGYQKIFTPEQINALEAGNETSKS
GTEKSCTSSDIHPSKGSSTVYEIDDYSRSSLDTDHGVSAVCTKRVNSIMNEDIKALDDGTDSQECLCSVDMKANFECSEMDSFNSKTISKNQDEFYFDPDSPWIQEDKAE
PWWWITDRDELAYWVAQRSLDRIENCDLPPPKKTCLNLKRYPHAQKQCYEHDTNWISTFDSKHRNCGLDFCHLDETQRELGESIGQGNLLHGSHQSLSCTNPNNSTKTTQ
ISEENTSKAELMDALLHSQTRAREAENAAKQAYAEKEHIVDLFFRQASQLFAYKQWFQLLQLESLQIKNNDQPMSTLFPLILPWMSYKNMISDKRWQRFTRRKRVKQDRH
RSDISTYAVAFALGLSLVSAGLLLGWTVGWMLPSF