CuGenDBv2

MS022371 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS022371
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF1068)
Genome location: scaffold47:2845407..2846853
RNA-Seq Expression: MS022371
Synteny: MS022371
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010471 - Protein of unknown function DUF1068


Homology
GenBank top hits (e value, % identity, alignment)
KAG6572284.1 hypothetical protein SDJN03_29012, partial [Cucurbita argyrosperma subsp. sororia] (e value 2.6e-82, 82.91% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQA+SC P LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP++      L      DC+K DPEVSQDTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

KAG6589153.1 hypothetical protein SDJN03_17718, partial [Cucurbita argyrosperma subsp. sororia] (e value 5.7e-82, 84.42% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQA SC P+LIK GLA IAL IAGYI+GPPLYWH  EG AVVSRSSSSSSCPPCFCDCPSQPVISIPE+      L     ADCVK DPEVSQDTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKEAEA E+ RRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA WE RARQRGWKEGA +SRI+KQGNIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_022136333.1 uncharacterized protein LOC111008045 [Momordica charantia] (e value 2.5e-93, 93.47% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPE+      L      DCVKRDPEVSQDTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_022930736.1 uncharacterized protein LOC111437126 isoform X1 [Cucurbita moschata] (e value 2.0e-82, 84.92% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQA SC P+LIK GLA IAL IAGYI+GPPLYWH  EG AVVSRSSSSSSCPPCFCDCPSQPVISIPE+      L     ADCVK DPEVSQDTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKEAEA E+ RRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA WE RARQRGWKEGA +SRIQKQGNIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_022952351.1 uncharacterized protein LOC111455059 isoform X1 [Cucurbita moschata] (e value 2.0e-82, 82.91% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQA+SC P+LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP++      L      DC+K DPEVSQDTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C135 uncharacterized protein LOC103495614 isoform X1 (e value 4.7e-82, 82.91% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQ TSC P+LIK GLA IA+ I GYI+GPPLYWHF EGLAVVS SS+SSSCPPCFCDCPSQPVISIPE+      L     ADCVK DPEVS+DTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTA WE RARQRGWKEG  +SR QKQGNIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1C7C2 uncharacterized protein LOC111008045 (e value 1.2e-93, 93.47% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPE+      L      DCVKRDPEVSQDTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1ERF9 uncharacterized protein LOC111437126 isoform X1 (e value 9.5e-83, 84.92% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQA SC P+LIK GLA IAL IAGYI+GPPLYWH  EG AVVSRSSSSSSCPPCFCDCPSQPVISIPE+      L     ADCVK DPEVSQDTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKEAEA E+ RRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA WE RARQRGWKEGA +SRIQKQGNIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1GK01 uncharacterized protein LOC111455059 isoform X1 (e value 9.5e-83, 82.91% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQA+SC P+LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP++      L      DC+K DPEVSQDTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1HXB7 uncharacterized protein LOC111468321 (e value 2.8e-82, 82.41% identity)
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN
        MAIQA+SC P+LIKFGLA IAL IAGYI+ PPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP++      L      DC+K DPEVSQDTEKN
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKN

Query:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        FADLLLEELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  FADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G05070.1 Protein of unknown function (DUF1068) (e value 3.1e-57, 59.79% identity)
Query:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKNFADLLLEELK
        A +K GLA + L +AGYI+GPPLYWH  E LA V    S+SSCP C C+C +   ++IP++      LS    ADC K DPEV++DTEKN+A+LL EELK
Subjt:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKNFADLLLEELK

Query:  LKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        L+EAE+LE  +RADM LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  LA QK+LT+ WE RARQ+GW+EG+ +  ++ + N+Q A
Subjt:  LKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

AT2G32580.1 Protein of unknown function (DUF1068) (e value 2.3e-52, 56.61% identity)
Query:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKNFADLLLEELK
        A +K GLA +AL + GYI+GPPLYWH  E LAV     S++SC  C CDC S P+++IP        LS     DC KRDPEV++DTEKN+A+LL EELK
Subjt:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKNFADLLLEELK

Query:  LKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
         +EA ++E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT++WE RARQ+G+K+GA +S ++ +   + A
Subjt:  LKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

AT2G32580.2 Protein of unknown function (DUF1068) (e value 4.3e-35, 62.61% identity)
Query:  DCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEG
        +C KRDPEV++DTEKN+A+LL EELK +EA ++E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT++WE RARQ+G+K+G
Subjt:  DCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEG

Query:  AGRSRIQKQGNIQTA
        A +S ++ +   + A
Subjt:  AGRSRIQKQGNIQTA

AT4G04360.1 Protein of unknown function (DUF1068) (e value 9.5e-43, 54.97% identity)
Query:  IALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKNFADLLLEELKLKEAEALEN
        + LCI  YI GP LYWH  E +A     S  SSCPPC CDC SQP++SIP+  S    L      DC++ + E S+++E +F +++ EELKL+EA+A E+
Subjt:  IALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKNFADLLLEELKLKEAEALEN

Query:  QRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRI
        + RAD  LL+AKK  SQYQKEADKC+ GMETCE AREKAEA L  Q+RL+ +WELRARQ GWKEG   S +
Subjt:  QRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRI

AT4G30996.1 Protein of unknown function (DUF1068) (e value 2.1e-34, 47.59% identity)
Query:  LATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKNFADLLLEELKLKEAEA
        L   A+  A  + GP LYW F +G   V  + ++S CPPC CDCP  P +S+ + +  +  LS+    DC   DPE+ Q+ EK F DLL EELKL+EA A
Subjt:  LATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKNFADLLLEELKLKEAEA

Query:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWK
         E+ R  ++ L EAK++ SQYQKEA+KCN+  E CE ARE+AEA+L  ++++T++WE RARQ GW+
Subjt:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWK
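
The % identity values listed with each hit appear consistent with counting identical positions over the aligned length: for XP_022136333.1, the middle lines of the two alignment blocks carry 186 letters across 199 aligned columns, and 186/199 is about 93.47%. The following is a minimal Python sketch of that calculation, assuming a letter on the middle line marks an identical residue while `+` and spaces do not. The function name `percent_identity` is hypothetical and is not part of CuGenDBv2.

```python
def percent_identity(blocks):
    """Percent identity from (query, midline) pairs, one pair per alignment block.

    Assumption: a letter on the midline marks an identical residue;
    '+' (similar residue) and ' ' (mismatch) do not count.
    """
    aligned = 0
    identical = 0
    for query, midline in blocks:
        aligned += len(query)
        # Only look at columns covered by the query, in case of padding.
        identical += sum(ch.isalpha() for ch in midline[:len(query)])
    if aligned == 0:
        raise ValueError("no aligned columns")
    return 100.0 * identical / aligned


# Short excerpt of one block: 8 identical positions out of 21 columns.
excerpt = [("IPEDSSTVELLSVVCCADCVK",
            "IPE+      L      DCVK")]
print(round(percent_identity(excerpt), 2))
```

Applied to the full pair of blocks for a hit, the same function should reproduce the identity value listed for that hit, provided the spacing of the middle line is preserved exactly.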


Sequences
CDS sequence
ATGGCAATTCAGGCCACTTCTTGCACTCCGGCACTGATTAAGTTCGGATTGGCTACCATTGCCCTCTGTATCGCCGGCTACATTATCGGCCCTCCTCTCTATTGGCACTT
CGTCGAAGGTTTGGCCGTCGTTAGCCGCTCCTCTTCTTCTTCCTCCTGCCCTCCTTGTTTCTGCGACTGCCCTTCTCAGCCCGTCATCTCAATTCCCGAAGACAGTTCTA
CTGTTGAACTGTTATCGGTTGTATGTTGTGCAGATTGTGTTAAGCGTGACCCAGAAGTGAGTCAAGACACCGAGAAGAACTTTGCAGACCTATTGTTAGAGGAGCTGAAG
CTAAAGGAAGCCGAAGCGTTGGAAAATCAGCGACGTGCTGATATGGCTCTACTCGAGGCGAAGAAGATGACATCTCAGTATCAAAAAGAGGCAGACAAGTGCAATTCCGG
GATGGAAACATGTGAAGAAGCGAGAGAGAAAGCCGAGGCAGTATTAGCTGCACAGAAGAGACTAACAGCAGTGTGGGAGCTAAGGGCTCGCCAAAGGGGATGGAAGGAAG
GGGCTGGCAGGTCTCGTATCCAAAAGCAGGGAAATATTCAGACTGCA
mRNA sequence
ATGGCAATTCAGGCCACTTCTTGCACTCCGGCACTGATTAAGTTCGGATTGGCTACCATTGCCCTCTGTATCGCCGGCTACATTATCGGCCCTCCTCTCTATTGGCACTT
CGTCGAAGGTTTGGCCGTCGTTAGCCGCTCCTCTTCTTCTTCCTCCTGCCCTCCTTGTTTCTGCGACTGCCCTTCTCAGCCCGTCATCTCAATTCCCGAAGACAGTTCTA
CTGTTGAACTGTTATCGGTTGTATGTTGTGCAGATTGTGTTAAGCGTGACCCAGAAGTGAGTCAAGACACCGAGAAGAACTTTGCAGACCTATTGTTAGAGGAGCTGAAG
CTAAAGGAAGCCGAAGCGTTGGAAAATCAGCGACGTGCTGATATGGCTCTACTCGAGGCGAAGAAGATGACATCTCAGTATCAAAAAGAGGCAGACAAGTGCAATTCCGG
GATGGAAACATGTGAAGAAGCGAGAGAGAAAGCCGAGGCAGTATTAGCTGCACAGAAGAGACTAACAGCAGTGTGGGAGCTAAGGGCTCGCCAAAGGGGATGGAAGGAAG
GGGCTGGCAGGTCTCGTATCCAAAAGCAGGGAAATATTCAGACTGCA
Protein sequence
MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEDSSTVELLSVVCCADCVKRDPEVSQDTEKNFADLLLEELK
LKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
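
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is one way to do that in plain Python; the `translate` helper and the variable names are hypothetical and are not taken from CuGenDBv2. It assumes the standard nuclear codon table and an in-frame sequence whose length is a multiple of three.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First ten codons of the CDS listed above.
print(translate("ATGGCAATTCAGGCCACTTCTTGCACTCCG"))  # prints MAIQATSCTP
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence gives a quick consistency check between the two records.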