CuGenDBv2

Tan0010415 (gene) of Snake gourd v1 genome

Gene ID: Tan0010415
Organism: Trichosanthes anguina (Snake gourd v1)
Description: VARLMGL domain-containing protein
Genome location: LG09:72865111..72865911
RNA-Seq Expression: Tan0010415
Synteny: Tan0010415
Gene Ontology terms: NA
InterPro domains: IPR032795 - DUF3741-associated sequence motif


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057827.1 hypothetical protein E6C27_scaffold274G001140 [Cucumis melo var. makuwa] | e value: 2.5e-133 | % identity: 90.23
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAA+MRKG RTFC GDGSTSTLNQHKTNQD FPISPDLHCRQ PPTLEEMILQLELEEETARRAKLYN+DEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLD AE+ GRKSVCCEY L+GR+H+NEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEA+P P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
        L SILKRQ+LRKRAKR EKERRFS+DYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAW+AR+FL
Subjt:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

KAG6589427.1 hypothetical protein SDJN03_14850, partial [Cucurbita argyrosperma subsp. sororia] | e value: 4.1e-128 | % identity: 86.89
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRG-RMSCVNNSDILRS
        MK+LSFFLFKNSLAA++RKGIRTFC G+ ST+TLNQHKTNQD FP+SPDLHC Q PPTLEEMILQLELEEETARRAK YN+D+++G RMSCVNNSDIL S
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRG-RMSCVNNSDILRS

Query:  ARNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKA
        ARNALNQYPRFSLDGKDAMYRSSFRNLDAAE+ GRKSVCCEYAL+GR+HENEFNLTLETALRLPSTIAGE+V+WRKPGVVAKLMGLEAMP P+NA+SGKA
Subjt:  ARNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKA

Query:  KLTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
         L+SI+KRQ LRKRAKRHEKERR SMDYPGSDGTITGRLSSCSSNNGCY+VKPIATESAAWKA +FL
Subjt:  KLTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

XP_022135499.1 uncharacterized protein LOC111007440 [Momordica charantia] | e value: 2.3e-134 | % identity: 92.11
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAA+MRKGIRTFC GDGSTSTLNQHK NQDHFPISPD+ CR+NPPTLEEMILQLELEEETARRAKLYN+DEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDA   AGRKSVCCEYAL+GRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEA+P PVNAK GKAK
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
        L+SILKR  LR+RAKRHEKERRF MDYPGSDGTITGRLSSCSSNNGCY+VKPIATESAAWKAR+FL
Subjt:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

XP_031736616.1 uncharacterized protein LOC116402050 [Cucumis sativus] | e value: 6.6e-134 | % identity: 90.98
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAA+MRKG RTFC GDGSTSTLNQ KTNQD FPISPDLHCRQ PPTLEEMILQLELEEETARRAKLYN+DEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDAAE+ GRKSVCCEY L+GR+H+NEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEAMP P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
        LTSILKRQ+LRKRAKR EKERRFS+DYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAW+AR+FL
Subjt:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

XP_038878557.1 uncharacterized protein LOC120070753 [Benincasa hispida] | e value: 4.1e-136 | % identity: 92.86
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAA+MRKG RTFC GD STSTLNQHKTNQD FPISPDLHCRQ PPTLEEMILQLELEEETARRAKLYN+DEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEY L+GRIHENEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEAMP P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
        LTSILKRQ+LRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCY+VKPIATESAAWKAR+ L
Subjt:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LNU8 VARLMGL domain-containing protein | e value: 3.2e-134 | % identity: 90.98
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAA+MRKG RTFC GDGSTSTLNQ KTNQD FPISPDLHCRQ PPTLEEMILQLELEEETARRAKLYN+DEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDAAE+ GRKSVCCEY L+GR+H+NEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEAMP P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
        LTSILKRQ+LRKRAKR EKERRFS+DYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAW+AR+FL
Subjt:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

A0A5D3BIJ6 VARLMGL domain-containing protein | e value: 1.2e-133 | % identity: 90.23
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAA+MRKG RTFC GDGSTSTLNQHKTNQD FPISPDLHCRQ PPTLEEMILQLELEEETARRAKLYN+DEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLD AE+ GRKSVCCEY L+GR+H+NEFNLTLETALRLPSTIAGE+VVWRKPGVVAKLMGLEA+P P+NA+S KA 
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
        L SILKRQ+LRKRAKR EKERRFS+DYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAW+AR+FL
Subjt:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

A0A6J1C1L3 uncharacterized protein LOC111007440 | e value: 1.1e-134 | % identity: 92.11
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAA+MRKGIRTFC GDGSTSTLNQHK NQDHFPISPD+ CR+NPPTLEEMILQLELEEETARRAKLYN+DEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFRNLDA   AGRKSVCCEYAL+GRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEA+P PVNAK GKAK
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
        L+SILKR  LR+RAKRHEKERRF MDYPGSDGTITGRLSSCSSNNGCY+VKPIATESAAWKAR+FL
Subjt:  LTSILKRQTLRKRAKRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

A0A6J1EVK1 uncharacterized protein LOC111438186 | e value: 1.9e-126 | % identity: 89.18
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAA+MRKG RTFC GDGSTSTLNQHKTNQD FP SPD+ C QN PTLEEMI+QLELEEETARR KL N++EMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        RNALNQYPRFSLDGKDAMYRSSFR  DAAEK GRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNA+ G AK
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRQTLRKRA--KRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
        L+SILK+Q+LRKRA  KRHEKERRF MDY GSDGTITGRLSSCSSNNG YIVKPIATESAAWKAR FL
Subjt:  LTSILKRQTLRKRA--KRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

A0A6J1L4I6 uncharacterized protein LOC111499822 | e value: 4.2e-126 | % identity: 89.18
Query:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA
        MKELSFFLFKNSLAA+MRKG RTFC GDGSTSTLNQHKTNQD FP SP++ C QN PTLEEMILQLELEEETARR KL N+DEMRGRMSCVNNSDILRSA
Subjt:  MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSA

Query:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK
        +NALNQYPRFSLDGKDAMYRSSFR  DAAEK GRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNA+ G AK
Subjt:  RNALNQYPRFSLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAK

Query:  LTSILKRQTLRKRA--KRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
        L+SILK+Q+LRKRA  KRHEKERRF MDY GSDGTITGRLSSCSSNNG YIVKPIATESAAWKAR FL
Subjt:  LTSILKRQTLRKRA--KRHEKERRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAAGGAGTTATCCTTCTTTCTCTTCAAGAACTCTCTGGCTGCAAGGATGAGGAAAGGCATCAGAACTTTCTGCACTGGTGATGGTTCTACCTCCACTCTCAACCAACA
CAAAACCAACCAGGATCACTTCCCTATCTCGCCCGACCTGCACTGCAGACAAAACCCGCCAACCTTGGAGGAGATGATTCTACAGTTGGAATTGGAGGAGGAGACAGCTA
GAAGGGCAAAACTCTACAACCACGATGAGATGCGAGGCAGGATGTCGTGTGTAAACAACTCGGATATCTTGCGATCTGCACGGAATGCCTTGAATCAGTATCCTCGGTTT
TCTCTTGATGGAAAGGATGCGATGTATCGGTCTTCCTTTAGGAACTTGGATGCTGCTGAAAAGGCCGGGAGGAAATCAGTTTGCTGCGAGTATGCTCTCAGAGGGAGAAT
TCATGAGAATGAGTTCAATTTGACGCTGGAGACAGCCTTGCGGTTGCCATCAACGATAGCTGGAGAGAGTGTCGTGTGGCGCAAACCAGGAGTGGTAGCCAAGTTGATGG
GTCTCGAGGCAATGCCAACACCAGTGAATGCCAAAAGTGGCAAAGCAAAGTTGACCTCTATATTGAAAAGACAGACTCTAAGGAAGAGAGCTAAGAGGCATGAAAAGGAG
AGAAGATTTTCCATGGACTATCCTGGTTCTGATGGCACTATAACCGGAAGGCTAAGTTCTTGCAGTTCAAACAATGGGTGCTATATCGTGAAGCCAATTGCAACAGAATC
AGCAGCATGGAAAGCCAGGAAATTTCTTTAG
mRNA sequence
ATGAAGGAGTTATCCTTCTTTCTCTTCAAGAACTCTCTGGCTGCAAGGATGAGGAAAGGCATCAGAACTTTCTGCACTGGTGATGGTTCTACCTCCACTCTCAACCAACA
CAAAACCAACCAGGATCACTTCCCTATCTCGCCCGACCTGCACTGCAGACAAAACCCGCCAACCTTGGAGGAGATGATTCTACAGTTGGAATTGGAGGAGGAGACAGCTA
GAAGGGCAAAACTCTACAACCACGATGAGATGCGAGGCAGGATGTCGTGTGTAAACAACTCGGATATCTTGCGATCTGCACGGAATGCCTTGAATCAGTATCCTCGGTTT
TCTCTTGATGGAAAGGATGCGATGTATCGGTCTTCCTTTAGGAACTTGGATGCTGCTGAAAAGGCCGGGAGGAAATCAGTTTGCTGCGAGTATGCTCTCAGAGGGAGAAT
TCATGAGAATGAGTTCAATTTGACGCTGGAGACAGCCTTGCGGTTGCCATCAACGATAGCTGGAGAGAGTGTCGTGTGGCGCAAACCAGGAGTGGTAGCCAAGTTGATGG
GTCTCGAGGCAATGCCAACACCAGTGAATGCCAAAAGTGGCAAAGCAAAGTTGACCTCTATATTGAAAAGACAGACTCTAAGGAAGAGAGCTAAGAGGCATGAAAAGGAG
AGAAGATTTTCCATGGACTATCCTGGTTCTGATGGCACTATAACCGGAAGGCTAAGTTCTTGCAGTTCAAACAATGGGTGCTATATCGTGAAGCCAATTGCAACAGAATC
AGCAGCATGGAAAGCCAGGAAATTTCTTTAG
Protein sequence
MKELSFFLFKNSLAARMRKGIRTFCTGDGSTSTLNQHKTNQDHFPISPDLHCRQNPPTLEEMILQLELEEETARRAKLYNHDEMRGRMSCVNNSDILRSARNALNQYPRF
SLDGKDAMYRSSFRNLDAAEKAGRKSVCCEYALRGRIHENEFNLTLETALRLPSTIAGESVVWRKPGVVAKLMGLEAMPTPVNAKSGKAKLTSILKRQTLRKRAKRHEKE
RRFSMDYPGSDGTITGRLSSCSSNNGCYIVKPIATESAAWKARKFL
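
The record's fields can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of CuGenDBv2: the function names `span_length` and `expected_protein_length` are hypothetical, and it assumes a single-exon gene with no UTRs (consistent with the identical CDS and mRNA sequences listed above) and a CDS that ends in a stop codon. Under those assumptions, the inclusive span of the genome location should equal the CDS length, and the protein should have one residue fewer than the number of codons.

```python
import re


def span_length(location: str) -> int:
    """Return the inclusive length of a 'chrom:start..end' location string."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    # Coordinates are treated as 1-based and inclusive (an assumption).
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    cds_length = span_length("LG09:72865111..72865911")
    print(cds_length, expected_protein_length(cds_length))
```

For this gene the location spans 801 bp, which corresponds to 266 codons plus a stop codon.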