; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g23630 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g23630
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr10:17001497..17002707
RNA-Seq ExpressionMoc10g23630
SyntenyMoc10g23630
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.2e-5640.28Show/hide
Query:  KLSPSHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGAR------RSSRGK
        K + +H  T++ ++ E+L EY+ RF +E +KV  C+DD AM YF TGL D  LT++ G   PA+  E+L +A++ IDG EL +    R      R   GK
Subjt:  KLSPSHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGAR------RSSRGK

Query:  DWDRKSPHPKK-----------QRDHRGP----KFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------------
        D +   P  K            +R   GP     ++RFTP    I+EI   +E++ +E L   PEKLR   E+R K                        
Subjt:  DWDRKSPHPKK-----------QRDHRGP----KFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------------

Query:  ------GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHN
              GY KK+VG    +  E   ++E+R+RS  P R+ D   VINTI GG +GGQSG+KRK LA  A  E                  D E V++PHN
Subjt:  ------GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHN

Query:  DALIIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPTPLVGFQG
        DAL+IAP IDHV V RV VDGG+SAN+LS  TY ALGW R  LK+SPTPLVGF G
Subjt:  DALIIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPTPLVGFQG

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]7.8e-7863.77Show/hide
Query:  LLKLSPSHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDR
        LLKL PSH  TVK +DNESLTEYIAR MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASL +ML RARQYIDGLELWKA GARRSSRGKD D+
Subjt:  LLKLSPSHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDR

Query:  KSPHPKKQR-------------------------DHRGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------
        +S  PKK+                          D  GPKFD+FTPLNAS+AEIYA VE+TD++ALF AP+KL RPS KRDK                  
Subjt:  KSPHPKKQR-------------------------DHRGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------

Query:  ------------GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSG
                    GYLKKYVGSRERA+ EGS REEKRERS PP RKED   VINTIHGG +G +SG
Subjt:  ------------GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSG

XP_022149377.1 uncharacterized protein LOC111017807 [Momordica charantia]1.6e-5139.42Show/hide
Query:  SHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGAR--------------RS
        +H  T++ ++ E+L EY+ RF +E +KVV C+DD AM YF TGL D  LT++ G   PA+  E+L +A++ IDG EL +    R              R 
Subjt:  SHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGAR--------------RS

Query:  SRGKDWDRKSPHPKKQRDHR----GPK----FDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK---------------------------
        +  K  D+ S     + +HR    GP     ++R+TP    I+EI   +E++ +E L  +PEKLR   EKR K                           
Subjt:  SRGKDWDRKSPHPKKQRDHR----GPK----FDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK---------------------------

Query:  ---GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHNDAL
           GY KK+VG      +E   ++E+R+RS  P R++D   VINTI GG +GGQSG KRK LA EA  E                  D EGV++PHNDAL
Subjt:  ---GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHNDAL

Query:  IIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPT
        +IAP IDHV V  + VDGG+SAN+LS  TY ALGW R  LK+SPT
Subjt:  IIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPT

XP_022150385.1 uncharacterized protein LOC111018561 [Momordica charantia]1.3e-4842.03Show/hide
Query:  VKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDRKSPHPKKQRDH
        V++++ E+L EY+ RF +E +KV  C+DD AM YF TGL D  LT++ G   PA+  E+L +A++ IDG EL +    R     K  D+K  + +K++  
Subjt:  VKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDRKSPHPKKQRDH

Query:  RGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDKGYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQ
          PK D+ +  + S  + +               E  R+  +    GY KK+VG      +E   ++E+R+RS  P R++D   VINTI GG +GGQSG 
Subjt:  RGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDKGYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQ

Query:  KRKALASEAAHE------------------DREGVYMPHNDALIIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPTPLVGFQG
        KRK L  EA  E                  D EGV++PHNDAL+I P I+HV VRRV VDGG+SAN+LS  TY ALGW R  LK+SPTPLVGF G
Subjt:  KRKALASEAAHE------------------DREGVYMPHNDALIIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPTPLVGFQG

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]1.3e-10163.56Show/hide
Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDRKSPHPKKQRD-----------------
        MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEF SRPPASL EM  RARQYIDGLELWKANGARRSSRG+D D KSP  KK+ D                 
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDRKSPHPKKQRD-----------------

Query:  --------HRGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------------------GYLKKYVGSRERAEL
                 RGPKFD+FTPLNASIAEIYA VEDTD+E LFA+PEKLRRPS KR+K                              GYLKKYVGSRE+AEL
Subjt:  --------HRGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------------------GYLKKYVGSRERAEL

Query:  EGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHNDALIIAPPIDHVKVRRVPVDG
        EGSAREEKRERS PPR KED   VINTIHGG +G +SGQKRKALA E AHE                  D E V+MPHNDAL+IAP IDHVKVRRV VDG
Subjt:  EGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHNDALIIAPPIDHVKVRRVPVDG

Query:  GSSANVLSFSTYTALGWERRNLKRSPTPLVGFQGIWLARKDVS
        G+SAN+ SFSTYTALGWERR+LK   T LVGF     AR+ VS
Subjt:  GSSANVLSFSTYTALGWERRNLKRSPTPLVGFQGIWLARKDVS

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.1e-5640.28Show/hide
Query:  KLSPSHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGAR------RSSRGK
        K + +H  T++ ++ E+L EY+ RF +E +KV  C+DD AM YF TGL D  LT++ G   PA+  E+L +A++ IDG EL +    R      R   GK
Subjt:  KLSPSHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGAR------RSSRGK

Query:  DWDRKSPHPKK-----------QRDHRGP----KFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------------
        D +   P  K            +R   GP     ++RFTP    I+EI   +E++ +E L   PEKLR   E+R K                        
Subjt:  DWDRKSPHPKK-----------QRDHRGP----KFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------------

Query:  ------GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHN
              GY KK+VG    +  E   ++E+R+RS  P R+ D   VINTI GG +GGQSG+KRK LA  A  E                  D E V++PHN
Subjt:  ------GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHN

Query:  DALIIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPTPLVGFQG
        DAL+IAP IDHV V RV VDGG+SAN+LS  TY ALGW R  LK+SPTPLVGF G
Subjt:  DALIIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPTPLVGFQG

A0A6J1D5T3 uncharacterized protein LOC1110175483.8e-7863.77Show/hide
Query:  LLKLSPSHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDR
        LLKL PSH  TVK +DNESLTEYIAR MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASL +ML RARQYIDGLELWKA GARRSSRGKD D+
Subjt:  LLKLSPSHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDR

Query:  KSPHPKKQR-------------------------DHRGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------
        +S  PKK+                          D  GPKFD+FTPLNAS+AEIYA VE+TD++ALF AP+KL RPS KRDK                  
Subjt:  KSPHPKKQR-------------------------DHRGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------

Query:  ------------GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSG
                    GYLKKYVGSRERA+ EGS REEKRERS PP RKED   VINTIHGG +G +SG
Subjt:  ------------GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSG

A0A6J1D7S8 uncharacterized protein LOC1110178078.0e-5239.42Show/hide
Query:  SHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGAR--------------RS
        +H  T++ ++ E+L EY+ RF +E +KVV C+DD AM YF TGL D  LT++ G   PA+  E+L +A++ IDG EL +    R              R 
Subjt:  SHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGAR--------------RS

Query:  SRGKDWDRKSPHPKKQRDHR----GPK----FDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK---------------------------
        +  K  D+ S     + +HR    GP     ++R+TP    I+EI   +E++ +E L  +PEKLR   EKR K                           
Subjt:  SRGKDWDRKSPHPKKQRDHR----GPK----FDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK---------------------------

Query:  ---GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHNDAL
           GY KK+VG      +E   ++E+R+RS  P R++D   VINTI GG +GGQSG KRK LA EA  E                  D EGV++PHNDAL
Subjt:  ---GYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHNDAL

Query:  IIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPT
        +IAP IDHV V  + VDGG+SAN+LS  TY ALGW R  LK+SPT
Subjt:  IIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPT

A0A6J1D8C0 uncharacterized protein LOC1110185616.3e-4942.03Show/hide
Query:  VKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDRKSPHPKKQRDH
        V++++ E+L EY+ RF +E +KV  C+DD AM YF TGL D  LT++ G   PA+  E+L +A++ IDG EL +    R     K  D+K  + +K++  
Subjt:  VKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDRKSPHPKKQRDH

Query:  RGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDKGYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQ
          PK D+ +  + S  + +               E  R+  +    GY KK+VG      +E   ++E+R+RS  P R++D   VINTI GG +GGQSG 
Subjt:  RGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDKGYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQ

Query:  KRKALASEAAHE------------------DREGVYMPHNDALIIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPTPLVGFQG
        KRK L  EA  E                  D EGV++PHNDAL+I P I+HV VRRV VDGG+SAN+LS  TY ALGW R  LK+SPTPLVGF G
Subjt:  KRKALASEAAHE------------------DREGVYMPHNDALIIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPTPLVGFQG

A0A6J1E0L8 uncharacterized protein LOC1110253106.4e-10263.56Show/hide
Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDRKSPHPKKQRD-----------------
        MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEF SRPPASL EM  RARQYIDGLELWKANGARRSSRG+D D KSP  KK+ D                 
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDRKSPHPKKQRD-----------------

Query:  --------HRGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------------------GYLKKYVGSRERAEL
                 RGPKFD+FTPLNASIAEIYA VEDTD+E LFA+PEKLRRPS KR+K                              GYLKKYVGSRE+AEL
Subjt:  --------HRGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDK------------------------------GYLKKYVGSRERAEL

Query:  EGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHNDALIIAPPIDHVKVRRVPVDG
        EGSAREEKRERS PPR KED   VINTIHGG +G +SGQKRKALA E AHE                  D E V+MPHNDAL+IAP IDHVKVRRV VDG
Subjt:  EGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASEAAHE------------------DREGVYMPHNDALIIAPPIDHVKVRRVPVDG

Query:  GSSANVLSFSTYTALGWERRNLKRSPTPLVGFQGIWLARKDVS
        G+SAN+ SFSTYTALGWERR+LK   T LVGF     AR+ VS
Subjt:  GSSANVLSFSTYTALGWERRNLKRSPTPLVGFQGIWLARKDVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTTGAAGTTGTCGCCCTCTCACTTCAGAACAGTGAAGTACCAGGACAATGAGTCCCTGACGGAGTACATCGCTCGGTTCATGGATGAGCACGTCAAGGTGGTGAG
TTGTACCGACGACATCGCCATGATGTACTTCACGACAGGGCTGAACGATAGAAATTTGACGATAGAATTCGGAAGCCGTCCGCCGGCCTCCCTAAAAGAGATGCTCACCC
GAGCTCGCCAATACATTGATGGCCTGGAGCTATGGAAGGCCAACGGAGCCAGGCGAAGCAGCCGCGGTAAAGATTGGGACCGGAAGTCTCCTCATCCCAAGAAGCAACGC
GACCATCGAGGGCCGAAGTTCGACAGGTTCACTCCGTTGAACGCCTCAATCGCAGAGATCTACGCAGCAGTCGAAGACACCGACTTGGAAGCACTGTTCGCAGCCCCAGA
AAAGCTCCGCCGACCTTCAGAGAAGCGAGACAAAGGTTATTTGAAGAAGTACGTCGGCAGTAGAGAAAGAGCTGAGCTAGAAGGATCAGCTCGGGAGGAGAAGCGAGAGA
GATCACCGCCACCAAGACGAAAGGAAGATTGTTCTGTCGTTATAAACACCATTCACGGGGGTTTGAACGGGGGACAGTCGGGTCAAAAAAGAAAGGCTCTAGCCTCGGAA
GCAGCGCACGAAGACAGGGAAGGGGTGTATATGCCTCATAACGACGCTTTGATAATCGCCCCACCGATAGACCATGTGAAGGTCAGAAGAGTTCCTGTCGACGGTGGGTC
GTCGGCTAATGTATTGTCCTTCTCGACCTACACGGCCCTGGGGTGGGAGAGAAGAAACTTGAAGCGTAGCCCGACACCCTTGGTTGGCTTTCAGGGGATTTGGTTAGCGC
GGAAGGATGTGTCTCGTTCCCTGTCACCATCGGCGAAGGAGATTAGCAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGTTGAAGTTGTCGCCCTCTCACTTCAGAACAGTGAAGTACCAGGACAATGAGTCCCTGACGGAGTACATCGCTCGGTTCATGGATGAGCACGTCAAGGTGGTGAG
TTGTACCGACGACATCGCCATGATGTACTTCACGACAGGGCTGAACGATAGAAATTTGACGATAGAATTCGGAAGCCGTCCGCCGGCCTCCCTAAAAGAGATGCTCACCC
GAGCTCGCCAATACATTGATGGCCTGGAGCTATGGAAGGCCAACGGAGCCAGGCGAAGCAGCCGCGGTAAAGATTGGGACCGGAAGTCTCCTCATCCCAAGAAGCAACGC
GACCATCGAGGGCCGAAGTTCGACAGGTTCACTCCGTTGAACGCCTCAATCGCAGAGATCTACGCAGCAGTCGAAGACACCGACTTGGAAGCACTGTTCGCAGCCCCAGA
AAAGCTCCGCCGACCTTCAGAGAAGCGAGACAAAGGTTATTTGAAGAAGTACGTCGGCAGTAGAGAAAGAGCTGAGCTAGAAGGATCAGCTCGGGAGGAGAAGCGAGAGA
GATCACCGCCACCAAGACGAAAGGAAGATTGTTCTGTCGTTATAAACACCATTCACGGGGGTTTGAACGGGGGACAGTCGGGTCAAAAAAGAAAGGCTCTAGCCTCGGAA
GCAGCGCACGAAGACAGGGAAGGGGTGTATATGCCTCATAACGACGCTTTGATAATCGCCCCACCGATAGACCATGTGAAGGTCAGAAGAGTTCCTGTCGACGGTGGGTC
GTCGGCTAATGTATTGTCCTTCTCGACCTACACGGCCCTGGGGTGGGAGAGAAGAAACTTGAAGCGTAGCCCGACACCCTTGGTTGGCTTTCAGGGGATTTGGTTAGCGC
GGAAGGATGTGTCTCGTTCCCTGTCACCATCGGCGAAGGAGATTAGCAAGTGA
Protein sequenceShow/hide protein sequence
MLLKLSPSHFRTVKYQDNESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFGSRPPASLKEMLTRARQYIDGLELWKANGARRSSRGKDWDRKSPHPKKQR
DHRGPKFDRFTPLNASIAEIYAAVEDTDLEALFAAPEKLRRPSEKRDKGYLKKYVGSRERAELEGSAREEKRERSPPPRRKEDCSVVINTIHGGLNGGQSGQKRKALASE
AAHEDREGVYMPHNDALIIAPPIDHVKVRRVPVDGGSSANVLSFSTYTALGWERRNLKRSPTPLVGFQGIWLARKDVSRSLSPSAKEISK