; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g00880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g00880
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:547476..549284
RNA-Seq ExpressionMoc01g00880
SyntenyMoc01g00880
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.9e-8282.84Show/hide
Query:  AVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQSSEPATPAVAG
        +V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSS+P TPAV G
Subjt:  AVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQSSEPATPAVAG

Query:  PASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE---------------------KVGARGALPASFADRVDDPEARMGGTSDVTTRFRIEPSSSG
        PASEDPAPVIELESS GPS+EKRPR QTEAVDVSPLGE                     +VGARG LPASFADRVDDPEARMGGT DVTTRFR+EPSSSG
Subjt:  PASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE---------------------KVGARGALPASFADRVDDPEARMGGTSDVTTRFRIEPSSSG

Query:  VRDQ
        VRDQ
Subjt:  VRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.0e-6763.53Show/hide
Query:  PEGWALSTSKCLSTASDFLFTLLSKTRDSEEAELLDVDQLLACFEAKRIAKKPGRSNSILSCA-------------------------------------
        P GW +  +  +      LF L  + RDSEEAELLDVDQLLACFEAKRIAKKPGR      CA                                     
Subjt:  PEGWALSTSKCLSTASDFLFTLLSKTRDSEEAELLDVDQLLACFEAKRIAKKPGRSNSILSCA-------------------------------------

Query:  -----------VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQS
                   VSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEAAQS
Subjt:  -----------VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQS

Query:  SEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAV-------DVSPLGE
        S+P TPAV GPASEDPAPVIELESSGGPS+EKRPR QTEAV       DV PLGE
Subjt:  SEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAV-------DVSPLGE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]4.7e-10565.08Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELDEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPKEGERADNPPEGWALSTSKCLS--
        MSSS SS+L  + DLARRLES+L+EIEN R SDD EDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+P+EGERADNPPEGW     K     
Subjt:  MSSSFSSDLGSDEDLARRLESELDEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPKEGERADNPPEGWALSTSKCLS--

Query:  -------TASDFLF----------------------TLLSKTRDSEEAELLDVDQLLACFEAKRIAKKPGRSNSILSCA---------------------
                  +FLF                          + RDSEEAEL DVDQLLACFEAKRIAKKPGR      CA                     
Subjt:  -------TASDFLF----------------------TLLSKTRDSEEAELLDVDQLLACFEAKRIAKKPGRSNSILSCA---------------------

Query:  ---------------------------VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIV
                                   VSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS V
Subjt:  ---------------------------VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIV

Query:  KRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAVD
        KRKSKGRAHALEAAQSS+PATPAV GPASEDPA VIELESSGGPS+EKRPR QTEAVD
Subjt:  KRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]5.8e-7977.92Show/hide
Query:  MVCGFASIVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE---------------------KVGARG
        MVCGFAS VKRKSKGRAHA EAAQSS+PATPAVAGPASEDPAPVIELESSGGPS+EKRPR QTEAVD  PLGE                     +VGA G
Subjt:  MVCGFASIVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE---------------------KVGARG

Query:  ALPASFADRVDDPEARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
         LPASFADRVDDPEARMGGTSDVT RFR++PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
Subjt:  ALPASFADRVDDPEARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLRAQSEVDILK
        EKEEFS ALEA       EL  A +E++  K
Subjt:  EKEEFSAALEAASSTMKDELLRAQSEVDILK

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.4e-9366.03Show/hide
Query:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQSSEPATPAV---
        VSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    +EP TP V   
Subjt:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQSSEPATPAV---

Query:  -----AGPASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE--------------------KVGARGALPASFADRVDDPEARMGGTSDVTTRFRI
             +GP+S  P PVIEL+ SGG S EKR R ++EA+DVSPL E                    + GARG LP S AD VDDPEARM GTS+V  RF +
Subjt:  -----AGPASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE--------------------KVGARGALPASFADRVDDPEARMGGTSDVTTRFRI

Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAQSEVDI
        EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELL+AQ EVDI
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAQSEVDI

Query:  LKAEVEAKAELL
        L+AEV+AK +LL
Subjt:  LKAEVEAKAELL

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092989.3e-8382.84Show/hide
Query:  AVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQSSEPATPAVAG
        +V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSS+P TPAV G
Subjt:  AVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQSSEPATPAVAG

Query:  PASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE---------------------KVGARGALPASFADRVDDPEARMGGTSDVTTRFRIEPSSSG
        PASEDPAPVIELESS GPS+EKRPR QTEAVDVSPLGE                     +VGARG LPASFADRVDDPEARMGGT DVTTRFR+EPSSSG
Subjt:  PASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE---------------------KVGARGALPASFADRVDDPEARMGGTSDVTTRFRIEPSSSG

Query:  VRDQ
        VRDQ
Subjt:  VRDQ

A0A6J1CR42 uncharacterized protein LOC1110138261.4e-6763.53Show/hide
Query:  PEGWALSTSKCLSTASDFLFTLLSKTRDSEEAELLDVDQLLACFEAKRIAKKPGRSNSILSCA-------------------------------------
        P GW +  +  +      LF L  + RDSEEAELLDVDQLLACFEAKRIAKKPGR      CA                                     
Subjt:  PEGWALSTSKCLSTASDFLFTLLSKTRDSEEAELLDVDQLLACFEAKRIAKKPGRSNSILSCA-------------------------------------

Query:  -----------VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQS
                   VSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEAAQS
Subjt:  -----------VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQS

Query:  SEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAV-------DVSPLGE
        S+P TPAV GPASEDPAPVIELESSGGPS+EKRPR QTEAV       DV PLGE
Subjt:  SEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAV-------DVSPLGE

A0A6J1DXS5 uncharacterized protein LOC1110255022.3e-10565.08Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELDEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPKEGERADNPPEGWALSTSKCLS--
        MSSS SS+L  + DLARRLES+L+EIEN R SDD EDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+P+EGERADNPPEGW     K     
Subjt:  MSSSFSSDLGSDEDLARRLESELDEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPKEGERADNPPEGWALSTSKCLS--

Query:  -------TASDFLF----------------------TLLSKTRDSEEAELLDVDQLLACFEAKRIAKKPGRSNSILSCA---------------------
                  +FLF                          + RDSEEAEL DVDQLLACFEAKRIAKKPGR      CA                     
Subjt:  -------TASDFLF----------------------TLLSKTRDSEEAELLDVDQLLACFEAKRIAKKPGRSNSILSCA---------------------

Query:  ---------------------------VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIV
                                   VSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS V
Subjt:  ---------------------------VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIV

Query:  KRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAVD
        KRKSKGRAHALEAAQSS+PATPAV GPASEDPA VIELESSGGPS+EKRPR QTEAVD
Subjt:  KRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC1110256062.8e-7977.92Show/hide
Query:  MVCGFASIVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE---------------------KVGARG
        MVCGFAS VKRKSKGRAHA EAAQSS+PATPAVAGPASEDPAPVIELESSGGPS+EKRPR QTEAVD  PLGE                     +VGA G
Subjt:  MVCGFASIVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE---------------------KVGARG

Query:  ALPASFADRVDDPEARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
         LPASFADRVDDPEARMGGTSDVT RFR++PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
Subjt:  ALPASFADRVDDPEARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLRAQSEVDILK
        EKEEFS ALEA       EL  A +E++  K
Subjt:  EKEEFSAALEAASSTMKDELLRAQSEVDILK

A0A6J1DZB3 uncharacterized protein LOC1110256656.9e-9466.03Show/hide
Query:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQSSEPATPAV---
        VSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    +EP TP V   
Subjt:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASIVKRKSKGRAHALEAAQSSEPATPAV---

Query:  -----AGPASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE--------------------KVGARGALPASFADRVDDPEARMGGTSDVTTRFRI
             +GP+S  P PVIEL+ SGG S EKR R ++EA+DVSPL E                    + GARG LP S AD VDDPEARM GTS+V  RF +
Subjt:  -----AGPASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGE--------------------KVGARGALPASFADRVDDPEARMGGTSDVTTRFRI

Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAQSEVDI
        EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELL+AQ EVDI
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAQSEVDI

Query:  LKAEVEAKAELL
        L+AEV+AK +LL
Subjt:  LKAEVEAKAELL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGTAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGATGAGATAGAAAACTTTAGGTTCTCCGACGATAGGGAGGA
TAGTGATGCTTCCACCTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCCGAGCACTACCTCGGATCCCTCCGTAGGGGGTTCGCTATCCCTGAAAACATCCTCCTTA
GGATTCCGAAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGCACTCTCTACTTCAAAATGTTTGAGTACAGCCTCAGACTTCCTCTTCACCCTTTTGTCCAAG
ACTCGGGATAGTGAAGAGGCCGAGCTGTTGGATGTTGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATTGCTAAGAAGCCTGGTCGGTCTAACTCGATTTTGTCTTG
TGCAGTATCAATCCGGCCAGTCCCCGAGCTTACTCAAGCCTCTTTCGACACATTGAAATATTACAAGGAGCACTTTCCGAGGGGCAGGAAGGTCGGAACCTTGGTGACCG
ACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCTTCAAGGCCGAACTCCGAACTAGCAATGGTTTGCGGATTTGCAAGTATT
GTGAAACGCAAGTCCAAGGGTCGAGCCCATGCTCTCGAGGCCGCCCAGAGTTCGGAACCTGCAACTCCTGCTGTGGCAGGGCCAGCCTCAGAAGATCCAGCCCCAGTGAT
CGAGTTGGAGTCTTCTGGAGGTCCTTCGCAGGAGAAGCGCCCAAGGGGTCAGACCGAGGCGGTGGACGTTTCGCCCTTGGGCGAGAAGGTCGGAGCTCGTGGGGCCCTAC
CTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGGGCGGGACGTCCGATGTGACAACACGGTTCAGAATTGAACCGTCAAGCTCTGGGGTGAGGGACCAG
GTGTCTCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCTAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATTGACTATGCCGCCGAGGC
GTTTGTTGCTTCCATTCAATCGGCCCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTT
CCTCCACCATGAAGGATGAGCTGCTAAGGGCTCAATCTGAGGTGGACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAACTACTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGTAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGATGAGATAGAAAACTTTAGGTTCTCCGACGATAGGGAGGA
TAGTGATGCTTCCACCTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCCGAGCACTACCTCGGATCCCTCCGTAGGGGGTTCGCTATCCCTGAAAACATCCTCCTTA
GGATTCCGAAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGCACTCTCTACTTCAAAATGTTTGAGTACAGCCTCAGACTTCCTCTTCACCCTTTTGTCCAAG
ACTCGGGATAGTGAAGAGGCCGAGCTGTTGGATGTTGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATTGCTAAGAAGCCTGGTCGGTCTAACTCGATTTTGTCTTG
TGCAGTATCAATCCGGCCAGTCCCCGAGCTTACTCAAGCCTCTTTCGACACATTGAAATATTACAAGGAGCACTTTCCGAGGGGCAGGAAGGTCGGAACCTTGGTGACCG
ACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCTTCAAGGCCGAACTCCGAACTAGCAATGGTTTGCGGATTTGCAAGTATT
GTGAAACGCAAGTCCAAGGGTCGAGCCCATGCTCTCGAGGCCGCCCAGAGTTCGGAACCTGCAACTCCTGCTGTGGCAGGGCCAGCCTCAGAAGATCCAGCCCCAGTGAT
CGAGTTGGAGTCTTCTGGAGGTCCTTCGCAGGAGAAGCGCCCAAGGGGTCAGACCGAGGCGGTGGACGTTTCGCCCTTGGGCGAGAAGGTCGGAGCTCGTGGGGCCCTAC
CTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGGGCGGGACGTCCGATGTGACAACACGGTTCAGAATTGAACCGTCAAGCTCTGGGGTGAGGGACCAG
GTGTCTCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCTAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATTGACTATGCCGCCGAGGC
GTTTGTTGCTTCCATTCAATCGGCCCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTT
CCTCCACCATGAAGGATGAGCTGCTAAGGGCTCAATCTGAGGTGGACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAACTACTGTAG
Protein sequenceShow/hide protein sequence
MSSSFSSDLGSDEDLARRLESELDEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPKEGERADNPPEGWALSTSKCLSTASDFLFTLLSK
TRDSEEAELLDVDQLLACFEAKRIAKKPGRSNSILSCAVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASI
VKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSQEKRPRGQTEAVDVSPLGEKVGARGALPASFADRVDDPEARMGGTSDVTTRFRIEPSSSGVRDQ
VSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAQSEVDILKAEVEAKAELL