; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g21560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g21560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:15771585..15777358
RNA-Seq ExpressionMoc07g21560
SyntenyMoc07g21560
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]3.0e-7778.92Show/hide
Query:  AVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAG
        +V+IRPVPELTQASFDT KYYKEHFPRGRKVGTLVTDKLLLESGLLDYN A+RPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSS+P T AV G
Subjt:  AVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAG

Query:  PTSEDPAPVIELESSGGPSRE-------------------------KRRKKKKKTTSPLEVGAHGALPASFADRVDDPEARMRGTSDVTTRFRIEPSSSG
        P SEDPAPVIELESS GPSRE                         KRR+KKKKTTSPLEVGA G LPASFADRVDDPEARM GT DVTTRFR+EPSSSG
Subjt:  PTSEDPAPVIELESSGGPSRE-------------------------KRRKKKKKTTSPLEVGAHGALPASFADRVDDPEARMRGTSDVTTRFRIEPSSSG

Query:  VRDQ
        VRDQ
Subjt:  VRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]4.9e-8874.5Show/hide
Query:  MFEYDLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPVR--------TSQVVPSLTFPLGLTR
        MFEY LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKP R           +V   T   G  R
Subjt:  MFEYDLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPVR--------TSQVVPSLTFPLGLTR

Query:  ILSCA------------------------VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFAS
            A                        VSIRPVPELTQASFDT KYYKE FPRGRKVGTLVTD+LLLESGLLDYN A+RPIE SRPNS LAMVC FAS
Subjt:  ILSCA------------------------VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSREKR
         VKRKSKGRAHALEAAQSS+P T AV GP SEDPAPVIELESSGGPSREKR
Subjt:  NVKRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSREKR

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.2e-11869.63Show/hide
Query:  SDASDFREDPSRSLITRLEPLVGRSLPSHFPFRIIVSMSSSFSSDLGYDEDLARRIPEHYLGSLRRGFAIPENILLRIPEEGEKADNPPEGWVTLYFKMF
        S +S+   D +R L ++LE +           RI      S +S  G   +   RIPEHYLGSLRRGFAIPENILLR+PEEGE+ADNPPEGWVTLYFKMF
Subjt:  SDASDFREDPSRSLITRLEPLVGRSLPSHFPFRIIVSMSSSFSSDLGYDEDLARRIPEHYLGSLRRGFAIPENILLRIPEEGEKADNPPEGWVTLYFKMF

Query:  EYDLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPVR--------TSQVVPSLTFPLGLTRIL
        EY LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKP R           +V   T   G  R  
Subjt:  EYDLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPVR--------TSQVVPSLTFPLGLTRIL

Query:  SCA------------------------VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNV
          A                        VSIRPVPELTQASFDT KYYKE FPRGRKVGTLVTD+LLLESGLLDYN A+RPIESSRPNSELAMVCGFAS V
Subjt:  SCA------------------------VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNV

Query:  KRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSREKR
        KRKSKGRAHALEAAQSS+PAT AV GP SEDPA VIELESSGGPSREKR
Subjt:  KRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSREKR

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]2.4e-7475.32Show/hide
Query:  MVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSRE-------------------------KRRKKKKKTTSPLEVGAHG
        MVCGFAS+VKRKSKGRAHA EAAQSS+PAT AVAGP SEDPAPVIELESSGGPSRE                         KRR+KKKKT SPLEVGA G
Subjt:  MVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSRE-------------------------KRRKKKKKTTSPLEVGAHG

Query:  ALPASFADRVDDPEARMRGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEVFVASIQLALAVKAELDGREVLAAR
         LPASFADRVDDPEARM GTSDVT RFR++PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE FVASIQ ALAVKAELDGREVLAAR
Subjt:  ALPASFADRVDDPEARMRGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEVFVASIQLALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSIMKDELLRAQFEVDILK
        EKEEFS ALEA       EL  A  E++  K
Subjt:  EKEEFSAALEAASSIMKDELLRAQFEVDILK

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.3e-10465.71Show/hide
Query:  VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAV---
        VSI+ +PEL QA+FDT K+YK+HFPR RK+ TLVTDKLLLESGLLDYN  +R IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    +EP T  V   
Subjt:  VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAV---

Query:  -----AGPTSEDPAPVIELESSGGPSREK------------------------RRKKKKKTTSPLEVGAHGALPASFADRVDDPEARMRGTSDVTTRFRI
             +GP+S  P PVIEL+ SGG S EK                        RR+KKKKT+S  E GA G LP S AD VDDPEARMRGTS+V  RF +
Subjt:  -----AGPTSEDPAPVIELESSGGPSREK------------------------RRKKKKKTTSPLEVGAHGALPASFADRVDDPEARMRGTSDVTTRFRI

Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEVFVASIQLALAVKAELDGREVLAAREKEEFSAALEAASSIMKDELLRAQFEVDI
        EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE F+ASI LA+ VKAELDGRE LAA+E+E   AALEAA++ +K ELL+AQ EVDI
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEVFVASIQLALAVKAELDGREVLAAREKEEFSAALEAASSIMKDELLRAQFEVDI

Query:  LKAEVEAKAELLKKEEDSRKAQLRAAHATTKGLEKEKFQLLKEKDDM
        L+AEV+AK +LLKKE +  KA LRAAHA TKGLEKEKFQLLKEKDD+
Subjt:  LKAEVEAKAELLKKEEDSRKAQLRAAHATTKGLEKEKFQLLKEKDDM

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.4e-7778.92Show/hide
Query:  AVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAG
        +V+IRPVPELTQASFDT KYYKEHFPRGRKVGTLVTDKLLLESGLLDYN A+RPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSS+P T AV G
Subjt:  AVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAG

Query:  PTSEDPAPVIELESSGGPSRE-------------------------KRRKKKKKTTSPLEVGAHGALPASFADRVDDPEARMRGTSDVTTRFRIEPSSSG
        P SEDPAPVIELESS GPSRE                         KRR+KKKKTTSPLEVGA G LPASFADRVDDPEARM GT DVTTRFR+EPSSSG
Subjt:  PTSEDPAPVIELESSGGPSRE-------------------------KRRKKKKKTTSPLEVGAHGALPASFADRVDDPEARMRGTSDVTTRFRIEPSSSG

Query:  VRDQ
        VRDQ
Subjt:  VRDQ

A0A6J1CR42 uncharacterized protein LOC1110138262.4e-8874.5Show/hide
Query:  MFEYDLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPVR--------TSQVVPSLTFPLGLTR
        MFEY LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKP R           +V   T   G  R
Subjt:  MFEYDLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPVR--------TSQVVPSLTFPLGLTR

Query:  ILSCA------------------------VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFAS
            A                        VSIRPVPELTQASFDT KYYKE FPRGRKVGTLVTD+LLLESGLLDYN A+RPIE SRPNS LAMVC FAS
Subjt:  ILSCA------------------------VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSREKR
         VKRKSKGRAHALEAAQSS+P T AV GP SEDPAPVIELESSGGPSREKR
Subjt:  NVKRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSREKR

A0A6J1DXS5 uncharacterized protein LOC1110255025.8e-11969.63Show/hide
Query:  SDASDFREDPSRSLITRLEPLVGRSLPSHFPFRIIVSMSSSFSSDLGYDEDLARRIPEHYLGSLRRGFAIPENILLRIPEEGEKADNPPEGWVTLYFKMF
        S +S+   D +R L ++LE +           RI      S +S  G   +   RIPEHYLGSLRRGFAIPENILLR+PEEGE+ADNPPEGWVTLYFKMF
Subjt:  SDASDFREDPSRSLITRLEPLVGRSLPSHFPFRIIVSMSSSFSSDLGYDEDLARRIPEHYLGSLRRGFAIPENILLRIPEEGEKADNPPEGWVTLYFKMF

Query:  EYDLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPVR--------TSQVVPSLTFPLGLTRIL
        EY LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKP R           +V   T   G  R  
Subjt:  EYDLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPVR--------TSQVVPSLTFPLGLTRIL

Query:  SCA------------------------VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNV
          A                        VSIRPVPELTQASFDT KYYKE FPRGRKVGTLVTD+LLLESGLLDYN A+RPIESSRPNSELAMVCGFAS V
Subjt:  SCA------------------------VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNV

Query:  KRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSREKR
        KRKSKGRAHALEAAQSS+PAT AV GP SEDPA VIELESSGGPSREKR
Subjt:  KRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSREKR

A0A6J1DXZ1 uncharacterized protein LOC1110256061.1e-7475.32Show/hide
Query:  MVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSRE-------------------------KRRKKKKKTTSPLEVGAHG
        MVCGFAS+VKRKSKGRAHA EAAQSS+PAT AVAGP SEDPAPVIELESSGGPSRE                         KRR+KKKKT SPLEVGA G
Subjt:  MVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSRE-------------------------KRRKKKKKTTSPLEVGAHG

Query:  ALPASFADRVDDPEARMRGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEVFVASIQLALAVKAELDGREVLAAR
         LPASFADRVDDPEARM GTSDVT RFR++PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE FVASIQ ALAVKAELDGREVLAAR
Subjt:  ALPASFADRVDDPEARMRGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEVFVASIQLALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSIMKDELLRAQFEVDILK
        EKEEFS ALEA       EL  A  E++  K
Subjt:  EKEEFSAALEAASSIMKDELLRAQFEVDILK

A0A6J1DZB3 uncharacterized protein LOC1110256656.2e-10565.71Show/hide
Query:  VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAV---
        VSI+ +PEL QA+FDT K+YK+HFPR RK+ TLVTDKLLLESGLLDYN  +R IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    +EP T  V   
Subjt:  VSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATLAV---

Query:  -----AGPTSEDPAPVIELESSGGPSREK------------------------RRKKKKKTTSPLEVGAHGALPASFADRVDDPEARMRGTSDVTTRFRI
             +GP+S  P PVIEL+ SGG S EK                        RR+KKKKT+S  E GA G LP S AD VDDPEARMRGTS+V  RF +
Subjt:  -----AGPTSEDPAPVIELESSGGPSREK------------------------RRKKKKKTTSPLEVGAHGALPASFADRVDDPEARMRGTSDVTTRFRI

Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEVFVASIQLALAVKAELDGREVLAAREKEEFSAALEAASSIMKDELLRAQFEVDI
        EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE F+ASI LA+ VKAELDGRE LAA+E+E   AALEAA++ +K ELL+AQ EVDI
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEVFVASIQLALAVKAELDGREVLAAREKEEFSAALEAASSIMKDELLRAQFEVDI

Query:  LKAEVEAKAELLKKEEDSRKAQLRAAHATTKGLEKEKFQLLKEKDDM
        L+AEV+AK +LLKKE +  KA LRAAHA TKGLEKEKFQLLKEKDD+
Subjt:  LKAEVEAKAELLKKEEDSRKAQLRAAHATTKGLEKEKFQLLKEKDDM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTAGATTGTGATCAAATTTTGATACTTGAACCTGGAAATGGGTCCGCTTCTGATATTGAGGTGGAAGTGGATGAATTGTTTGGAATTGAATTGTGTGTGAACCT
TTCAGTAAATTTGAATCCTCCACTCCGAAATGAGGAAGCACCTTCACCGATGTTTACTGAAATAGATTTTGAGATTATAGACTCTATACGTGAAGAAGCTCGAACTCGGT
CTCCGGACCGATCTGAACACTTGTGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGTTGCTCACATCGGACCCACCGAGCTTCCCGGTAGATCG
GACCTTGATCAGGTCGCACCTCGGCCCTCATACTTAGCACCTGTCAACGCTAGTGGTGGGTATTCTCTCCCCCAAACATTGGCCCCCTCTCTGTCTGGTTCGATCTCGAC
CTGGCAGAGAAGTTCATTCGACCCGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATATAACTGTTGCGGTAGATTTATCGTCGGAATATTCAAATATTCCGACG
CTTCGGATTTTCGGGAAGATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCACTTTCCCTTTCGAATCATAGTTTCCATGTCG
TCCTCTTTTAGTAGCGACTTAGGATACGATGAGGATTTAGCTCGTAGGATACCCGAGCACTACCTCGGATCCCTCCGTAGGGGGTTCGCTATCCCTGAAAACATCCTTCT
TAGGATTCCGGAGGAGGGGGAGAAAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGACCTCCGACTTCCCCTTCACCCTTTTGTCC
AAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGGGATAGTGAA
GAGGCCGAGCTGTTGGATGTTGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGTTCGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACT
AGGTCTAACTCGGATTTTGTCTTGTGCAGTATCAATCCGGCCAGTCCCCGAGCTTACTCAAGCCTCCTTCGACACATTTAAATATTACAAGGAGCACTTTCCGAGGGGCA
GGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACTCTGCAATTCGTCCCATTGAATCTTCAAGGCCGAACTCCGAACTAGCC
ATGGTTTGCGGATTTGCAAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTCGAGGCCGCCCAGAGTTCGGAACCTGCAACTCTTGCTGTGGCAGGGCCAAC
CTCAGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGAGGTCCTTCGCGGGAGAAGCGGAGGAAGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTC
ACGGGGCCCTACCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGCGCGGGACGTCCGATGTGACAACACGGTTCAGAATTGAACCGTCAAGCTCTGGG
GTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATTGACTA
CGCCGCCGAGGTGTTTGTTGCTTCCATTCAATTGGCCCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCT
TGGAGGCTGCTTCCTCCATCATGAAGGATGAGCTGCTAAGGGCTCAATTTGAGGTGGACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAGGAAGAG
GACAGCCGCAAGGCCCAGCTTCGAGCTGCCCATGCTACCACCAAAGGCCTGGAGAAGGAGAAGTTTCAACTTCTCAAGGAGAAGGACGACATGCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATTAGATTGTGATCAAATTTTGATACTTGAACCTGGAAATGGGTCCGCTTCTGATATTGAGGTGGAAGTGGATGAATTGTTTGGAATTGAATTGTGTGTGAACCT
TTCAGTAAATTTGAATCCTCCACTCCGAAATGAGGAAGCACCTTCACCGATGTTTACTGAAATAGATTTTGAGATTATAGACTCTATACGTGAAGAAGCTCGAACTCGGT
CTCCGGACCGATCTGAACACTTGTGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGTTGCTCACATCGGACCCACCGAGCTTCCCGGTAGATCG
GACCTTGATCAGGTCGCACCTCGGCCCTCATACTTAGCACCTGTCAACGCTAGTGGTGGGTATTCTCTCCCCCAAACATTGGCCCCCTCTCTGTCTGGTTCGATCTCGAC
CTGGCAGAGAAGTTCATTCGACCCGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATATAACTGTTGCGGTAGATTTATCGTCGGAATATTCAAATATTCCGACG
CTTCGGATTTTCGGGAAGATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCACTTTCCCTTTCGAATCATAGTTTCCATGTCG
TCCTCTTTTAGTAGCGACTTAGGATACGATGAGGATTTAGCTCGTAGGATACCCGAGCACTACCTCGGATCCCTCCGTAGGGGGTTCGCTATCCCTGAAAACATCCTTCT
TAGGATTCCGGAGGAGGGGGAGAAAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGACCTCCGACTTCCCCTTCACCCTTTTGTCC
AAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGGGATAGTGAA
GAGGCCGAGCTGTTGGATGTTGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGTTCGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACT
AGGTCTAACTCGGATTTTGTCTTGTGCAGTATCAATCCGGCCAGTCCCCGAGCTTACTCAAGCCTCCTTCGACACATTTAAATATTACAAGGAGCACTTTCCGAGGGGCA
GGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACTCTGCAATTCGTCCCATTGAATCTTCAAGGCCGAACTCCGAACTAGCC
ATGGTTTGCGGATTTGCAAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTCGAGGCCGCCCAGAGTTCGGAACCTGCAACTCTTGCTGTGGCAGGGCCAAC
CTCAGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGAGGTCCTTCGCGGGAGAAGCGGAGGAAGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTC
ACGGGGCCCTACCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGCGCGGGACGTCCGATGTGACAACACGGTTCAGAATTGAACCGTCAAGCTCTGGG
GTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATTGACTA
CGCCGCCGAGGTGTTTGTTGCTTCCATTCAATTGGCCCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCT
TGGAGGCTGCTTCCTCCATCATGAAGGATGAGCTGCTAAGGGCTCAATTTGAGGTGGACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAGGAAGAG
GACAGCCGCAAGGCCCAGCTTCGAGCTGCCCATGCTACCACCAAAGGCCTGGAGAAGGAGAAGTTTCAACTTCTCAAGGAGAAGGACGACATGCTCTAG
Protein sequenceShow/hide protein sequence
MSLDCDQILILEPGNGSASDIEVEVDELFGIELCVNLSVNLNPPLRNEEAPSPMFTEIDFEIIDSIREEARTRSPDRSEHLCGPAQKGEHSDDQVSIVAHIGPTELPGRS
DLDQVAPRPSYLAPVNASGGYSLPQTLAPSLSGSISTWQRSSFDPLWTRGDFLFVGKYNCCGRFIVGIFKYSDASDFREDPSRSLITRLEPLVGRSLPSHFPFRIIVSMS
SSFSSDLGYDEDLARRIPEHYLGSLRRGFAIPENILLRIPEEGEKADNPPEGWVTLYFKMFEYDLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSE
EAELLDVDQLLACFEAKRIAKKPVRTSQVVPSLTFPLGLTRILSCAVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAIRPIESSRPNSELA
MVCGFASNVKRKSKGRAHALEAAQSSEPATLAVAGPTSEDPAPVIELESSGGPSREKRRKKKKKTTSPLEVGAHGALPASFADRVDDPEARMRGTSDVTTRFRIEPSSSG
VRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEVFVASIQLALAVKAELDGREVLAAREKEEFSAALEAASSIMKDELLRAQFEVDILKAEVEAKAELLKKEE
DSRKAQLRAAHATTKGLEKEKFQLLKEKDDML