; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g15210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g15210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:9596898..9598825
RNA-Seq ExpressionMoc01g15210
SyntenyMoc01g15210
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]2.0e-11386.61Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR
        AVRPIESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TPAV G ASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE PLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.5e-14094.14Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
         VKRKSKGRAHALEAAQSSKP TPAV G ASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  SVKRKSKGRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]4.0e-10698.96Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]1.2e-10597.42Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.8e-16386.97Show/hide
Query:  MSSSFSSNLGSDLARRLGSKLEEIENFRFSDDGEDSDASTS---------------GFRRRG----------------RADNPPEGWVTLYFKMFEYGLR
        MSSS SSNL SDLARRL SKLEEIEN R SDDGEDSDASTS               G  RRG                RADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSFSSNLGSDLARRLGSKLEEIENFRFSDDGEDSDASTS---------------GFRRRG----------------RADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK

Query:  GRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKP TPAV G ASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVD

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092989.7e-11486.61Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR
        AVRPIESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TPAV G ASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE PLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138261.2e-14094.14Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
         VKRKSKGRAHALEAAQSSKP TPAV G ASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  SVKRKSKGRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

A0A6J1DWD2 uncharacterized protein LOC1110246802.0e-10698.96Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DWF1 uncharacterized protein LOC1110251085.7e-10697.42Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255021.3e-16386.97Show/hide
Query:  MSSSFSSNLGSDLARRLGSKLEEIENFRFSDDGEDSDASTS---------------GFRRRG----------------RADNPPEGWVTLYFKMFEYGLR
        MSSS SSNL SDLARRL SKLEEIEN R SDDGEDSDASTS               G  RRG                RADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSFSSNLGSDLARRLGSKLEEIENFRFSDDGEDSDASTS---------------GFRRRG----------------RADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK

Query:  GRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKP TPAV G ASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCATCTTGGAGAACCAATAGGGGTCTTCCACGTGTCCAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCAGAAGATTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGGG
AGGATCCTAGCCGCTTGTTGATTACACGTTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCT
TTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGTTAGGGTCCAAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTC
AGGCTTCCGGAGGAGGGGGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAG
AATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGCGTCATTTTCGCTTTGGCCATCATTTTTTGGCTACGAGCTCGGGATAGTGAGGAG
GCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCTAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGAGCATGCGGTATAGT
TAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGT
TTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTG
GTGACTGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGC
AAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTACCACCCCTGCCGTGGCAGGGCTTGCCTCGGAAGATCCAGCCC
CAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCTAGGGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGTGAGGAGGTGAGAGAGGAAGCC
CCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTACCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAG
GATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGACCAAGTGTCCTGCATCTTAGCTGCAAGTTTGGACCGCTGCCTAA
GGAGGGCGTCCAAATTTGTGAGTGACCCTGGGTCCGTTCGTCAGTGGTTTTGGCATCGCACCTCGTACCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCCATCTTGGAGAACCAATAGGGGTCTTCCACGTGTCCAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCAGAAGATTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGGG
AGGATCCTAGCCGCTTGTTGATTACACGTTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCT
TTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGTTAGGGTCCAAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTC
AGGCTTCCGGAGGAGGGGGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAG
AATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGCGTCATTTTCGCTTTGGCCATCATTTTTTGGCTACGAGCTCGGGATAGTGAGGAG
GCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCTAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGAGCATGCGGTATAGT
TAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGT
TTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTG
GTGACTGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGC
AAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTACCACCCCTGCCGTGGCAGGGCTTGCCTCGGAAGATCCAGCCC
CAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCTAGGGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGTGAGGAGGTGAGAGAGGAAGCC
CCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTACCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAG
GATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGACCAAGTGTCCTGCATCTTAGCTGCAAGTTTGGACCGCTGCCTAA
GGAGGGCGTCCAAATTTGTGAGTGACCCTGGGTCCGTTCGTCAGTGGTTTTGGCATCGCACCTCGTACCCTTAG
Protein sequenceShow/hide protein sequence
MSHLGEPIGVFHVSRVFSSPNIGPLSVWSDLDLAEKFIRLALDTWRLPIRGKIQPSQKIIVGIFKYSDASDLREDPSRLLITRFDLKAARTLGRSVSSLSLSNVVAMSSS
FSSNLGSDLARRLGSKLEEIENFRFSDDGEDSDASTSGFRRRGRADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEE
AELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTL
VTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPTTPAVAGLASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEA
PLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSCILAASLDRCLRRASKFVSDPGSVRQWFWHRTSYP