CuGenDBv2

Moc01g01290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g01290
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr1:944269..945765
RNA-Seq Expression: Moc01g01290
Synteny: Moc01g01290
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits    e value    % identity    Alignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]    6.7e-79    68.78
Query:  MCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNSQLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNPLVRPV
        MCARKGACGIVK P SIKGWV+KWF+ASG WLAK+ES +    VP        +LTQAS+DTLKYYK+ FP GRKVGTLVTD+LLLESGLLDYNP VRP+
Subjt:  MCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNSQLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNPLVRPV

Query:  EDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLC-EVREDSPLR
        E SRPNS+LAMVCGF  +VKRKSKG+AH L+  Q +KP TPAV         GP+SE P PVIEL+S+   SREKRPR+++EA+DVSPL  EVRE+ PL+
Subjt:  EDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLC-EVREDSPLR

Query:  RRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGG
        RRRKKKKTTS LEVG R  LP+S AD VDDPEARMGG
Subjt:  RRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGG

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]    6.5e-58    46.88
Query:  QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAG
        +L QA++DTLK+YKD FP GRK+GTLVTD+LLLESGLLDYNPLVRP+E SRPNS+LAMVCGFT SVKRKSKGRAH LK VQ + P TPAV +   QD+AG
Subjt:  QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAG

Query:  PSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLCEVREDSPLRRRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGGPWTAASEGHPKFVSDLG
        PSS  PTPVIELDS GE SREKR R+ESEALDVSPL EVR                                                            
Subjt:  PSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLCEVREDSPLRRRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGGPWTAASEGHPKFVSDLG

Query:  SILQRTIDHAAEAIIASIHSAIMVKTKLDGREILAARESANSSATLEAATIMKGELLKARFELETLKAEVEAKAQLLKREEEKHKAHLRAAHAITKGLEK
                                                                              EAKA+LLKRE+E+HKAHLRAAHAITKGLEK
Subjt:  SILQRTIDHAAEAIIASIHSAIMVKTKLDGREILAARESANSSATLEAATIMKGELLKARFELETLKAEVEAKAQLLKREEEKHKAHLRAAHAITKGLEK

Query:  EKFQLLKEKDDLAQILEEMD
        EKFQLLKEKDD+ Q LE  D
Subjt:  EKFQLLKEKDDLAQILEEMD

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]    2.9e-66    68.34
Query:  IAKKPGRYYMCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLL
        IAKKPGR+YMCARKGA GIVK P SIKGWV+KWF+ASG WLAK+ES   FFDVP       S     +LTQAS+DTLKYYK+RFP GRKVGTLVTD LLL
Subjt:  IAKKPGRYYMCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLL

Query:  ESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALD
        ESGLLDYNP VRP+E SRPNS LAMVC F   VKRKSKGRAH L+  Q +KP TPAV         GP+SE P PVIEL+S+G  SREKRPR+++EA+D
Subjt:  ESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALD

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]    1.7e-66    68.34
Query:  IAKKPGRYYMCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLL
        IAKKPGR+YMCARKGA GIVK P SIKGWV+KWF+ASG WLAK+ES   FFDVP       S     +LTQAS+DTLKYYK+RFP GRKVGTLVTD LLL
Subjt:  IAKKPGRYYMCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLL

Query:  ESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALD
        ESGLLDYNP VRP+E SRPNS+LAMVCGF   VKRKSKGRAH L+  Q +KP TPAV         GP+SE P  VIEL+S+G  SREKRPR+++EA+D
Subjt:  ESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]    7.5e-131    67.16
Query:  MCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNP
        MCARKG  GIVK P SIKGWV KWFFASG WLAK+ES   FFDVP       S     +L QA++DTLK+YKD FP  RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNP

Query:  LVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLCEVRED
        LVR +E SRPNS+LAMVCGFTGSVKRKSKGRAH LKTV GT+P TP V R   Q  +GPSS VPTPVIELD +G  S EKR R ESEALDVSPL EVR +
Subjt:  LVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLCEVRED

Query:  SPLRRRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGGP--------WTAASEG-------------------HPKFVSDLGSILQRTIDHAAEAII
        SPLRRRRKKKKT+SS E G R  LP+SHADLVDDPEARM G            +S G                     KFVSD GS+LQRTID+ AEA I
Subjt:  SPLRRRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGGP--------WTAASEG-------------------HPKFVSDLGSILQRTIDHAAEAII

Query:  ASIHSAIMVKTKLDGREILAARESANSSATLEAATIMKGELLKARFELETLKAEVEAKAQLLKREEEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQI
        ASIH A+MVK +LDGRE LAA+E  NS A LEAAT +KGELLKA+ E++ L+AEV+AK  LLK+E EKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQ+
Subjt:  ASIHSAIMVKTKLDGREILAARESANSSATLEAATIMKGELLKARFELETLKAEVEAKAQLLKREEEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQI

Query:  LEEMD
        LEE D
Subjt:  LEEMD

TrEMBL top hits    e value    % identity    Alignment
A0A6J1C8K9 uncharacterized protein LOC111009298    3.2e-79    68.78
Query:  MCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNSQLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNPLVRPV
        MCARKGACGIVK P SIKGWV+KWF+ASG WLAK+ES +    VP        +LTQAS+DTLKYYK+ FP GRKVGTLVTD+LLLESGLLDYNP VRP+
Subjt:  MCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNSQLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNPLVRPV

Query:  EDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLC-EVREDSPLR
        E SRPNS+LAMVCGF  +VKRKSKG+AH L+  Q +KP TPAV         GP+SE P PVIEL+S+   SREKRPR+++EA+DVSPL  EVRE+ PL+
Subjt:  EDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLC-EVREDSPLR

Query:  RRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGG
        RRRKKKKTTS LEVG R  LP+S AD VDDPEARMGG
Subjt:  RRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGG

A0A6J1CLV1 uncharacterized protein LOC111012467    3.2e-58    46.88
Query:  QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAG
        +L QA++DTLK+YKD FP GRK+GTLVTD+LLLESGLLDYNPLVRP+E SRPNS+LAMVCGFT SVKRKSKGRAH LK VQ + P TPAV +   QD+AG
Subjt:  QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAG

Query:  PSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLCEVREDSPLRRRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGGPWTAASEGHPKFVSDLG
        PSS  PTPVIELDS GE SREKR R+ESEALDVSPL EVR                                                            
Subjt:  PSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLCEVREDSPLRRRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGGPWTAASEGHPKFVSDLG

Query:  SILQRTIDHAAEAIIASIHSAIMVKTKLDGREILAARESANSSATLEAATIMKGELLKARFELETLKAEVEAKAQLLKREEEKHKAHLRAAHAITKGLEK
                                                                              EAKA+LLKRE+E+HKAHLRAAHAITKGLEK
Subjt:  SILQRTIDHAAEAIIASIHSAIMVKTKLDGREILAARESANSSATLEAATIMKGELLKARFELETLKAEVEAKAQLLKREEEKHKAHLRAAHAITKGLEK

Query:  EKFQLLKEKDDLAQILEEMD
        EKFQLLKEKDD+ Q LE  D
Subjt:  EKFQLLKEKDDLAQILEEMD

A0A6J1CR42 uncharacterized protein LOC111013826    1.4e-66    68.34
Query:  IAKKPGRYYMCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLL
        IAKKPGR+YMCARKGA GIVK P SIKGWV+KWF+ASG WLAK+ES   FFDVP       S     +LTQAS+DTLKYYK+RFP GRKVGTLVTD LLL
Subjt:  IAKKPGRYYMCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLL

Query:  ESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALD
        ESGLLDYNP VRP+E SRPNS LAMVC F   VKRKSKGRAH L+  Q +KP TPAV         GP+SE P PVIEL+S+G  SREKRPR+++EA+D
Subjt:  ESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALD

A0A6J1DXS5 uncharacterized protein LOC111025502    8.3e-67    68.34
Query:  IAKKPGRYYMCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLL
        IAKKPGR+YMCARKGA GIVK P SIKGWV+KWF+ASG WLAK+ES   FFDVP       S     +LTQAS+DTLKYYK+RFP GRKVGTLVTD LLL
Subjt:  IAKKPGRYYMCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLL

Query:  ESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALD
        ESGLLDYNP VRP+E SRPNS+LAMVCGF   VKRKSKGRAH L+  Q +KP TPAV         GP+SE P  VIEL+S+G  SREKRPR+++EA+D
Subjt:  ESGLLDYNPLVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALD

A0A6J1DZB3 uncharacterized protein LOC111025665    3.6e-131    67.16
Query:  MCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNP
        MCARKG  GIVK P SIKGWV KWFFASG WLAK+ES   FFDVP       S     +L QA++DTLK+YKD FP  RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNS-----QLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNP

Query:  LVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLCEVRED
        LVR +E SRPNS+LAMVCGFTGSVKRKSKGRAH LKTV GT+P TP V R   Q  +GPSS VPTPVIELD +G  S EKR R ESEALDVSPL EVR +
Subjt:  LVRPVEDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLCEVRED

Query:  SPLRRRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGGP--------WTAASEG-------------------HPKFVSDLGSILQRTIDHAAEAII
        SPLRRRRKKKKT+SS E G R  LP+SHADLVDDPEARM G            +S G                     KFVSD GS+LQRTID+ AEA I
Subjt:  SPLRRRRKKKKTTSSLEVGPREPLPSSHADLVDDPEARMGGP--------WTAASEG-------------------HPKFVSDLGSILQRTIDHAAEAII

Query:  ASIHSAIMVKTKLDGREILAARESANSSATLEAATIMKGELLKARFELETLKAEVEAKAQLLKREEEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQI
        ASIH A+MVK +LDGRE LAA+E  NS A LEAAT +KGELLKA+ E++ L+AEV+AK  LLK+E EKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQ+
Subjt:  ASIHSAIMVKTKLDGREILAARESANSSATLEAATIMKGELLKARFELETLKAEVEAKAQLLKREEEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQI

Query:  LEEMD
        LEE D
Subjt:  LEEMD

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGATAGCTAAGAAGCCAGGTCGGTACTACATGTGCGCAAGGAAGGGCGCATGTGGTATAGTTAAAAGGCCGGCCTCCATCAAGGGATGGGTGAAGAAGTGGTTCTTTGC
CTCTGGAGGATGGTTGGCAAAGAACGAGTCTTGTCTTCCCTTCTTTGACGTTCCCATTAGTGTCAATCAAACCAATTCCCAGCTAACTCAAGCATCTTGGGACACTCTCA
AGTATTACAAGGATCGCTTCCCAAGTGGCAGGAAGGTCGGAACCTTGGTGACTGACCGATTGCTGCTTGAGTCTGGGTTGTTAGACTACAACCCCTTAGTGCGTCCAGTC
GAAGACTCAAGACCAAACTCTAAGCTCGCAATGGTGTGTGGATTCACAGGCAGTGTGAAGCGCAAGTCTAAGGGCCGTGCTCACACTCTTAAGACTGTTCAAGGTACGAA
ACCAACAACTCCTGCTGTGACTCGTCCTACGGTCCAAGACAAAGCTGGGCCGTCTTCTGAAGTTCCAACTCCGGTGATCGAGTTGGATTCTGCTGGGGAACACTCCAGGG
AAAAGCGTCCAAGGAATGAGTCTGAGGCGCTGGACGTGTCACCTCTATGTGAGGTAAGAGAAGACTCTCCTCTGAGGAGGAGAAGGAAGAAGAAGAAAACCACCTCCTCC
TTAGAGGTTGGACCTCGTGAGCCTCTGCCCTCAAGCCACGCTGACCTGGTGGACGACCCCGAAGCTCGGATGGGGGGACCTTGGACCGCTGCCTCAGAAGGGCATCCTAA
GTTCGTAAGTGACCTTGGGTCCATACTGCAACGGACCATTGACCACGCCGCTGAGGCGATCATTGCTTCCATTCACTCGGCGATTATGGTGAAGACCAAGCTGGATGGAA
GGGAAATCTTGGCAGCGAGAGAGAGTGCGAATTCCTCTGCTACCTTGGAAGCTGCCACCATAATGAAGGGTGAGCTACTGAAAGCTCGCTTCGAATTGGAGACTTTGAAA
GCCGAGGTTGAGGCCAAGGCTCAGTTGCTGAAAAGAGAAGAAGAAAAGCACAAGGCCCACCTCCGAGCTGCTCACGCCATCACAAAGGGGTTGGAGAAGGAGAAGTTCCA
GCTCCTGAAGGAGAAGGACGACCTGGCTCAAATCCTTGAGGAGATGGACTAG
mRNA sequence
ATGATAGCTAAGAAGCCAGGTCGGTACTACATGTGCGCAAGGAAGGGCGCATGTGGTATAGTTAAAAGGCCGGCCTCCATCAAGGGATGGGTGAAGAAGTGGTTCTTTGC
CTCTGGAGGATGGTTGGCAAAGAACGAGTCTTGTCTTCCCTTCTTTGACGTTCCCATTAGTGTCAATCAAACCAATTCCCAGCTAACTCAAGCATCTTGGGACACTCTCA
AGTATTACAAGGATCGCTTCCCAAGTGGCAGGAAGGTCGGAACCTTGGTGACTGACCGATTGCTGCTTGAGTCTGGGTTGTTAGACTACAACCCCTTAGTGCGTCCAGTC
GAAGACTCAAGACCAAACTCTAAGCTCGCAATGGTGTGTGGATTCACAGGCAGTGTGAAGCGCAAGTCTAAGGGCCGTGCTCACACTCTTAAGACTGTTCAAGGTACGAA
ACCAACAACTCCTGCTGTGACTCGTCCTACGGTCCAAGACAAAGCTGGGCCGTCTTCTGAAGTTCCAACTCCGGTGATCGAGTTGGATTCTGCTGGGGAACACTCCAGGG
AAAAGCGTCCAAGGAATGAGTCTGAGGCGCTGGACGTGTCACCTCTATGTGAGGTAAGAGAAGACTCTCCTCTGAGGAGGAGAAGGAAGAAGAAGAAAACCACCTCCTCC
TTAGAGGTTGGACCTCGTGAGCCTCTGCCCTCAAGCCACGCTGACCTGGTGGACGACCCCGAAGCTCGGATGGGGGGACCTTGGACCGCTGCCTCAGAAGGGCATCCTAA
GTTCGTAAGTGACCTTGGGTCCATACTGCAACGGACCATTGACCACGCCGCTGAGGCGATCATTGCTTCCATTCACTCGGCGATTATGGTGAAGACCAAGCTGGATGGAA
GGGAAATCTTGGCAGCGAGAGAGAGTGCGAATTCCTCTGCTACCTTGGAAGCTGCCACCATAATGAAGGGTGAGCTACTGAAAGCTCGCTTCGAATTGGAGACTTTGAAA
GCCGAGGTTGAGGCCAAGGCTCAGTTGCTGAAAAGAGAAGAAGAAAAGCACAAGGCCCACCTCCGAGCTGCTCACGCCATCACAAAGGGGTTGGAGAAGGAGAAGTTCCA
GCTCCTGAAGGAGAAGGACGACCTGGCTCAAATCCTTGAGGAGATGGACTAG
Protein sequence
MIAKKPGRYYMCARKGACGIVKRPASIKGWVKKWFFASGGWLAKNESCLPFFDVPISVNQTNSQLTQASWDTLKYYKDRFPSGRKVGTLVTDRLLLESGLLDYNPLVRPV
EDSRPNSKLAMVCGFTGSVKRKSKGRAHTLKTVQGTKPTTPAVTRPTVQDKAGPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLCEVREDSPLRRRRKKKKTTSS
LEVGPREPLPSSHADLVDDPEARMGGPWTAASEGHPKFVSDLGSILQRTIDHAAEAIIASIHSAIMVKTKLDGREILAARESANSSATLEAATIMKGELLKARFELETLK
AEVEAKAQLLKREEEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQILEEMD