CuGenDBv2

Moc02g16110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g16110
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr2:12108028..12113817
RNA-Seq Expression: Moc02g16110
Synteny: Moc02g16110
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
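
The genome location above follows a `chromosome:start..end` pattern. The following is a minimal sketch of splitting such a string into its parts. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, span)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr2:12108028..12113817")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the span covers the whole gene model on chr2, not only the coding sequence.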


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] (e value: 5.4e-77; % identity: 58.19)
Query:  ICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLK
        +CARKGA GIVKGPTSIKGWV KWF+ASGEWLA DES                                             ++I+ +PE TQA+FDTLK
Subjt:  ICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLK

Query:  YYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIE
        YYK+HFPRGRK+GTLVTDKLLLESGLLDYNP +RP+E+SRPNSELA +CGF+S+VKRKS G+A+AL+  QS++P TPAV         GP+SE P PVIE
Subjt:  YYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIE

Query:  LESAGEHSREKRPRDESEALDVSPL-REVREESPLRRKKKKKKTASSSEVGPRGPLPASHVDLVDDPEAQMGGRPMRASKFVSAPGS
        LES+   SREKRPRD++EA+DVSPL  EVREE PL+R++KKKKT S  EVG RG LPAS  D VDDPEA+MGG P   ++F   P S
Subjt:  LESAGEHSREKRPRDESEALDVSPL-REVREESPLRRKKKKKKTASSSEVGPRGPLPASHVDLVDDPEAQMGGRPMRASKFVSAPGS

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] (e value: 2.6e-87; % identity: 66.04)
Query:  VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESL
        VIFALAILF LRARD +EAELL+V+QLL CFEAKRIAKKPGR+Y+CARKGAGGIVKGPTSIKGWV KWF+ASGEWLA DESGR+FFDVP RFGNL     
Subjt:  VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESL

Query:  LLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVK
                                  +SI+ +PE TQA+FDTLKYYK+ FPRGRK+GTLVTD+LLLESGLLDYNP +RP+E SRPNS LA +C F+S VK
Subjt:  LLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVK

Query:  RKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIELESAGEHSREKRPRDESEALD
        RKS GRA+AL+  QS++P TPAV         GP+SE P PVIELES+G  SREKRPRD++EA+D
Subjt:  RKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIELESAGEHSREKRPRDESEALD

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] (e value: 2.5e-66; % identity: 69.84)
Query:  VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESL
        VIFALAILF LRARD +EAELL+V+QLL CFEAKRIAKKPGR+Y+CARKGA GIVKGPTSIKGWV KWF+ASGEWLA DESGR+FFDVP RFGNL     
Subjt:  VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESL

Query:  LLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSEL
                                  +SI+ +PE TQA+FDTLKYYK+HFPRGRK+GTLVTDKLLLESGLLDYNP +RP+E+SRPNSEL
Subjt:  LLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 1.6e-100; % identity: 55.44)
Query:  SDYGEDLALRLESELEEIENFRFLDDEEDSDTSISGQGLEYPSKMPEHYLGPLR----------------------------------------------
        S+   DLA RLES+LEEIEN R  DD EDSD S SGQGLEYPS++PEHYLG LR                                              
Subjt:  SDYGEDLALRLESELEEIENFRFLDDEEDSDTSISGQGLEYPSKMPEHYLGPLR----------------------------------------------

Query:  ---------------------VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLAND
                             VIFALAILF LRARD +EAEL +V+QLL CFEAKRIAKKPGR+Y+CARKGAGGIVKGPTSIKGWV KWF+ASGEWLA D
Subjt:  ---------------------VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLAND

Query:  ESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRP
        ESGR+FFDVP RFGNL                               +SI+ +PE TQA+FDTLKYYK+ FPRGRK+GTLVTD+LLLESGLLDYNP +RP
Subjt:  ESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRP

Query:  VEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIELESAGEHSREKRPRDESEALD
        +E+SRPNSELA +CGF+S VKRKS GRA+AL+  QS++P TPAV         GP+SE P  VIELES+G  SREKRPRD++EA+D
Subjt:  VEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIELESAGEHSREKRPRDESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 7.0e-117; % identity: 65.95)
Query:  ICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLK
        +CARKG GGIVKGPTSIKGWVGKWFFASGEWLA DESGR FFDVP RFGNL                               +SIKLIPE  QATFDTLK
Subjt:  ICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLK

Query:  YYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIE
        +YKDHFPR RKI TLVTDKLLLESGLLDYNPL+R +EASRPNSELA +CGF+ SVKRKS GRA+ALKT+  TEP TP V +  AQ  +GPSS VPTPVIE
Subjt:  YYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIE

Query:  LESAGEHSREKRPRDESEALDVSPLREVREESPLRRKKKKKKTASSSEVGPRGPLPASHVDLVDDPEAQMGGRP--------------------------
        L+ +G  S EKR R+ESEALDVSPL EVR ESPLRR++KKKKT+SSSE G RG LP SH DLVDDPEA+M G                            
Subjt:  LESAGEHSREKRPRDESEALDVSPLREVREESPLRRKKKKKKTASSSEVGPRGPLPASHVDLVDDPEAQMGGRP--------------------------

Query:  -----MRASKFVSAPGSVLQRTIDHAVEAFNASIHSAVMIKAELDGREALAAKERENSSAALEAATTLKG
              RASKFVS PGSVLQRTID+  EAF ASIH AVM+KAELDGREALAAKERENS AALEAATTLKG
Subjt:  -----MRASKFVSAPGSVLQRTIDHAVEAFNASIHSAVMIKAELDGREALAAKERENSSAALEAATTLKG

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 (e value: 2.6e-77; % identity: 58.19)
Query:  ICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLK
        +CARKGA GIVKGPTSIKGWV KWF+ASGEWLA DES                                             ++I+ +PE TQA+FDTLK
Subjt:  ICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLK

Query:  YYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIE
        YYK+HFPRGRK+GTLVTDKLLLESGLLDYNP +RP+E+SRPNSELA +CGF+S+VKRKS G+A+AL+  QS++P TPAV         GP+SE P PVIE
Subjt:  YYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIE

Query:  LESAGEHSREKRPRDESEALDVSPL-REVREESPLRRKKKKKKTASSSEVGPRGPLPASHVDLVDDPEAQMGGRPMRASKFVSAPGS
        LES+   SREKRPRD++EA+DVSPL  EVREE PL+R++KKKKT S  EVG RG LPAS  D VDDPEA+MGG P   ++F   P S
Subjt:  LESAGEHSREKRPRDESEALDVSPL-REVREESPLRRKKKKKKTASSSEVGPRGPLPASHVDLVDDPEAQMGGRPMRASKFVSAPGS

A0A6J1CR42 uncharacterized protein LOC111013826 (e value: 1.3e-87; % identity: 66.04)
Query:  VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESL
        VIFALAILF LRARD +EAELL+V+QLL CFEAKRIAKKPGR+Y+CARKGAGGIVKGPTSIKGWV KWF+ASGEWLA DESGR+FFDVP RFGNL     
Subjt:  VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESL

Query:  LLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVK
                                  +SI+ +PE TQA+FDTLKYYK+ FPRGRK+GTLVTD+LLLESGLLDYNP +RP+E SRPNS LA +C F+S VK
Subjt:  LLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVK

Query:  RKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIELESAGEHSREKRPRDESEALD
        RKS GRA+AL+  QS++P TPAV         GP+SE P PVIELES+G  SREKRPRD++EA+D
Subjt:  RKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIELESAGEHSREKRPRDESEALD

A0A6J1DWF1 uncharacterized protein LOC111025108 (e value: 1.2e-66; % identity: 69.84)
Query:  VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESL
        VIFALAILF LRARD +EAELL+V+QLL CFEAKRIAKKPGR+Y+CARKGA GIVKGPTSIKGWV KWF+ASGEWLA DESGR+FFDVP RFGNL     
Subjt:  VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESL

Query:  LLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSEL
                                  +SI+ +PE TQA+FDTLKYYK+HFPRGRK+GTLVTDKLLLESGLLDYNP +RP+E+SRPNSEL
Subjt:  LLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSEL

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 7.6e-101; % identity: 55.44)
Query:  SDYGEDLALRLESELEEIENFRFLDDEEDSDTSISGQGLEYPSKMPEHYLGPLR----------------------------------------------
        S+   DLA RLES+LEEIEN R  DD EDSD S SGQGLEYPS++PEHYLG LR                                              
Subjt:  SDYGEDLALRLESELEEIENFRFLDDEEDSDTSISGQGLEYPSKMPEHYLGPLR----------------------------------------------

Query:  ---------------------VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLAND
                             VIFALAILF LRARD +EAEL +V+QLL CFEAKRIAKKPGR+Y+CARKGAGGIVKGPTSIKGWV KWF+ASGEWLA D
Subjt:  ---------------------VIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLAND

Query:  ESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRP
        ESGR+FFDVP RFGNL                               +SI+ +PE TQA+FDTLKYYK+ FPRGRK+GTLVTD+LLLESGLLDYNP +RP
Subjt:  ESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRP

Query:  VEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIELESAGEHSREKRPRDESEALD
        +E+SRPNSELA +CGF+S VKRKS GRA+AL+  QS++P TPAV         GP+SE P  VIELES+G  SREKRPRD++EA+D
Subjt:  VEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIELESAGEHSREKRPRDESEALD

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 3.4e-117; % identity: 65.95)
Query:  ICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLK
        +CARKG GGIVKGPTSIKGWVGKWFFASGEWLA DESGR FFDVP RFGNL                               +SIKLIPE  QATFDTLK
Subjt:  ICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFDVPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLK

Query:  YYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIE
        +YKDHFPR RKI TLVTDKLLLESGLLDYNPL+R +EASRPNSELA +CGF+ SVKRKS GRA+ALKT+  TEP TP V +  AQ  +GPSS VPTPVIE
Subjt:  YYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSSSVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIE

Query:  LESAGEHSREKRPRDESEALDVSPLREVREESPLRRKKKKKKTASSSEVGPRGPLPASHVDLVDDPEAQMGGRP--------------------------
        L+ +G  S EKR R+ESEALDVSPL EVR ESPLRR++KKKKT+SSSE G RG LP SH DLVDDPEA+M G                            
Subjt:  LESAGEHSREKRPRDESEALDVSPLREVREESPLRRKKKKKKTASSSEVGPRGPLPASHVDLVDDPEAQMGGRP--------------------------

Query:  -----MRASKFVSAPGSVLQRTIDHAVEAFNASIHSAVMIKAELDGREALAAKERENSSAALEAATTLKG
              RASKFVS PGSVLQRTID+  EAF ASIH AVM+KAELDGREALAAKERENS AALEAATTLKG
Subjt:  -----MRASKFVSAPGSVLQRTIDHAVEAFNASIHSAVMIKAELDGREALAAKERENSSAALEAATTLKG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
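
In the alignment blocks above, the line between Query and Subjt repeats the residue where the two sequences are identical, shows `+` for a similar (positive) substitution, and is blank otherwise. The sketch below counts these columns for one fragment. The function name `alignment_stats` is hypothetical, and the fragment is only the first 37 columns of the first GenBank alignment, so its ratio is not expected to match the % identity reported for the full hit.

```python
def alignment_stats(query: str, midline: str):
    """Return (identities, positives, columns) for one alignment fragment.

    A letter in the midline marks an identical residue, '+' marks a
    similar substitution, and a space marks a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 37 columns of the alignment against XP_022138041.1 (illustrative fragment only).
query = "ICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDES"
midline = "+CARKGA GIVKGPTSIKGWV KWF+ASGEWLA DES"

ident, pos, cols = alignment_stats(query, midline)
print(f"identities {ident}/{cols}, positives {pos}/{cols}")
```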

Sequences
CDS sequence
ATGATTACGATCTCGCTGAAGAAGTTCAAGAATGGTCCCCCCTACGCAATAGAGGTCGAGGCACCGATCATTTCAAAGCCTCCTGCCGAGCGACCACTATGGGAGGGGAA
AAGGCTTTACGCGTGCGTTCGGAAACATAGGCTGATTGCTGAGCCCACTTTGGCTCAGCCGCACTACACTGGGTGCGAGCCACTTGAGAAGCAAGCTATTACAGCGAGCG
GCTGGGGAAAGTCAATGATAAGCAAGAAAAGAAGAATAAGCAAGAAGAAAAGAGAAGTCCCGAGAACTCCCGCTTTACCTTCGCCTACACCCGCTTGCTCGCTTCCAACA
CAAGCACGCTATCCTTTGTTCGCCTTCTACTGCTTTCTACCTCGTGAGAAAATTTGCTACAGTTTAAAAAAGAACAACAAACGGGCCTTACCGGGACCAGATGCGCTGCG
CTACCCAGCGCATAACCTTGTTGCCGCTACCCCTCTTTTGGTTATGCTATTACCAATCGTGGATAACCCCCGAGCCGGCCGTTCACACGCGCCCAAACGAGCACCGATCG
TAACAGCAGGGAACTATACAGGCAACAGCAAAGGCGGTGATGGATCCGACAGTGCTCACGACCGACGGTTACATGTTTTTTCTCATGTCCGACCTGTCGGGTTCCAAGCA
GATCGAACCCGAGTCAGGTCAAACTTCGGCATGTATTACTTAGCCTTTTTCCAATGGGTGACCCCGACCTCTTCGGCAGGTTCGAGGTGGACCTGGAATTCTTTAATCCT
TGGATGCCAGTCAAACCTTACGTTCCCTGAATTTCTAGAGTTCGATCTGAAACCAGCTCGAACCCTTCGTAGTAGTGATAGCCTAGGTACCGCAGGTCGGACTATAAACA
GTTCGCCCCCCAAACCAAGTGATTATGGGGAGGACTTAGCTCTTAGGTTAGAGTCCGAGCTGGAAGAGATAGAGAATTTTAGGTTTTTGGACGATGAGGAGGATAGTGAC
ACTTCCATCTCGGGCCAGGGTTTGGAATACCCTTCTAAAATGCCCGAGCACTATCTCGGACCCCTCCGTGTCATTTTTGCGTTGGCCATCCTTTTCCGGTTGCGAGCTCG
GGACGTGGACGAGGCCGAGCTGCTGAACGTGGAGCAGCTTCTTGGATGCTTCGAAGCTAAAAGGATAGCTAAGAAGCCTGGTCGGTACTATATATGCGCAAGGAAAGGCG
CAGGTGGTATAGTCAAGGGGCCGACCTCCATCAAAGGATGGGTTGGGAAGTGGTTCTTTGCCTCTGGGGAATGGTTGGCAAATGATGAGTCAGGTCGTACCTTCTTTGAT
GTTCCCGCTAGGTTTGGGAACTTAGGTATGGAGTCGCTCTTGCTTTCTTTATGTAGCTCTAAGTCTTTTAGCTCTTTGCTTCTTATATCTAACTCGAACTTTCCTTTGTT
TTGTGCAATGTCAATCAAACTGATTCCCGAGCACACTCAAGCCACCTTCGACACCCTGAAGTACTACAAGGATCACTTCCCGAGGGGCCGAAAGATCGGAACCTTGGTGA
CTGACAAGCTGCTCCTCGAGTCTGGGTTGTTAGATTACAACCCCTTGTTGCGTCCGGTTGAAGCTTCAAGGCCAAACTCTGAACTCGCAACGATGTGTGGATTCTCCAGC
AGCGTGAAGCGCAAATCTATGGGCCGTGCTTATGCCCTTAAGACTATTCAAAGCACGGAGCCAACGACGCCTGCTGTGGCTCAGCTTGCTGCTCAAGATAAGGCTGGACC
ATCTTCCGAAGTCCCAACTCCAGTGATCGAGTTGGAATCTGCTGGGGAACACTCCAGAGAGAAGCGCCCAAGGGATGAGTCTGAGGCGCTGGACGTGTCACCTCTGCGAG
AGGTGAGGGAAGAGTCTCCCTTGAGGAGAAAAAAGAAGAAGAAGAAAACTGCCTCCTCCTCGGAGGTTGGACCTCGTGGGCCCCTGCCCGCGAGTCATGTCGACTTGGTG
GACGACCCCGAAGCTCAGATGGGGGGACGTCCGATGAGAGCGTCCAAGTTTGTAAGCGCTCCTGGGTCCGTTCTACAAAGGACCATTGACCATGCTGTCGAGGCGTTCAA
TGCTTCCATCCATTCGGCGGTTATGATCAAGGCCGAACTGGATGGAAGAGAAGCTTTGGCGGCAAAGGAAAGGGAGAACTCCTCTGCTGCCTTAGAAGCTGCCACCACGC
TGAAGGGCTAA
mRNA sequence
ATGATTACGATCTCGCTGAAGAAGTTCAAGAATGGTCCCCCCTACGCAATAGAGGTCGAGGCACCGATCATTTCAAAGCCTCCTGCCGAGCGACCACTATGGGAGGGGAA
AAGGCTTTACGCGTGCGTTCGGAAACATAGGCTGATTGCTGAGCCCACTTTGGCTCAGCCGCACTACACTGGGTGCGAGCCACTTGAGAAGCAAGCTATTACAGCGAGCG
GCTGGGGAAAGTCAATGATAAGCAAGAAAAGAAGAATAAGCAAGAAGAAAAGAGAAGTCCCGAGAACTCCCGCTTTACCTTCGCCTACACCCGCTTGCTCGCTTCCAACA
CAAGCACGCTATCCTTTGTTCGCCTTCTACTGCTTTCTACCTCGTGAGAAAATTTGCTACAGTTTAAAAAAGAACAACAAACGGGCCTTACCGGGACCAGATGCGCTGCG
CTACCCAGCGCATAACCTTGTTGCCGCTACCCCTCTTTTGGTTATGCTATTACCAATCGTGGATAACCCCCGAGCCGGCCGTTCACACGCGCCCAAACGAGCACCGATCG
TAACAGCAGGGAACTATACAGGCAACAGCAAAGGCGGTGATGGATCCGACAGTGCTCACGACCGACGGTTACATGTTTTTTCTCATGTCCGACCTGTCGGGTTCCAAGCA
GATCGAACCCGAGTCAGGTCAAACTTCGGCATGTATTACTTAGCCTTTTTCCAATGGGTGACCCCGACCTCTTCGGCAGGTTCGAGGTGGACCTGGAATTCTTTAATCCT
TGGATGCCAGTCAAACCTTACGTTCCCTGAATTTCTAGAGTTCGATCTGAAACCAGCTCGAACCCTTCGTAGTAGTGATAGCCTAGGTACCGCAGGTCGGACTATAAACA
GTTCGCCCCCCAAACCAAGTGATTATGGGGAGGACTTAGCTCTTAGGTTAGAGTCCGAGCTGGAAGAGATAGAGAATTTTAGGTTTTTGGACGATGAGGAGGATAGTGAC
ACTTCCATCTCGGGCCAGGGTTTGGAATACCCTTCTAAAATGCCCGAGCACTATCTCGGACCCCTCCGTGTCATTTTTGCGTTGGCCATCCTTTTCCGGTTGCGAGCTCG
GGACGTGGACGAGGCCGAGCTGCTGAACGTGGAGCAGCTTCTTGGATGCTTCGAAGCTAAAAGGATAGCTAAGAAGCCTGGTCGGTACTATATATGCGCAAGGAAAGGCG
CAGGTGGTATAGTCAAGGGGCCGACCTCCATCAAAGGATGGGTTGGGAAGTGGTTCTTTGCCTCTGGGGAATGGTTGGCAAATGATGAGTCAGGTCGTACCTTCTTTGAT
GTTCCCGCTAGGTTTGGGAACTTAGGTATGGAGTCGCTCTTGCTTTCTTTATGTAGCTCTAAGTCTTTTAGCTCTTTGCTTCTTATATCTAACTCGAACTTTCCTTTGTT
TTGTGCAATGTCAATCAAACTGATTCCCGAGCACACTCAAGCCACCTTCGACACCCTGAAGTACTACAAGGATCACTTCCCGAGGGGCCGAAAGATCGGAACCTTGGTGA
CTGACAAGCTGCTCCTCGAGTCTGGGTTGTTAGATTACAACCCCTTGTTGCGTCCGGTTGAAGCTTCAAGGCCAAACTCTGAACTCGCAACGATGTGTGGATTCTCCAGC
AGCGTGAAGCGCAAATCTATGGGCCGTGCTTATGCCCTTAAGACTATTCAAAGCACGGAGCCAACGACGCCTGCTGTGGCTCAGCTTGCTGCTCAAGATAAGGCTGGACC
ATCTTCCGAAGTCCCAACTCCAGTGATCGAGTTGGAATCTGCTGGGGAACACTCCAGAGAGAAGCGCCCAAGGGATGAGTCTGAGGCGCTGGACGTGTCACCTCTGCGAG
AGGTGAGGGAAGAGTCTCCCTTGAGGAGAAAAAAGAAGAAGAAGAAAACTGCCTCCTCCTCGGAGGTTGGACCTCGTGGGCCCCTGCCCGCGAGTCATGTCGACTTGGTG
GACGACCCCGAAGCTCAGATGGGGGGACGTCCGATGAGAGCGTCCAAGTTTGTAAGCGCTCCTGGGTCCGTTCTACAAAGGACCATTGACCATGCTGTCGAGGCGTTCAA
TGCTTCCATCCATTCGGCGGTTATGATCAAGGCCGAACTGGATGGAAGAGAAGCTTTGGCGGCAAAGGAAAGGGAGAACTCCTCTGCTGCCTTAGAAGCTGCCACCACGC
TGAAGGGCTAA
Protein sequence
MITISLKKFKNGPPYAIEVEAPIISKPPAERPLWEGKRLYACVRKHRLIAEPTLAQPHYTGCEPLEKQAITASGWGKSMISKKRRISKKKREVPRTPALPSPTPACSLPT
QARYPLFAFYCFLPREKICYSLKKNNKRALPGPDALRYPAHNLVAATPLLVMLLPIVDNPRAGRSHAPKRAPIVTAGNYTGNSKGGDGSDSAHDRRLHVFSHVRPVGFQA
DRTRVRSNFGMYYLAFFQWVTPTSSAGSRWTWNSLILGCQSNLTFPEFLEFDLKPARTLRSSDSLGTAGRTINSSPPKPSDYGEDLALRLESELEEIENFRFLDDEEDSD
TSISGQGLEYPSKMPEHYLGPLRVIFALAILFRLRARDVDEAELLNVEQLLGCFEAKRIAKKPGRYYICARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGRTFFD
VPARFGNLGMESLLLSLCSSKSFSSLLLISNSNFPLFCAMSIKLIPEHTQATFDTLKYYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLLRPVEASRPNSELATMCGFSS
SVKRKSMGRAYALKTIQSTEPTTPAVAQLAAQDKAGPSSEVPTPVIELESAGEHSREKRPRDESEALDVSPLREVREESPLRRKKKKKKTASSSEVGPRGPLPASHVDLV
DDPEAQMGGRPMRASKFVSAPGSVLQRTIDHAVEAFNASIHSAVMIKAELDGREALAAKERENSSAALEAATTLKG
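
The protein sequence should correspond to a translation of the CDS sequence. The following minimal sketch shows one way to check this, assuming the standard nuclear genetic code (the page does not say which table was used). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last five codons of the CDS shown above.
print(translate("ATGATTACGATCTCGCTGAAGAAG"))
print(translate("ACGCTGAAGGGCTAA"))
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the protein sequence extends the same check to the whole record.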