; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g18780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g18780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:14197477..14200240
RNA-Seq ExpressionMoc08g18780
SyntenyMoc08g18780
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]9.6e-6969.67Show/hide
Query:  VSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHALKTVQSTEPTTPAVAKP
        V+I+P+PELTQAS+DTLKYYK+HFP GRKV TLVTD LLLES LLDYNP VR +E+SRPNSELAMVCGF+ NVKRKSKG+AHAL+  QS++P TPAV   
Subjt:  VSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHALKTVQSTEPTTPAVAKP

Query:  TAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALDVSPLC-EVREDSPLKRRRKKKKTTSSSEVGPRGPLPMSHVELVDDPEARMGGTSDVKMRFR
              GP+SE P PVIEL+ +   SR+KR R+++EA+DVSPL  EVRE+ PLKRRRKKKKTTS  EVG RG LP S  + VDDPEARMGGT DV  RFR
Subjt:  TAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALDVSPLC-EVREDSPLKRRRKKKKTTSSSEVGPRGPLPMSHVELVDDPEARMGGTSDVKMRFR

Query:  VEPSSSGVKDK
        VEPSSSGV+D+
Subjt:  VEPSSSGVKDK

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.6e-8665.3Show/hide
Query:  MFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWGVIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK-----------------
        MFEY LRLPLHP V+EFL RTGLAPAQVAPN WGVIFALAILFWLRAR+ + AELLDVDQL+ACFEAK+IAKK GR+YMC RK                 
Subjt:  MFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWGVIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK-----------------

Query:  ---------VQVNESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSG
                 +  +ESGR FFDVP RFGNLVSI+P+PELTQAS+DTLKYYK+ FP GRKV TLVTD LLLES LLDYNP VR +E SRPNS LAMVC F+ 
Subjt:  ---------VQVNESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSG

Query:  NVKRKSKGRAHALKTVQSTEPTTPAVAKPTAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALD
         VKRKSKGRAHAL+  QS++P TPAV         GP+SE P PVIEL+ +G  SR+KR R+++EA+D
Subjt:  NVKRKSKGRAHALKTVQSTEPTTPAVAKPTAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALD

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]5.8e-6669.59Show/hide
Query:  MFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWGVIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK-----------------
        MFEY LRLPLHP V+EFL RTGLAPAQVAPN WGVIFALAILFWLRAR+ + AELLDVDQL+ACFEAK+IAKK GR+YMC RK                 
Subjt:  MFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWGVIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK-----------------

Query:  ---------VQVNESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAM
                 +  +ESGR FFDVP RFGNLVSI+P+PELTQAS+DTLKYYK+HFP GRKV TLVTD LLLES LLDYNP VR +E+SRPNSEL M
Subjt:  ---------VQVNESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]3.3e-10161.08Show/hide
Query:  MSSPSNSDSLSSAGRTISSSPPKPSNYGEDLALRGFSIPDDINLRILEGGERVNNPPEGWVTLYLKMFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWG
        +S        S++G+ +      P +Y   L  RGF+IP++I LR+ E GER +NPPEGWVTLY KMFEY LRLPLHP V+EFL RTGLAPAQVAPN WG
Subjt:  MSSPSNSDSLSSAGRTISSSPPKPSNYGEDLALRGFSIPDDINLRILEGGERVNNPPEGWVTLYLKMFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWG

Query:  VIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK--------------------------VQVNESGRPFFDVPVRFGNLVSIKP
        VIFALAILFWLRAR+ + AEL DVDQL+ACFEAK+IAKK GR+YMC RK                          +  +ESGR FFDVP RFGNLVSI+P
Subjt:  VIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK--------------------------VQVNESGRPFFDVPVRFGNLVSIKP

Query:  IPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHALKTVQSTEPTTPAVAKPTAQDQ
        +PELTQAS+DTLKYYK+ FP GRKV TLVTD LLLES LLDYNP VR +E+SRPNSELAMVCGF+  VKRKSKGRAHAL+  QS++P TPAV        
Subjt:  IPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHALKTVQSTEPTTPAVAKPTAQDQ

Query:  AGPSSEVPTPVIELDFAGEHSRDKRSRNESEALD
         GP+SE P  VIEL+ +G  SR+KR R+++EA+D
Subjt:  AGPSSEVPTPVIELDFAGEHSRDKRSRNESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]9.4e-12571.55Show/hide
Query:  NESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRK-VTLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHA
        +ESGR FFDVP RFGNLVSIK IPEL QA++DTLK+YKDHFP  RK VTLVTD LLLES LLDYNPLVRL+EASRPNSELAMVCGF+G+VKRKSKGRAHA
Subjt:  NESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRK-VTLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHA

Query:  LKTVQSTEPTTPAVAKPTAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALDVSPLCEVREDSPLKRRRKKKKTTSSSEVGPRGPLPMSHVELVDD
        LKTV  TEP TP V +  AQ  +GPSS VPTPVIELD +G  S +KRSR ESEALDVSPL EVR +SPL+RRRKKKKT+SSSE G RG LP SH +LVDD
Subjt:  LKTVQSTEPTTPAVAKPTAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALDVSPLCEVREDSPLKRRRKKKKTTSSSEVGPRGPLPMSHVELVDD

Query:  PEARMGGTSDVKMRFRVEPSSSGVKDKVSHISAACLDRCLRRASKFVSDPGSVLQRTIDHAAEAFIASIHSAVMLKAELDGRD-LGCKGEGELLCYLRSC
        PEARM GTS+V+MRF +EPSSSGVKD+VS ISA CLDR LRRASKFVSDPGSVLQRTID+ AEAFIASIH AVM+KAELDGR+ L  K        L + 
Subjt:  PEARMGGTSDVKMRFRVEPSSSGVKDKVSHISAACLDRCLRRASKFVSDPGSVLQRTIDHAAEAFIASIHSAVMLKAELDGRD-LGCKGEGELLCYLRSC

Query:  HHDEGRAIESSSEVDILKAEVEAKTQLLKNEDEKHKAHSELPMPSQKG
           +G  +++  EVDIL+AEV+AK  LLK E EKHKAH        KG
Subjt:  HHDEGRAIESSSEVDILKAEVEAKTQLLKNEDEKHKAHSELPMPSQKG

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092984.7e-6969.67Show/hide
Query:  VSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHALKTVQSTEPTTPAVAKP
        V+I+P+PELTQAS+DTLKYYK+HFP GRKV TLVTD LLLES LLDYNP VR +E+SRPNSELAMVCGF+ NVKRKSKG+AHAL+  QS++P TPAV   
Subjt:  VSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHALKTVQSTEPTTPAVAKP

Query:  TAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALDVSPLC-EVREDSPLKRRRKKKKTTSSSEVGPRGPLPMSHVELVDDPEARMGGTSDVKMRFR
              GP+SE P PVIEL+ +   SR+KR R+++EA+DVSPL  EVRE+ PLKRRRKKKKTTS  EVG RG LP S  + VDDPEARMGGT DV  RFR
Subjt:  TAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALDVSPLC-EVREDSPLKRRRKKKKTTSSSEVGPRGPLPMSHVELVDDPEARMGGTSDVKMRFR

Query:  VEPSSSGVKDK
        VEPSSSGV+D+
Subjt:  VEPSSSGVKDK

A0A6J1CR42 uncharacterized protein LOC1110138263.2e-8665.3Show/hide
Query:  MFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWGVIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK-----------------
        MFEY LRLPLHP V+EFL RTGLAPAQVAPN WGVIFALAILFWLRAR+ + AELLDVDQL+ACFEAK+IAKK GR+YMC RK                 
Subjt:  MFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWGVIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK-----------------

Query:  ---------VQVNESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSG
                 +  +ESGR FFDVP RFGNLVSI+P+PELTQAS+DTLKYYK+ FP GRKV TLVTD LLLES LLDYNP VR +E SRPNS LAMVC F+ 
Subjt:  ---------VQVNESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSG

Query:  NVKRKSKGRAHALKTVQSTEPTTPAVAKPTAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALD
         VKRKSKGRAHAL+  QS++P TPAV         GP+SE P PVIEL+ +G  SR+KR R+++EA+D
Subjt:  NVKRKSKGRAHALKTVQSTEPTTPAVAKPTAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALD

A0A6J1DWF1 uncharacterized protein LOC1110251082.8e-6669.59Show/hide
Query:  MFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWGVIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK-----------------
        MFEY LRLPLHP V+EFL RTGLAPAQVAPN WGVIFALAILFWLRAR+ + AELLDVDQL+ACFEAK+IAKK GR+YMC RK                 
Subjt:  MFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWGVIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK-----------------

Query:  ---------VQVNESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAM
                 +  +ESGR FFDVP RFGNLVSI+P+PELTQAS+DTLKYYK+HFP GRKV TLVTD LLLES LLDYNP VR +E+SRPNSEL M
Subjt:  ---------VQVNESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255021.6e-10161.08Show/hide
Query:  MSSPSNSDSLSSAGRTISSSPPKPSNYGEDLALRGFSIPDDINLRILEGGERVNNPPEGWVTLYLKMFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWG
        +S        S++G+ +      P +Y   L  RGF+IP++I LR+ E GER +NPPEGWVTLY KMFEY LRLPLHP V+EFL RTGLAPAQVAPN WG
Subjt:  MSSPSNSDSLSSAGRTISSSPPKPSNYGEDLALRGFSIPDDINLRILEGGERVNNPPEGWVTLYLKMFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWG

Query:  VIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK--------------------------VQVNESGRPFFDVPVRFGNLVSIKP
        VIFALAILFWLRAR+ + AEL DVDQL+ACFEAK+IAKK GR+YMC RK                          +  +ESGR FFDVP RFGNLVSI+P
Subjt:  VIFALAILFWLRAREEDGAELLDVDQLIACFEAKKIAKKLGRYYMCTRK--------------------------VQVNESGRPFFDVPVRFGNLVSIKP

Query:  IPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHALKTVQSTEPTTPAVAKPTAQDQ
        +PELTQAS+DTLKYYK+ FP GRKV TLVTD LLLES LLDYNP VR +E+SRPNSELAMVCGF+  VKRKSKGRAHAL+  QS++P TPAV        
Subjt:  IPELTQASWDTLKYYKDHFPSGRKV-TLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHALKTVQSTEPTTPAVAKPTAQDQ

Query:  AGPSSEVPTPVIELDFAGEHSRDKRSRNESEALD
         GP+SE P  VIEL+ +G  SR+KR R+++EA+D
Subjt:  AGPSSEVPTPVIELDFAGEHSRDKRSRNESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256654.6e-12571.55Show/hide
Query:  NESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRK-VTLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHA
        +ESGR FFDVP RFGNLVSIK IPEL QA++DTLK+YKDHFP  RK VTLVTD LLLES LLDYNPLVRL+EASRPNSELAMVCGF+G+VKRKSKGRAHA
Subjt:  NESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRK-VTLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVKRKSKGRAHA

Query:  LKTVQSTEPTTPAVAKPTAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALDVSPLCEVREDSPLKRRRKKKKTTSSSEVGPRGPLPMSHVELVDD
        LKTV  TEP TP V +  AQ  +GPSS VPTPVIELD +G  S +KRSR ESEALDVSPL EVR +SPL+RRRKKKKT+SSSE G RG LP SH +LVDD
Subjt:  LKTVQSTEPTTPAVAKPTAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALDVSPLCEVREDSPLKRRRKKKKTTSSSEVGPRGPLPMSHVELVDD

Query:  PEARMGGTSDVKMRFRVEPSSSGVKDKVSHISAACLDRCLRRASKFVSDPGSVLQRTIDHAAEAFIASIHSAVMLKAELDGRD-LGCKGEGELLCYLRSC
        PEARM GTS+V+MRF +EPSSSGVKD+VS ISA CLDR LRRASKFVSDPGSVLQRTID+ AEAFIASIH AVM+KAELDGR+ L  K        L + 
Subjt:  PEARMGGTSDVKMRFRVEPSSSGVKDKVSHISAACLDRCLRRASKFVSDPGSVLQRTIDHAAEAFIASIHSAVMLKAELDGRD-LGCKGEGELLCYLRSC

Query:  HHDEGRAIESSSEVDILKAEVEAKTQLLKNEDEKHKAHSELPMPSQKG
           +G  +++  EVDIL+AEV+AK  LLK E EKHKAH        KG
Subjt:  HHDEGRAIESSSEVDILKAEVEAKTQLLKNEDEKHKAHSELPMPSQKG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCAAAGATCGTCATTCATGGTTGGACTCACCGGAATTGGGCTGCTACTACTCGAGGAAGGTGAGGTGTAGTGATTTGGAGAGTGTCGTCGTTGATCGCCAATTGCC
GCCGCGAGCTGCTGTAGGAGGACGCCGGCCAATGCTCATGCTGCTTGCTGCCATAGCCGGAGGACGCCGTCGCTGCTGCTTTGCGGACGGAGAAGAATCAATGGGGGCAG
CTAGTGTCTCGAACCTTCTGTATGTCAGTAATATGGTAGTTTTCATGTCTTCCCCTTCCAATAGTGATAGTCTAAGTAGTGCAGGTCGGACGATAAGTAGTTCGCCCCCT
AAGCCAAGTAATTATGGGGAGGACTTAGCTCTTAGGGGGTTCAGTATTCCGGATGATATCAACCTTAGGATTCTAGAGGGAGGAGAAAGAGTTAACAACCCTCCAGAGGG
GTGGGTCACTCTTTACTTAAAAATGTTTGAGTACAACCTCAGACTTCCCCTTCACCCTCTTGTCAAAGAGTTTCTAAATCGAACTGGTCTGGCCCCTGCTCAAGTGGCCC
CCAATGTTTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTCTGGTTACGAGCTCGAGAAGAGGACGGCGCCGAGCTTCTTGACGTTGACCAGCTTATAGCGTGCTTCGAA
GCCAAAAAGATAGCTAAGAAGCTAGGTCGGTACTACATGTGCACAAGGAAGGTGCAGGTGAACGAGTCTGGTCGTCCCTTCTTCGACGTTCCCGTTAGGTTTGGGAATTT
AGTTTCAATCAAACCAATTCCTGAACTAACTCAAGCATCTTGGGACACTCTCAAGTATTATAAGGATCACTTCCCGAGTGGCAGGAAGGTCACCTTGGTAACTGACTGGC
TGTTACTGGAGTCCTGGTTGTTAGACTACAACCCCCTAGTACGCCTAGTCGAAGCTTCAAGGCCAAACTCTGAGCTCGCGATGGTGTGTGGATTCTCGGGCAATGTGAAA
CGTAAGTCCAAGGGTCGTGCTCATGCCCTTAAGACCGTTCAAAGCACGGAGCCAACAACTCCTGCTGTTGCCAAACCTACAGCTCAAGACCAAGCTGGGCCGTCTTCTGA
AGTCCCAACTCCGGTGATCGAGTTGGATTTTGCTGGGGAGCACTCCAGAGATAAACGCTCAAGGAACGAGTCCGAGGCGCTGGACGTGTCACCTCTATGCGAGGTGAGAG
AAGACTCTCCTCTGAAGAGGAGAAGGAAGAAGAAGAAAACCACCTCCTCCTCGGAGGTTGGACCTCGTGGGCCTCTGCCCATGAGCCATGTTGAGTTGGTGGATGACCCC
GAAGCCAGGATGGGGGGAACGTCCGATGTGAAGATGCGGTTCAGAGTAGAACCGTCGAGCTCCGGGGTAAAGGACAAGGTGTCCCACATCTCGGCTGCATGCTTGGACCG
CTGCCTCAGAAGGGCGTCCAAGTTCGTAAGTGACCCTGGGTCCGTGCTGCAACGGACCATTGACCACGCCGCTGAGGCGTTCATTGCTTCTATTCACTCGGCAGTTATGT
TGAAGGCCGAGCTGGACGGAAGAGATCTTGGCTGCAAGGGAGAAGGTGAACTCCTCTGCTACCTTAGAAGCTGCCACCACGATGAAGGGCGAGCTATTGAAAGCTCGTCC
GAAGTGGACATTCTGAAGGCTGAGGTGGAAGCCAAGACCCAACTACTGAAGAACGAGGATGAGAAGCACAAGGCCCACTCCGAGCTGCCCATGCCATCACAAAAGGGCTA
G
mRNA sequenceShow/hide mRNA sequence
ATGCTCAAAGATCGTCATTCATGGTTGGACTCACCGGAATTGGGCTGCTACTACTCGAGGAAGGTGAGGTGTAGTGATTTGGAGAGTGTCGTCGTTGATCGCCAATTGCC
GCCGCGAGCTGCTGTAGGAGGACGCCGGCCAATGCTCATGCTGCTTGCTGCCATAGCCGGAGGACGCCGTCGCTGCTGCTTTGCGGACGGAGAAGAATCAATGGGGGCAG
CTAGTGTCTCGAACCTTCTGTATGTCAGTAATATGGTAGTTTTCATGTCTTCCCCTTCCAATAGTGATAGTCTAAGTAGTGCAGGTCGGACGATAAGTAGTTCGCCCCCT
AAGCCAAGTAATTATGGGGAGGACTTAGCTCTTAGGGGGTTCAGTATTCCGGATGATATCAACCTTAGGATTCTAGAGGGAGGAGAAAGAGTTAACAACCCTCCAGAGGG
GTGGGTCACTCTTTACTTAAAAATGTTTGAGTACAACCTCAGACTTCCCCTTCACCCTCTTGTCAAAGAGTTTCTAAATCGAACTGGTCTGGCCCCTGCTCAAGTGGCCC
CCAATGTTTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTCTGGTTACGAGCTCGAGAAGAGGACGGCGCCGAGCTTCTTGACGTTGACCAGCTTATAGCGTGCTTCGAA
GCCAAAAAGATAGCTAAGAAGCTAGGTCGGTACTACATGTGCACAAGGAAGGTGCAGGTGAACGAGTCTGGTCGTCCCTTCTTCGACGTTCCCGTTAGGTTTGGGAATTT
AGTTTCAATCAAACCAATTCCTGAACTAACTCAAGCATCTTGGGACACTCTCAAGTATTATAAGGATCACTTCCCGAGTGGCAGGAAGGTCACCTTGGTAACTGACTGGC
TGTTACTGGAGTCCTGGTTGTTAGACTACAACCCCCTAGTACGCCTAGTCGAAGCTTCAAGGCCAAACTCTGAGCTCGCGATGGTGTGTGGATTCTCGGGCAATGTGAAA
CGTAAGTCCAAGGGTCGTGCTCATGCCCTTAAGACCGTTCAAAGCACGGAGCCAACAACTCCTGCTGTTGCCAAACCTACAGCTCAAGACCAAGCTGGGCCGTCTTCTGA
AGTCCCAACTCCGGTGATCGAGTTGGATTTTGCTGGGGAGCACTCCAGAGATAAACGCTCAAGGAACGAGTCCGAGGCGCTGGACGTGTCACCTCTATGCGAGGTGAGAG
AAGACTCTCCTCTGAAGAGGAGAAGGAAGAAGAAGAAAACCACCTCCTCCTCGGAGGTTGGACCTCGTGGGCCTCTGCCCATGAGCCATGTTGAGTTGGTGGATGACCCC
GAAGCCAGGATGGGGGGAACGTCCGATGTGAAGATGCGGTTCAGAGTAGAACCGTCGAGCTCCGGGGTAAAGGACAAGGTGTCCCACATCTCGGCTGCATGCTTGGACCG
CTGCCTCAGAAGGGCGTCCAAGTTCGTAAGTGACCCTGGGTCCGTGCTGCAACGGACCATTGACCACGCCGCTGAGGCGTTCATTGCTTCTATTCACTCGGCAGTTATGT
TGAAGGCCGAGCTGGACGGAAGAGATCTTGGCTGCAAGGGAGAAGGTGAACTCCTCTGCTACCTTAGAAGCTGCCACCACGATGAAGGGCGAGCTATTGAAAGCTCGTCC
GAAGTGGACATTCTGAAGGCTGAGGTGGAAGCCAAGACCCAACTACTGAAGAACGAGGATGAGAAGCACAAGGCCCACTCCGAGCTGCCCATGCCATCACAAAAGGGCTA
G
Protein sequenceShow/hide protein sequence
MLKDRHSWLDSPELGCYYSRKVRCSDLESVVVDRQLPPRAAVGGRRPMLMLLAAIAGGRRRCCFADGEESMGAASVSNLLYVSNMVVFMSSPSNSDSLSSAGRTISSSPP
KPSNYGEDLALRGFSIPDDINLRILEGGERVNNPPEGWVTLYLKMFEYNLRLPLHPLVKEFLNRTGLAPAQVAPNVWGVIFALAILFWLRAREEDGAELLDVDQLIACFE
AKKIAKKLGRYYMCTRKVQVNESGRPFFDVPVRFGNLVSIKPIPELTQASWDTLKYYKDHFPSGRKVTLVTDWLLLESWLLDYNPLVRLVEASRPNSELAMVCGFSGNVK
RKSKGRAHALKTVQSTEPTTPAVAKPTAQDQAGPSSEVPTPVIELDFAGEHSRDKRSRNESEALDVSPLCEVREDSPLKRRRKKKKTTSSSEVGPRGPLPMSHVELVDDP
EARMGGTSDVKMRFRVEPSSSGVKDKVSHISAACLDRCLRRASKFVSDPGSVLQRTIDHAAEAFIASIHSAVMLKAELDGRDLGCKGEGELLCYLRSCHHDEGRAIESSS
EVDILKAEVEAKTQLLKNEDEKHKAHSELPMPSQKG