; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g17320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g17320
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:13765032..13778121
RNA-Seq ExpressionMoc09g17320
SyntenyMoc09g17320
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.1e-6565.88Show/hide
Query:  VSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGIEPVTPTVPRT
        V+I+P+PEL QA+FDT KYYK++FP+GRK+GTLVTDKLLLES LLDYNP VRPIE+SRPNSELAMVCGF  +VK KSKG+AH L+     +PVTP V   
Subjt:  VSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGIEPVTPTVPRT

Query:  EAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPL-NEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARMRGTSDVRMRFR
              GP+S  P  VIEL+ S G S EKRPR+++E +DVSPL  EVR E PL+RRRKKKKT+S  E GARG LP S AD VDDP ARM GT DV  RFR
Subjt:  EAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPL-NEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARMRGTSDVRMRFR

Query:  MEPTSSGVKDQ
        +EP+SSGV+DQ
Subjt:  MEPTSSGVKDQ

XP_022139697.1 uncharacterized protein LOC111010544 [Momordica charantia]6.0e-6788.46Show/hide
Query:  VSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGIEPVTPTVPRT
        +SIKPIPELAQATFDT KYYKD+FPKGRKIGTLVTDKLL ES LLDYNPLVRPIEASRPNSELAMVCGFTGSVK KSKGRAH LKTVVG EPV PTVPRT
Subjt:  VSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGIEPVTPTVPRT

Query:  EAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPLNEVRGESPLRRRR
        EA GNSGPSS  PT VIELDLS  RS EKRPREESE LDVSPLNEVRGE PLRRRR
Subjt:  EAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPLNEVRGESPLRRRR

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.1e-7155.87Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRIGLAPAQVAPNGWGVIFALAILFWLRARDENEAELL--------------------------------------------
        MFEYGLRLPLHPF QEFL R GLAPAQVAPNGWGVIFALAILFWLRARD  EAELL                                            
Subjt:  MFEYGLRLPLHPFAQEFLNRIGLAPAQVAPNGWGVIFALAILFWLRARDENEAELL--------------------------------------------

Query:  -----------SDESGRAFFDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTG
                    DESGR+FFDVPTRFGNLVSI+P+PEL QA+FDT KYYK+ FP+GRK+GTLVTD+LLLES LLDYNP VRPIE SRPNS LAMVC F  
Subjt:  -----------SDESGRAFFDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTG

Query:  SVKCKSKGRAHTLKTVVGIEPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETL-------DVSPLNE
         VK KSKGRAH L+     +P TP V         GP+S  P  VIEL+ S G S EKRPR+++E +       DV PL E
Subjt:  SVKCKSKGRAHTLKTVVGIEPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETL-------DVSPLNE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]3.1e-11664.18Show/hide
Query:  LARRLESELEEIENFRFSDDGADSDTSTSGQGLEYPSRMPEHYLGPLRRGFKIPNDILLRVPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLN
        LARRLES+LEEIEN R SDDG DSD STSGQGLEYPSR+PEHYLG LRRGF IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF QEFL 
Subjt:  LARRLESELEEIENFRFSDDGADSDTSTSGQGLEYPSRMPEHYLGPLRRGFKIPNDILLRVPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLN

Query:  RIGLAPAQVAPNGWGVIFALAILFWLRARDENEAE-------------------------------------------------------LLSDESGRAF
        R GLAPAQVAPNGWGVIFALAILFWLRARD  EAE                                                       L  DESGR+F
Subjt:  RIGLAPAQVAPNGWGVIFALAILFWLRARDENEAE-------------------------------------------------------LLSDESGRAF

Query:  FDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGI
        FDVPTRFGNLVSI+P+PEL QA+FDT KYYK+ FP+GRK+GTLVTD+LLLES LLDYNP VRPIE+SRPNSELAMVCGF   VK KSKGRAH L+     
Subjt:  FDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGI

Query:  EPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLD
        +P TP V         GP+S  P LVIEL+ S G S EKRPR+++E +D
Subjt:  EPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.0e-11188.66Show/hide
Query:  LLSDESGRAFFDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGR
        L  DESGRAFFDVPTRFGNLVSIK IPELAQATFDT K+YKD+FP+ RKI TLVTDKLLLES LLDYNPLVR IEASRPNSELAMVCGFTGSVK KSKGR
Subjt:  LLSDESGRAFFDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGR

Query:  AHTLKTVVGIEPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADL
        AH LKTVVG EPVTPTVPRT AQGNSGPSSA+PT VIELDLS GRS EKR REESE LDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADL
Subjt:  AHTLKTVVGIEPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADL

Query:  VDDPAARMRGTSDVRMRFRMEPTSSGVKDQVSRISATCLDRCLMRLS
        VDDP ARMRGTS+VRMRF MEP+SSGVKDQVSRISATCLDR L R S
Subjt:  VDDPAARMRGTSDVRMRFRMEPTSSGVKDQVSRISATCLDRCLMRLS

TrEMBL top hitse value%identityAlignment
A0A6J1CEP5 uncharacterized protein LOC1110105442.9e-6788.46Show/hide
Query:  VSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGIEPVTPTVPRT
        +SIKPIPELAQATFDT KYYKD+FPKGRKIGTLVTDKLL ES LLDYNPLVRPIEASRPNSELAMVCGFTGSVK KSKGRAH LKTVVG EPV PTVPRT
Subjt:  VSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGIEPVTPTVPRT

Query:  EAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPLNEVRGESPLRRRR
        EA GNSGPSS  PT VIELDLS  RS EKRPREESE LDVSPLNEVRGE PLRRRR
Subjt:  EAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPLNEVRGESPLRRRR

A0A6J1CEP5 uncharacterized protein LOC1110105449.5e-0286.67Show/hide
Query:  VTVAEDVSICQVVGALKYSVVAYLEKILAA
        +TVAEDVSICQVV ALK+S VAYLEKILAA
Subjt:  VTVAEDVSICQVVGALKYSVVAYLEKILAA

A0A6J1CEP5 uncharacterized protein LOC1110105445.5e-6665.88Show/hide
Query:  VSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGIEPVTPTVPRT
        V+I+P+PEL QA+FDT KYYK++FP+GRK+GTLVTDKLLLES LLDYNP VRPIE+SRPNSELAMVCGF  +VK KSKG+AH L+     +PVTP V   
Subjt:  VSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGIEPVTPTVPRT

Query:  EAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPL-NEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARMRGTSDVRMRFR
              GP+S  P  VIEL+ S G S EKRPR+++E +DVSPL  EVR E PL+RRRKKKKT+S  E GARG LP S AD VDDP ARM GT DV  RFR
Subjt:  EAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPL-NEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARMRGTSDVRMRFR

Query:  MEPTSSGVKDQ
        +EP+SSGV+DQ
Subjt:  MEPTSSGVKDQ

A0A6J1CR42 uncharacterized protein LOC1110138261.5e-7155.87Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRIGLAPAQVAPNGWGVIFALAILFWLRARDENEAELL--------------------------------------------
        MFEYGLRLPLHPF QEFL R GLAPAQVAPNGWGVIFALAILFWLRARD  EAELL                                            
Subjt:  MFEYGLRLPLHPFAQEFLNRIGLAPAQVAPNGWGVIFALAILFWLRARDENEAELL--------------------------------------------

Query:  -----------SDESGRAFFDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTG
                    DESGR+FFDVPTRFGNLVSI+P+PEL QA+FDT KYYK+ FP+GRK+GTLVTD+LLLES LLDYNP VRPIE SRPNS LAMVC F  
Subjt:  -----------SDESGRAFFDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTG

Query:  SVKCKSKGRAHTLKTVVGIEPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETL-------DVSPLNE
         VK KSKGRAH L+     +P TP V         GP+S  P  VIEL+ S G S EKRPR+++E +       DV PL E
Subjt:  SVKCKSKGRAHTLKTVVGIEPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETL-------DVSPLNE

A0A6J1DXS5 uncharacterized protein LOC1110255021.5e-11664.18Show/hide
Query:  LARRLESELEEIENFRFSDDGADSDTSTSGQGLEYPSRMPEHYLGPLRRGFKIPNDILLRVPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLN
        LARRLES+LEEIEN R SDDG DSD STSGQGLEYPSR+PEHYLG LRRGF IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF QEFL 
Subjt:  LARRLESELEEIENFRFSDDGADSDTSTSGQGLEYPSRMPEHYLGPLRRGFKIPNDILLRVPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLN

Query:  RIGLAPAQVAPNGWGVIFALAILFWLRARDENEAE-------------------------------------------------------LLSDESGRAF
        R GLAPAQVAPNGWGVIFALAILFWLRARD  EAE                                                       L  DESGR+F
Subjt:  RIGLAPAQVAPNGWGVIFALAILFWLRARDENEAE-------------------------------------------------------LLSDESGRAF

Query:  FDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGI
        FDVPTRFGNLVSI+P+PEL QA+FDT KYYK+ FP+GRK+GTLVTD+LLLES LLDYNP VRPIE+SRPNSELAMVCGF   VK KSKGRAH L+     
Subjt:  FDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGI

Query:  EPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLD
        +P TP V         GP+S  P LVIEL+ S G S EKRPR+++E +D
Subjt:  EPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLD

A0A6J1DZB3 uncharacterized protein LOC1110256651.5e-11188.66Show/hide
Query:  LLSDESGRAFFDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGR
        L  DESGRAFFDVPTRFGNLVSIK IPELAQATFDT K+YKD+FP+ RKI TLVTDKLLLES LLDYNPLVR IEASRPNSELAMVCGFTGSVK KSKGR
Subjt:  LLSDESGRAFFDVPTRFGNLVSIKPIPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGR

Query:  AHTLKTVVGIEPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADL
        AH LKTVVG EPVTPTVPRT AQGNSGPSSA+PT VIELDLS GRS EKR REESE LDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADL
Subjt:  AHTLKTVVGIEPVTPTVPRTEAQGNSGPSSALPTLVIELDLSEGRSEEKRPREESETLDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADL

Query:  VDDPAARMRGTSDVRMRFRMEPTSSGVKDQVSRISATCLDRCLMRLS
        VDDP ARMRGTS+VRMRF MEP+SSGVKDQVSRISATCLDR L R S
Subjt:  VDDPAARMRGTSDVRMRFRMEPTSSGVKDQVSRISATCLDRCLMRLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGTACAACAGTTAGGAAGTAGAGAGGGAGGAAAGGGGAATAGCTCAGTGAGGTGTTCAACAGGTTATCAGTTTCTAGCCCCAATAGTAAAACGGTTTGGCGAGCT
GGTGAAGTCGCGTAGGCGAAAGAGTGCATCAAGGGAGAAGACTACAGTATGTGCTGAAGTCAAGGGCAATGCCTATACATTGGCAACTAACTTAGTGGAAGGTGAGGTCG
TTGCTTTGTCTCCAAGTGGGAGATTGTTGGGATTGGCCAATGAAGTAACCCCTAGGGTCAAGAGGAAGTGTGGAGAAGAAGAGAAGTCAAGCAAGTGGCCCCATTTCAAG
GATAAGCCTAACAAGGAGCAACATACATGCAAGCATCGTCAGGCGCACGCCCCTGCCCGCGCACGCCGTCCCGCGCACGCCTTGTCGCTAACCTCACAGCACGTCCCTAC
CGCCCACACTGTGCCGCTAACCTCACAGCGCCCATGCCTGTGCCCGCTACCGCCCGCACATGCCTATGCCAAACGCTTCCACCCGCGCTCCAACATGCCCTGCACGACAA
AGAGGCCGCGCATCCTGCCCCACCTGCGCACCCTACAGCCAGGGAATGTGACCGTTGCGGAAGACGTTTCGATCTGCCAGGTTGTCGGAGCACTCAAGTATTCCGTCGTT
GCGTATCTCGAGAAGATCCTAGCCGCTCGTGATTACACGTGTCAGTCGCTTTTCCTTGCTCTTAGTATTCTTTCAAACATGGTAGTTTTCTTGTCTTCCCCCTCCAGTAG
TGATAGCCTGGGTAGTGCAGGTCGGACTATAAGTAGTTCACCCCCCAAACCAAGTGATTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCCGAGCTAGAAGAAATTGAGA
ACTTTAGGTTCTCAGATGATGGAGCGGATAGTGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTATCTCGGACCCCTTCGTAGGGGG
TTTAAAATCCCGAATGACATCCTCCTTAGAGTTCCAGAGGAAGGGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTGAAAATGTTTGAGTATGGCCT
CAGACTTCCTCTCCACCCTTTTGCTCAAGAGTTCCTCAACCGAATTGGTTTGGCGCCGGCACAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCCTTAGCCATCCTTT
TTTGGTTGCGAGCTCGGGATGAGAACGAGGCCGAGCTACTAAGTGACGAGTCTGGTCGTGCCTTTTTTGATGTTCCCACTAGGTTTGGGAACTTAGTATCGATTAAGCCG
ATTCCCGAGCTCGCTCAAGCCACCTTCGACACCTTCAAATACTACAAGGATAACTTCCCCAAGGGCAGGAAGATCGGAACCTTGGTCACCGACAAGCTACTCTTGGAGTC
GAGGCTACTTGACTACAACCCTCTGGTTCGACCAATTGAAGCTTCCAGGCCGAACTCCGAGCTCGCAATGGTGTGTGGATTCACTGGAAGTGTGAAGTGCAAGTCCAAGG
GTCGTGCTCACACCCTCAAGACCGTAGTGGGCATCGAACCGGTGACGCCTACAGTGCCGCGGACTGAGGCTCAGGGTAACTCTGGGCCATCCTCTGCACTCCCCACCCTC
GTGATCGAACTAGACTTGTCTGAGGGTCGATCTGAAGAGAAGCGTCCAAGGGAGGAGTCCGAGACGCTTGATGTATCTCCCCTAAACGAGGTGAGGGGAGAGTCTCCTTT
GAGGAGAAGAAGAAAGAAGAAGAAGACTTCCTCCTCCTCGGAGGCTGGGGCTCGTGGGACTCTGCCTACGAGCCATGCTGATCTGGTGGATGACCCCGCAGCTCGAATGA
GGGGAACATCCGATGTGAGAATGCGGTTCAGGATGGAGCCGACAAGTTCCGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCCACATGTTTGGACCGCTGCCTGATGCGT
TTATCGCTTCCATTCATTCAGCTATTATGGTCAAGGCCGAACTGGATGGAAAAGAGGCTTTGGCAAAAAAAGGAGAGGGAGAACTCCTCTGCTTTGGCACCTCGAACCCG
TGGGGGCAGGGAGAAGCCACGTCGGTACAACGCTCCTTTCCGAAGTGTGAACCGAGCTGCCCTCAAAGCTATCTTCTTTTGCTCCTTCGGATCTTGCGGTGGACTTCCTT
TGATGAACTCCACGATTGGGTCCATCCAAGAGGGTGATGGAGTATCAACCTCCATCACATCTGGCTCCAAGATTGAAGGAGTGTCCAAGATCTCAATCGGGACCGATCTA
GCCAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCGAAAAGACGAAAAAACACCTCAGGAGGCGCCAGGCGCCTGGGATGCCTGCAGAAAAACAGATATGTTGGGA
GGGACAAGATGTTGGCAGATGCGCCTCCATCAATCAGTACTCTTCGGACCAGGACGTGATCAATGAGAGGGGCGATCACGAGCACATCATTATGGGGCAAATAGACCCCC
CCCAGATCGGCATCGTCGAAAGTGATGGAGCAAGTGGGCTTCTGCTCCCTGATGATACATACCTCGCGCCTGGCCTCGAGAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCGTACAACAGTTAGGAAGTAGAGAGGGAGGAAAGGGGAATAGCTCAGTGAGGTGTTCAACAGGTTATCAGTTTCTAGCCCCAATAGTAAAACGGTTTGGCGAGCT
GGTGAAGTCGCGTAGGCGAAAGAGTGCATCAAGGGAGAAGACTACAGTATGTGCTGAAGTCAAGGGCAATGCCTATACATTGGCAACTAACTTAGTGGAAGGTGAGGTCG
TTGCTTTGTCTCCAAGTGGGAGATTGTTGGGATTGGCCAATGAAGTAACCCCTAGGGTCAAGAGGAAGTGTGGAGAAGAAGAGAAGTCAAGCAAGTGGCCCCATTTCAAG
GATAAGCCTAACAAGGAGCAACATACATGCAAGCATCGTCAGGCGCACGCCCCTGCCCGCGCACGCCGTCCCGCGCACGCCTTGTCGCTAACCTCACAGCACGTCCCTAC
CGCCCACACTGTGCCGCTAACCTCACAGCGCCCATGCCTGTGCCCGCTACCGCCCGCACATGCCTATGCCAAACGCTTCCACCCGCGCTCCAACATGCCCTGCACGACAA
AGAGGCCGCGCATCCTGCCCCACCTGCGCACCCTACAGCCAGGGAATGTGACCGTTGCGGAAGACGTTTCGATCTGCCAGGTTGTCGGAGCACTCAAGTATTCCGTCGTT
GCGTATCTCGAGAAGATCCTAGCCGCTCGTGATTACACGTGTCAGTCGCTTTTCCTTGCTCTTAGTATTCTTTCAAACATGGTAGTTTTCTTGTCTTCCCCCTCCAGTAG
TGATAGCCTGGGTAGTGCAGGTCGGACTATAAGTAGTTCACCCCCCAAACCAAGTGATTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCCGAGCTAGAAGAAATTGAGA
ACTTTAGGTTCTCAGATGATGGAGCGGATAGTGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTATCTCGGACCCCTTCGTAGGGGG
TTTAAAATCCCGAATGACATCCTCCTTAGAGTTCCAGAGGAAGGGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTGAAAATGTTTGAGTATGGCCT
CAGACTTCCTCTCCACCCTTTTGCTCAAGAGTTCCTCAACCGAATTGGTTTGGCGCCGGCACAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCCTTAGCCATCCTTT
TTTGGTTGCGAGCTCGGGATGAGAACGAGGCCGAGCTACTAAGTGACGAGTCTGGTCGTGCCTTTTTTGATGTTCCCACTAGGTTTGGGAACTTAGTATCGATTAAGCCG
ATTCCCGAGCTCGCTCAAGCCACCTTCGACACCTTCAAATACTACAAGGATAACTTCCCCAAGGGCAGGAAGATCGGAACCTTGGTCACCGACAAGCTACTCTTGGAGTC
GAGGCTACTTGACTACAACCCTCTGGTTCGACCAATTGAAGCTTCCAGGCCGAACTCCGAGCTCGCAATGGTGTGTGGATTCACTGGAAGTGTGAAGTGCAAGTCCAAGG
GTCGTGCTCACACCCTCAAGACCGTAGTGGGCATCGAACCGGTGACGCCTACAGTGCCGCGGACTGAGGCTCAGGGTAACTCTGGGCCATCCTCTGCACTCCCCACCCTC
GTGATCGAACTAGACTTGTCTGAGGGTCGATCTGAAGAGAAGCGTCCAAGGGAGGAGTCCGAGACGCTTGATGTATCTCCCCTAAACGAGGTGAGGGGAGAGTCTCCTTT
GAGGAGAAGAAGAAAGAAGAAGAAGACTTCCTCCTCCTCGGAGGCTGGGGCTCGTGGGACTCTGCCTACGAGCCATGCTGATCTGGTGGATGACCCCGCAGCTCGAATGA
GGGGAACATCCGATGTGAGAATGCGGTTCAGGATGGAGCCGACAAGTTCCGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCCACATGTTTGGACCGCTGCCTGATGCGT
TTATCGCTTCCATTCATTCAGCTATTATGGTCAAGGCCGAACTGGATGGAAAAGAGGCTTTGGCAAAAAAAGGAGAGGGAGAACTCCTCTGCTTTGGCACCTCGAACCCG
TGGGGGCAGGGAGAAGCCACGTCGGTACAACGCTCCTTTCCGAAGTGTGAACCGAGCTGCCCTCAAAGCTATCTTCTTTTGCTCCTTCGGATCTTGCGGTGGACTTCCTT
TGATGAACTCCACGATTGGGTCCATCCAAGAGGGTGATGGAGTATCAACCTCCATCACATCTGGCTCCAAGATTGAAGGAGTGTCCAAGATCTCAATCGGGACCGATCTA
GCCAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCGAAAAGACGAAAAAACACCTCAGGAGGCGCCAGGCGCCTGGGATGCCTGCAGAAAAACAGATATGTTGGGA
GGGACAAGATGTTGGCAGATGCGCCTCCATCAATCAGTACTCTTCGGACCAGGACGTGATCAATGAGAGGGGCGATCACGAGCACATCATTATGGGGCAAATAGACCCCC
CCCAGATCGGCATCGTCGAAAGTGATGGAGCAAGTGGGCTTCTGCTCCCTGATGATACATACCTCGCGCCTGGCCTCGAGAGCTAG
Protein sequenceShow/hide protein sequence
MFVQQLGSREGGKGNSSVRCSTGYQFLAPIVKRFGELVKSRRRKSASREKTTVCAEVKGNAYTLATNLVEGEVVALSPSGRLLGLANEVTPRVKRKCGEEEKSSKWPHFK
DKPNKEQHTCKHRQAHAPARARRPAHALSLTSQHVPTAHTVPLTSQRPCLCPLPPAHAYAKRFHPRSNMPCTTKRPRILPHLRTLQPGNVTVAEDVSICQVVGALKYSVV
AYLEKILAARDYTCQSLFLALSILSNMVVFLSSPSSSDSLGSAGRTISSSPPKPSDSGEVLARRLESELEEIENFRFSDDGADSDTSTSGQGLEYPSRMPEHYLGPLRRG
FKIPNDILLRVPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRIGLAPAQVAPNGWGVIFALAILFWLRARDENEAELLSDESGRAFFDVPTRFGNLVSIKP
IPELAQATFDTFKYYKDNFPKGRKIGTLVTDKLLLESRLLDYNPLVRPIEASRPNSELAMVCGFTGSVKCKSKGRAHTLKTVVGIEPVTPTVPRTEAQGNSGPSSALPTL
VIELDLSEGRSEEKRPREESETLDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPTSHADLVDDPAARMRGTSDVRMRFRMEPTSSGVKDQVSRISATCLDRCLMR
LSLPFIQLLWSRPNWMEKRLWQKKERENSSALAPRTRGGREKPRRYNAPFRSVNRAALKAIFFCSFGSCGGLPLMNSTIGSIQEGDGVSTSITSGSKIEGVSKISIGTDL
ARTIWECKLIKRSEKTKKHLRRRQAPGMPAEKQICWEGQDVGRCASINQYSSDQDVINERGDHEHIIMGQIDPPQIGIVESDGASGLLLPDDTYLAPGLES