CuGenDBv2

Moc02g11240 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g11240
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: geranyl diphosphate phosphohydrolase-like
Genome location: chr2:8085331..8090814
RNA-Seq Expression: Moc02g11240
Synteny: Moc02g11240
Gene Ontology terms:
GO:0009706 - chloroplast inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR000086 - NUDIX hydrolase domain
IPR015797 - NUDIX hydrolase-like domain superfamily
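
The genome location listed above can be turned into a gene span length with a few lines of code. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(loc: str):
    """Split a 'chrom:start..end' string into its parts and a span length.

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr2:8085331..8090814")
print(chrom, start, end, span)  # chr2 8085331 8090814 5484
```

Under that assumption the gene spans 5484 bp on chr2.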


Homology
GenBank top hits | e value | % identity | Alignment
KAG7010970.1 hypothetical protein SDJN02_27768, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 7.2e-71 | % identity: 91.08
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR
        MAACFAPSLSVSGGLIKASDLSSKSI+FGQAPKLAIQ+KC RTN KLSVRAEYND GR GGG+FVAGFLLGGAVFGTLAYIFAPQIRRS+LNEDEYGFRR
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR

Query:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
        A+RPIYYD+GLEKTRQTLN KIGQLNSAIDNVSSRLRGGN TP+VPVEADPE EATM
Subjt:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM

XP_022148671.1 uncharacterized protein LOC111017273 [Momordica charantia] | e value: 5.5e-79 | % identity: 100
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRA
        MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRA
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRA

Query:  KRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
        KRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
Subjt:  KRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM

XP_022944447.1 uncharacterized protein LOC111448897 [Cucurbita moschata] | e value: 1.2e-70 | % identity: 91.08
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR
        MAACFAPSLSVSGGLIKASDLSSKSI+FGQAPKLAIQ+KC RT+ KLSVRAEYND GR GGG+FVAGFLLGGAVFGTLAYIFAPQIRRS+LNEDEYGFRR
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR

Query:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
        A+RPIYYD+GLEKTRQTLN KIGQLNSAIDNVSSRLRGGN TPAVPVEADPE EATM
Subjt:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM

XP_023512433.1 uncharacterized protein LOC111777191 [Cucurbita pepo subsp. pepo] | e value: 9.4e-71 | % identity: 91.08
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR
        MAACFAPSLSVSGGLIKASDLSSKSI+FGQ PKLAIQ+KC RTN KLSVRAEYND GR GGG+FVAGFLLGGAVFGTLAYIFAPQIRRS+LNEDEYGFRR
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR

Query:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
        A+RPIYYD+GLEKTRQTLN KIGQLNSAIDNVSSRLRGGN TPAVPVEADPE EATM
Subjt:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM

XP_038902580.1 uncharacterized protein LOC120089235 [Benincasa hispida] | e value: 3.8e-72 | % identity: 94.27
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDG-RGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR
        MAACFAPSLSVSGGLIKASDLSSKSI+FGQAPKLAIQ+K  RTN+KLSVRAEYNDG R GGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDG-RGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR

Query:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
        AKRPIYYDEGLEKTRQTLNAKI QLNSAIDNVSSRLRGGNNTPAVPVEADPE EATM
Subjt:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM

TrEMBL top hits | e value | % identity | Alignment
A0A1S3C9M4 uncharacterized protein LOC103498395 | e value: 7.2e-69 | % identity: 88.61
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR
        MAACFAPSLSVSGGLIKASDLSSKS++FGQ PKLAI++KC +TN KLSVRAEYND GR GGGDFVAGFLLGGAVFGTLAY+FAPQIRRS+LNEDE+GFRR
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR

Query:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPV-EADPEKEATM
        AKRP+YYDEGLEKTRQTLNAKI QLNSAIDNVSSRLRGGNNTPAVPV EA+PE EATM
Subjt:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPV-EADPEKEATM

A0A5D3CQ66 Uncharacterized protein | e value: 7.2e-69 | % identity: 88.61
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR
        MAACFAPSLSVSGGLIKASDLSSKS++FGQ PKLAI++KC +TN KLSVRAEYND GR GGGDFVAGFLLGGAVFGTLAY+FAPQIRRS+LNEDE+GFRR
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR

Query:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPV-EADPEKEATM
        AKRP+YYDEGLEKTRQTLNAKI QLNSAIDNVSSRLRGGNNTPAVPV EA+PE EATM
Subjt:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPV-EADPEKEATM

A0A6J1D647 uncharacterized protein LOC111017273 | e value: 2.7e-79 | % identity: 100
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRA
        MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRA
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRA

Query:  KRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
        KRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
Subjt:  KRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM

A0A6J1FVP6 uncharacterized protein LOC111448897 | e value: 5.9e-71 | % identity: 91.08
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR
        MAACFAPSLSVSGGLIKASDLSSKSI+FGQAPKLAIQ+KC RT+ KLSVRAEYND GR GGG+FVAGFLLGGAVFGTLAYIFAPQIRRS+LNEDEYGFRR
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR

Query:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
        A+RPIYYD+GLEKTRQTLN KIGQLNSAIDNVSSRLRGGN TPAVPVEADPE EATM
Subjt:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM

A0A6J1JGK4 uncharacterized protein LOC111484314 | e value: 2.3e-70 | % identity: 90.45
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR
        MAAC APSLSVSGGLIKASDLSSKSI+FGQAPKLAIQ+KC R+N KLSVRAEYND GR GGG+FVAGFLLGGAVFGTLAYIFAPQIRRS+LNEDEYGFRR
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYND-GRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRR

Query:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM
        A+RPIYYD+GLEKTRQTLN KIGQLNSAIDNVSSRLRGGN TPAVPVEADPE EATM
Subjt:  AKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATM

SwissProt top hits | e value | % identity | Alignment
M4I1C6 Geranyl diphosphate phosphohydrolase | e value: 3.5e-36 | % identity: 71.43
Query:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATLADPDQIARNLEPEKCDGWDWYEWDRLPQPLFSPLLNFVNTGFDPFP
        F ESFEECAARELKEET LDI KIE +T TNNLFLD   PS YV +FMRA LADP Q  +N+EPE CDGW WYEWD LP+PLF PL N V  GF+PFP
Subjt:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATLADPDQIARNLEPEKCDGWDWYEWDRLPQPLFSPLLNFVNTGFDPFP

Q8BG93 Nucleotide triphosphate diphosphatase NUDT15 | e value: 3.5e-12 | % identity: 39.25
Query:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATL-ADPDQIARNLEPEKCDGWDWYEWDRLP--QPLFSPLLNFVNTGFDPF
        F E++EECA RE  EE GL ++ + F +  N+ F++  +  HYV I M+  +    D   RN+EPEK + W+W  W+  P    LF  L      G+DPF
Subjt:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATL-ADPDQIARNLEPEKCDGWDWYEWDRLP--QPLFSPLLNFVNTGFDPF

Query:  PKCDQNH
         K D NH
Subjt:  PKCDQNH

Q9CA40 Nudix hydrolase 1 | e value: 1.4e-32 | % identity: 60.82
Query:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATLADPDQIARNLEPEKCDGWDWYEWDRLPQPLFSPLLNFVNTGFDPF
        F ESFEECAARE+ EETGL IEK++ +T TNN+F + P+PSHYV + +RA L DP Q  +N+EPEKC+GWDWY+W+ LP+PLF PL     +GF+PF
Subjt:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATLADPDQIARNLEPEKCDGWDWYEWDRLPQPLFSPLLNFVNTGFDPF

Q9NV35 Nucleotide triphosphate diphosphatase NUDT15 | e value: 4.6e-12 | % identity: 38.32
Query:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATL-ADPDQIARNLEPEKCDGWDWYEWDRLP--QPLFSPLLNFVNTGFDPF
        F E++EECA RE  EE  L ++ + F +  N+ F++  +  HYV I M+  +    D   +N+EPEK + W+W  W+ LP    LF  L      G+DPF
Subjt:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATL-ADPDQIARNLEPEKCDGWDWYEWDRLP--QPLFSPLLNFVNTGFDPF

Query:  PKCDQNH
         K D NH
Subjt:  PKCDQNH

Arabidopsis top hits | e value | % identity | Alignment
AT1G68760.1 nudix hydrolase 1 | e value: 9.8e-34 | % identity: 60.82
Query:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATLADPDQIARNLEPEKCDGWDWYEWDRLPQPLFSPLLNFVNTGFDPF
        F ESFEECAARE+ EETGL IEK++ +T TNN+F + P+PSHYV + +RA L DP Q  +N+EPEKC+GWDWY+W+ LP+PLF PL     +GF+PF
Subjt:  FRESFEECAARELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATLADPDQIARNLEPEKCDGWDWYEWDRLPQPLFSPLLNFVNTGFDPF

AT3G02900.1 unknown protein | e value: 2.1e-36 | % identity: 54.88
Query:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGG--DFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFR
        MA+  A  +S SG     +  +  +I   ++  L +Q K  R++ KLSV A Y  G  GGG  DFV GFLLG AVFGTLAYIFAPQIRRS+L+E+EYGF+
Subjt:  MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGG--DFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFR

Query:  RAKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGG-------NNTPAVPVEADPEKEAT
        + ++P+YYDEGLE+ R+ LN KIGQLNSAID VSSRL+GG        ++P+VPVE D E EAT
Subjt:  RAKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRGG-------NNTPAVPVEADPEKEAT

AT3G02900.2 unknown protein | e value: 9.4e-37 | % identity: 63.36
Query:  LAIQKKCLRTNRKLSVRAEYNDGRGGGG--DFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNV
        L +Q K  R++ KLSV A Y  G  GGG  DFV GFLLG AVFGTLAYIFAPQIRRS+L+E+EYGF++ ++P+YYDEGLE+ R+ LN KIGQLNSAID V
Subjt:  LAIQKKCLRTNRKLSVRAEYNDGRGGGG--DFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNV

Query:  SSRLRGG-------NNTPAVPVEADPEKEAT
        SSRL+GG        ++P+VPVE D E EAT
Subjt:  SSRLRGG-------NNTPAVPVEADPEKEAT

AT5G16660.1 unknown protein | e value: 3.7e-41 | % identity: 63.1
Query:  MAACFAPS-LSVSG----GLIKASDLS--SKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDG--RGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLN
        MA+C A + LS+SG      +KA+ LS  +K  +  +   L I KK  RT RK SV A Y DG   G  GDF+AGFLLGGAVFG +AYIFAPQIRRS+LN
Subjt:  MAACFAPS-LSVSG----GLIKASDLS--SKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDG--RGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLN

Query:  -EDEYGFRRAKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRG-GNNTPA--VPVEADPEKEAT
         EDEYGF + K+P YYDEGLEKTR+TLN KIGQLNSAIDNVSSRLRG   NT +  VPVE DPE EAT
Subjt:  -EDEYGFRRAKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRG-GNNTPA--VPVEADPEKEAT

AT5G16660.2 unknown protein | e value: 7.0e-40 | % identity: 62.05
Query:  MAACFAPS-LSVSG----GLIKASDLS--SKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLN-E
        MA+C A + LS+SG      +KA+ LS  +K  +  +   L I KK  RT RK SV A      G  GDF+AGFLLGGAVFG +AYIFAPQIRRS+LN E
Subjt:  MAACFAPS-LSVSG----GLIKASDLS--SKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLN-E

Query:  DEYGFRRAKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRG-GNNTPA--VPVEADPEKEAT
        DEYGF + K+P YYDEGLEKTR+TLN KIGQLNSAIDNVSSRLRG   NT +  VPVE DPE EAT
Subjt:  DEYGFRRAKRPIYYDEGLEKTRQTLNAKIGQLNSAIDNVSSRLRG-GNNTPA--VPVEADPEKEAT


Sequences
CDS sequence
ATGGCCGCCTGCTTCGCTCCTTCGCTGTCCGTGTCTGGGGGATTGATCAAGGCGTCAGATCTCTCCTCGAAGTCCATTACCTTTGGGCAAGCACCAAAACTCGCCATTCA
AAAGAAGTGCTTGAGAACCAACCGCAAGTTATCTGTTCGTGCAGAGTACAATGATGGTAGAGGTGGAGGTGGGGATTTTGTTGCTGGTTTTCTTCTAGGGGGTGCAGTAT
TTGGAACTTTAGCTTATATTTTTGCTCCGCAGATCAGGAGATCTCTACTAAATGAAGACGAGTACGGTTTTCGGAGGGCCAAGCGTCCAATCTACTATGACGAAGGTTTA
GAGAAAACCAGACAGACGTTGAATGCAAAAATAGGCCAATTGAATTCTGCCATTGACAATGTATCTTCACGTCTGAGAGGTGGAAACAATACACCAGCTGTGCCAGTTGA
AGCTGATCCTGAGAAAGAAGCTACCATGCGTAGCAGTCTTCCTTTTCAAAGGAAAATCCGTGCTCATGGGCCGCCGCCGCGTCCCCCACGGAGACTCCACATTCGCCGTC
CCCGGTGGCCACCTCGAGTTCGGTTCGTCCCATTACTCTTCTCTTTCCCTCTTTTTCCCATTCAGATTTCCGATTCTGCTTTTAGGGAGAGTTTCGAGGAGTGTGCGGCG
AGGGAATTGAAGGAGGAGACCGGTTTGGACATCGAGAAAATCGAGTTCATTACTGCGACGAACAATCTGTTTCTGGATAATCCGAGTCCGTCGCATTACGTGGTGATCTT
CATGCGCGCAACATTGGCGGATCCGGACCAGATTGCTCGGAATTTGGAGCCGGAGAAGTGCGATGGCTGGGACTGGTATGAATGGGATCGTCTTCCTCAACCTCTCTTTA
GCCCTCTTCTGAATTTTGTCAACACTGGCTTTGATCCATTCCCTAAATGTGATCAAAACCATTGA
mRNA sequence
ATGGCCGCCTGCTTCGCTCCTTCGCTGTCCGTGTCTGGGGGATTGATCAAGGCGTCAGATCTCTCCTCGAAGTCCATTACCTTTGGGCAAGCACCAAAACTCGCCATTCA
AAAGAAGTGCTTGAGAACCAACCGCAAGTTATCTGTTCGTGCAGAGTACAATGATGGTAGAGGTGGAGGTGGGGATTTTGTTGCTGGTTTTCTTCTAGGGGGTGCAGTAT
TTGGAACTTTAGCTTATATTTTTGCTCCGCAGATCAGGAGATCTCTACTAAATGAAGACGAGTACGGTTTTCGGAGGGCCAAGCGTCCAATCTACTATGACGAAGGTTTA
GAGAAAACCAGACAGACGTTGAATGCAAAAATAGGCCAATTGAATTCTGCCATTGACAATGTATCTTCACGTCTGAGAGGTGGAAACAATACACCAGCTGTGCCAGTTGA
AGCTGATCCTGAGAAAGAAGCTACCATGCGTAGCAGTCTTCCTTTTCAAAGGAAAATCCGTGCTCATGGGCCGCCGCCGCGTCCCCCACGGAGACTCCACATTCGCCGTC
CCCGGTGGCCACCTCGAGTTCGGTTCGTCCCATTACTCTTCTCTTTCCCTCTTTTTCCCATTCAGATTTCCGATTCTGCTTTTAGGGAGAGTTTCGAGGAGTGTGCGGCG
AGGGAATTGAAGGAGGAGACCGGTTTGGACATCGAGAAAATCGAGTTCATTACTGCGACGAACAATCTGTTTCTGGATAATCCGAGTCCGTCGCATTACGTGGTGATCTT
CATGCGCGCAACATTGGCGGATCCGGACCAGATTGCTCGGAATTTGGAGCCGGAGAAGTGCGATGGCTGGGACTGGTATGAATGGGATCGTCTTCCTCAACCTCTCTTTA
GCCCTCTTCTGAATTTTGTCAACACTGGCTTTGATCCATTCCCTAAATGTGATCAAAACCATTGA
Protein sequence
MAACFAPSLSVSGGLIKASDLSSKSITFGQAPKLAIQKKCLRTNRKLSVRAEYNDGRGGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRAKRPIYYDEGL
EKTRQTLNAKIGQLNSAIDNVSSRLRGGNNTPAVPVEADPEKEATMRSSLPFQRKIRAHGPPPRPPRRLHIRRPRWPPRVRFVPLLFSFPLFPIQISDSAFRESFEECAA
RELKEETGLDIEKIEFITATNNLFLDNPSPSHYVVIFMRATLADPDQIARNLEPEKCDGWDWYEWDRLPQPLFSPLLNFVNTGFDPFPKCDQNH