; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G06390 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G06390
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionMitochondrial intermediate peptidase
Genome locationClcChr05:4704516..4726820
RNA-Seq ExpressionClc05G06390
SyntenyClc05G06390
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582303.1 hypothetical protein SDJN03_22305, partial [Cucurbita argyrosperma subsp. sororia]1.0e-8668.52Show/hide
Query:  NRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSR
        +RSS M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V    +  A  L     F RSL+S VD ILALDGSR
Subjt:  NRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSR

Query:  MQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNA
        MQKELANI+VT+Y N+PRTMQ I KHF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQRT+ ND  +N+HGN HHDSSNRDS+  Q DSYG+PDDKGNA
Subjt:  MQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNA

Query:  LEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
         EF PVL K G DAATADPL+ IFGTL +EEEIQHSSAS+PSPKSH RS+RYNRRHRR  +TMPT+FEHV
Subjt:  LEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV

XP_022956077.1 uncharacterized protein LOC111457878 [Cucurbita moschata]1.6e-8769.43Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP
        ANI+VT+Y N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQRT+ ND  +N+HGN HHDSSNRDS+  QSDSYG+PDDKGNA EF P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP

Query:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
        VL K G DAATADPL+ IFGTL +EEEIQHSSAS+PSPKSH RS+RYNRRHRR  +TMPT+FEHV
Subjt:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV

XP_022980008.1 uncharacterized protein LOC111479542 [Cucurbita maxima]3.9e-8669.06Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP
        ANI+VT+  N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQR + ND  +N+HGN HHDSSNRDS+  QSDSYGEPDDKGNA EF P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP

Query:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
        VL K G DAATADPL+ IFGTL +EEEIQHSSAS+PSPKSH RS+RYNRRHRR  +TMPT+FEHV
Subjt:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV

XP_023527180.1 uncharacterized protein LOC111790494 isoform X1 [Cucurbita pepo subsp. pepo]2.7e-8769.06Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP
        ANI+VT+Y N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQRT+ ND  +N+HGN HHDSSNRDS+  QSDSYG+PDDKGNA EF P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP

Query:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
        VL K G DAATADPL+ IFGT+ +EEEIQHSSAS+PSPKSH RS+RYNRRHRR  +TMPT+FEHV
Subjt:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV

XP_038878005.1 uncharacterized protein LOC120070209 isoform X1 [Benincasa hispida]6.9e-9171.54Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A  HL  VL SKQN LTI+EANLLQTC SKAVRD+T GGL+GGGVT A        +    +  A  L     F  SL S VD ILAL GSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP
        ANI+VTRY N+PR MQ I KHFYYEEVFDDSTLDRPK   R RNFFS D+ HAQRT DND  +N+HGNSHHDSSNRDSSAYQSDSYG+PDDKGNALE KP
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP

Query:  VLIKRGTDAATADPLNCIFGTLAK--EEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
        VL K GTDA T DPL+CIFGTLA+  EEEIQHSSAS+PSPKSHSRSRRYNRRHR+  +TMPTNFEHV
Subjt:  VLIKRGTDAATADPLNCIFGTLAK--EEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV

TrEMBL top hitse value%identityAlignment
A0A0A0L4T1 Uncharacterized protein1.3e-8265.82Show/hide
Query:  VAEFSNRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCSGRN--------MEAEGLHSATYFCRSLNSSVDDILA
        V  F ++SS M      L +VL SK N LTI+EA LLQTC SKAVRD+TFGG+LGGG+T A     N        + A  L     F RSLNS VD ILA
Subjt:  VAEFSNRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCSGRN--------MEAEGLHSATYFCRSLNSSVDDILA

Query:  LDGSRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPD
        LDGSRMQKELANI+VTRY N+P  MQ+I KHFYYEEVFDDST DRPK   RYRNFFS D+ H+QRT+ ND+  NVH NSH     RDSSAYQ DSYG+PD
Subjt:  LDGSRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPD

Query:  DKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
        D GNA EFKPVL K GTDAATADPL+CIFGTLA++EEIQ+S+ S PSPK HSRSRRYNRRHR+D  T  TNFEHV
Subjt:  DKGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV

A0A1S3AWL2 uncharacterized protein LOC103483703 isoform X17.0e-8167.83Show/hide
Query:  LHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCSGRNM--------EAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTR
        L +VL SK N LTI+EA LLQTC SKAVRD+TFGG+LGGG+T A     N          A  L     F RSLNS VD IL+LDGSRMQKELANI+VTR
Subjt:  LHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCSGRNM--------EAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTR

Query:  YPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKPVLIKRGT
        Y N+PR MQ+I KHF+YEEVFDDST DRPK   RYRNFFS D+ H+QRT+ ND+  NVH NSH     RDSSA+Q DSYG+ DDKGNA EFKPVL K GT
Subjt:  YPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKPVLIKRGT

Query:  DAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
        D+ATADPL+CIFGTLA+EEEIQHS+ S PSPK HSRSRRYNRRHR+D +T PTNFE+V
Subjt:  DAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV

A0A6J1C8I6 uncharacterized protein LOC111009363 isoform X16.6e-7966.54Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EA LLQTC SKAVRD+TFG L GGGVT A        +    +  A  L     F RSLNS VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP
        ANI+VT+Y N+PRTMQHI KHFYYE+VFDDSTLDRP+   RYRNFFS D+ H QRT+DND   N+HGNSHH SSN DS++ Q+ SY EPDDKGNALEFKP
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP

Query:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKT
        VL K GTD ATADPL+C+FG LAK EEIQHS++S  + KSHSRSRRY+RRHRR  +T
Subjt:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKT

A0A6J1GVC2 uncharacterized protein LOC1114578787.7e-8869.43Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP
        ANI+VT+Y N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQRT+ ND  +N+HGN HHDSSNRDS+  QSDSYG+PDDKGNA EF P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP

Query:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
        VL K G DAATADPL+ IFGTL +EEEIQHSSAS+PSPKSH RS+RYNRRHRR  +TMPT+FEHV
Subjt:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV

A0A6J1IXZ4 uncharacterized protein LOC1114795421.9e-8669.06Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG L+GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLA--------VCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP
        ANI+VT+  N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS D+ HAQR + ND  +N+HGN HHDSSNRDS+  QSDSYGEPDDKGNA EF P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKP

Query:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV
        VL K G DAATADPL+ IFGTL +EEEIQHSSAS+PSPKSH RS+RYNRRHRR  +TMPT+FEHV
Subjt:  VLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05430.1 unknown protein3.3e-0627.68Show/hide
Query:  AAFIHLHHVLMSK--QNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCS--------GRNMEAEGLHSATYF-------CRSLNSSVDDILALDG
        AA   L  VL SK  Q  +T +E+  + +C  KA+    F   +GGG+T  V           R   A G+ ++T+         +   SS+D IL+ D 
Subjt:  AAFIHLHHVLMSK--QNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCS--------GRNMEAEGLHSATYF-------CRSLNSSVDDILALDG

Query:  SRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFSDIVHAQ---RTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDD
        +RMQKEL N++V          Q + KHFY E V+ D   D+P+   R R  F++I  +        +  N N   N  H   +  S A ++    + + 
Subjt:  SRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFSDIVHAQ---RTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDD

Query:  KGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSR-SRRYNRRHRRDKKTMPTN
         GN+            + A  D L+ +FG     E I     S  + K+ +R  +R  RR R   +   TN
Subjt:  KGNALEFKPVLIKRGTDAATADPLNCIFGTLAKEEEIQHSSASNPSPKSHSR-SRRYNRRHRRDKKTMPTN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTCACCGTGGATCTGACAAGCAGCCATAAGGGATCCCCAAATAATAGCATTAGGAGCAAATGGCATTGCCTCAATCACCTCAAGAGCTTCTCTCAGAAGATTTG
CACGGCCAAAGAGAGCCTCTTGAGGGCAGTCACTCTCTGTATAGCCAGAAATCATTGCGCTCCAACATATCAAGTCCTTCTCTACCATCTGATCAAACACGTAGCGAGCT
TCTCCAATCTGTCCACCTTTTGCAAGCCCAGAAACCATGGCAGTCGAAACAACCATGTTCTTGGGGGAAATCTTTTCATAGAAATCCCAAGCCAAGTCCATGGAGCCACA
GCTCGCATACATTGTGATGAGAGCACTTTACTACTAGAATTGGCCACAATATCCATGAGCAACAACATTAGCTTCATACCCATCAATCATGATGCTCCAAGCAACGACAT
CCCTGTGAGACATTTTATCAAACACCAACCGAGCTTCCATTATCCGTCCACAGGCTGCGTACATTCTAACCAAACCCGTCTCCACAAATGGGTCCGACCCAAATCCCAAC
TTCGACGCGAGCCCATGAATCTCCATCCCCGTTCTCAAGGAAAGATTCCTCGAAGCAGCTTTCAACAGCGGAGGGAAGCAGTACCTATCCAAACTCAGACCCTCCGCCCT
CATCTTCTCGTATACAAAAAGCAGACGGGTCTTGGGCTGGGGAATTTGATCAAACACAGAGAGGGCATAGTCGAGGCTAGGCGAGAGAGCACAAGAGGAAAGAATAAGTT
CAAAAAGAAGGGAATTGGAATCACAGCGTTCGAGTTTGGAGCGAAGGATTTGAGCGTGGACTTGTTTGAGGTGGAAGAGGCTGGAGGCGGAGGAGAGAGCGGCGGAGAGA
GCGGTGGGTCTGGTGGGGAAATCGGAATGCAGAGCAATGGTGGTGGTGAGGTGGATGGGGAAAATGGAGGGAAATATCAGGACGAAGACGTTGAATTCTTATACGTCGCG
GAGTTTAGTAATCGGAGCTCCGCCATGGCCGCAGCTTTCATTCATCTTCATCATGTTCTCATGTCCAAACAGAACGATTTGACGATCGATGAAGCAAATTTGCTCCAAAC
GTGTGTGTCTAAGGCTGTTCGAGATTATACCTTTGGAGGACTCCTTGGAGGTGGCGTCACATTGGCAGTTTGCTCTGGCAGGAACATGGAGGCTGAAGGTCTCCACTCGG
CTACATATTTTTGCAGGTCCCTAAATTCAAGTGTCGATGATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAATTGGCAAATATTATAGTGACGAGGTATCCCAAT
AATCCTCGTACCATGCAGCACATATTCAAGCATTTTTATTATGAGGAAGTATTTGACGATTCAACCTTGGACCGGCCAAAAAGAATGTTGCGTTATCGAAATTTCTTTAG
TGATATTGTTCATGCTCAGAGGACGAATGACAATGACCATAACGAAAACGTGCATGGAAACTCCCACCATGACTCATCCAACCGTGATTCCAGTGCCTACCAGAGTGATT
CCTATGGTGAGCCTGATGACAAAGGAAATGCCCTTGAGTTCAAGCCAGTCCTTATTAAGCGTGGCACCGATGCTGCGACCGCGGACCCTCTAAATTGTATTTTTGGTACT
TTGGCCAAAGAAGAAGAAATTCAACACTCCAGTGCCTCTAATCCATCTCCCAAATCTCACTCTCGCAGCAGAAGATACAATCGTCGGCATCGAAGAGATAAGAAGACAAT
GCCAACAAACTTTGAACATGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCTCACCGTGGATCTGACAAGCAGCCATAAGGGATCCCCAAATAATAGCATTAGGAGCAAATGGCATTGCCTCAATCACCTCAAGAGCTTCTCTCAGAAGATTTG
CACGGCCAAAGAGAGCCTCTTGAGGGCAGTCACTCTCTGTATAGCCAGAAATCATTGCGCTCCAACATATCAAGTCCTTCTCTACCATCTGATCAAACACGTAGCGAGCT
TCTCCAATCTGTCCACCTTTTGCAAGCCCAGAAACCATGGCAGTCGAAACAACCATGTTCTTGGGGGAAATCTTTTCATAGAAATCCCAAGCCAAGTCCATGGAGCCACA
GCTCGCATACATTGTGATGAGAGCACTTTACTACTAGAATTGGCCACAATATCCATGAGCAACAACATTAGCTTCATACCCATCAATCATGATGCTCCAAGCAACGACAT
CCCTGTGAGACATTTTATCAAACACCAACCGAGCTTCCATTATCCGTCCACAGGCTGCGTACATTCTAACCAAACCCGTCTCCACAAATGGGTCCGACCCAAATCCCAAC
TTCGACGCGAGCCCATGAATCTCCATCCCCGTTCTCAAGGAAAGATTCCTCGAAGCAGCTTTCAACAGCGGAGGGAAGCAGTACCTATCCAAACTCAGACCCTCCGCCCT
CATCTTCTCGTATACAAAAAGCAGACGGGTCTTGGGCTGGGGAATTTGATCAAACACAGAGAGGGCATAGTCGAGGCTAGGCGAGAGAGCACAAGAGGAAAGAATAAGTT
CAAAAAGAAGGGAATTGGAATCACAGCGTTCGAGTTTGGAGCGAAGGATTTGAGCGTGGACTTGTTTGAGGTGGAAGAGGCTGGAGGCGGAGGAGAGAGCGGCGGAGAGA
GCGGTGGGTCTGGTGGGGAAATCGGAATGCAGAGCAATGGTGGTGGTGAGGTGGATGGGGAAAATGGAGGGAAATATCAGGACGAAGACGTTGAATTCTTATACGTCGCG
GAGTTTAGTAATCGGAGCTCCGCCATGGCCGCAGCTTTCATTCATCTTCATCATGTTCTCATGTCCAAACAGAACGATTTGACGATCGATGAAGCAAATTTGCTCCAAAC
GTGTGTGTCTAAGGCTGTTCGAGATTATACCTTTGGAGGACTCCTTGGAGGTGGCGTCACATTGGCAGTTTGCTCTGGCAGGAACATGGAGGCTGAAGGTCTCCACTCGG
CTACATATTTTTGCAGGTCCCTAAATTCAAGTGTCGATGATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAATTGGCAAATATTATAGTGACGAGGTATCCCAAT
AATCCTCGTACCATGCAGCACATATTCAAGCATTTTTATTATGAGGAAGTATTTGACGATTCAACCTTGGACCGGCCAAAAAGAATGTTGCGTTATCGAAATTTCTTTAG
TGATATTGTTCATGCTCAGAGGACGAATGACAATGACCATAACGAAAACGTGCATGGAAACTCCCACCATGACTCATCCAACCGTGATTCCAGTGCCTACCAGAGTGATT
CCTATGGTGAGCCTGATGACAAAGGAAATGCCCTTGAGTTCAAGCCAGTCCTTATTAAGCGTGGCACCGATGCTGCGACCGCGGACCCTCTAAATTGTATTTTTGGTACT
TTGGCCAAAGAAGAAGAAATTCAACACTCCAGTGCCTCTAATCCATCTCCCAAATCTCACTCTCGCAGCAGAAGATACAATCGTCGGCATCGAAGAGATAAGAAGACAAT
GCCAACAAACTTTGAACATGTGTAACTTCAGGTTCTCTATCAATGCTAAGAAGATTTTATTTTCTCATTTGAAGAAATAAAAAGAGACATTTTTGCTATCCAAAGATGCC
ATAGCCATCCATAGTGCAGAGACAGTGCTCATGAGGCAGTTGTGTGGTGGTTGTTGAAATGGAAGAGCCATATTTCAGAGAACACAATAGTGGTTGAAGGTAGGACATTC
CAGTCTTTTCTCCATTTGCTTAATTCTCTAACATCCTCAGGAAGATTCGTGGGGAAAAATCTTTCATCATCCAGACCGTGTGGCGTGTTGAAGGCCTTACACAAGAAAGA
CGACTGGGATTACGCGTCAAGAATGAACGAGCTCGAAATCAACACCACCGCCGCCATGAAGGTCCACCCATTGCCGAGGAAGCGCAATATCGCCGTCAGGAATAACCCCA
CTTCGAGAAACTCTCTTGAAGATCAATCCCTTCTGAACAACCACAAGAAACTCAGGAGATTACCTCATATCTTCAGTCGGGTCCTTGAGCTTCCGTTTCGATCTGATGCG
GATGTTTTGGTGGAGGAAAATCCCGATTGTTTCCGATTCATTGCTGAAACTGACGGTAACATTAGCGATGGAGTAAGAGCTCATGCTGTGGAAATCCATCCTGGGGTTAT
TAAGATCGTTGTCCGTGAGAATGAATCGTTGGAAATGTCAATGGATGAGCTCGAATTGGACATGTGGAGGTTTCGGCTACCGGAGACGACGCGACCGGAGCTTGCGAGTG
CGGCGTTTGTTGATGGAGAGCTTATAGTTACTGTTCCAAAGGGGAATGAGGAAGAGAATTCTGAAGATGGTGGAGGAGATATCTGGGGAGATGGGAACGAGAGCTTCAGA
GATGAAATGGAAGGTCGGCTTGTTCTTTTTTAGGGACTCAATCCAATGATCAAAAGGGGTGGACATAGTCCATTCTGAGCTTATAATGGAGTCTGAAGTGTCATGAGAAC
CTTCTGTAGAATCAGCATTATCAGGCAAGAGTACGGAAGATTGATGGTGAAAATGGCTGTTGAACATACTAAATAGTTGGCAGGTTATATACATTAACCACTTTAGAGAC
ATTGTTTCTAAGAAGAGTTTGTTCGAGTATGTAGACTAATCAGACCCTCAAGTGATGAAATGAAATCTTTCTCCATAGTTATCAAGGGAGGGATTTGACAAAAAAATATT
GCCAGTGCAATTAACTGAGCCCCTAATAAATGGTATTACAATTTCTCGTGCTCATACCACTGCGAACTTATCCATACTATTTGATTTGAAGAAAAATTACCAGTGCAATT
CGCTGAGCTGTCTTCCTCCATCTTCCAAATCTCTTCCAAGCAAGTCCCAGATTTAAGCTGAAGCAAGCATTCAAAGTCAAATAGCATGGTAAAGTTCACAGAGCACATAG
GATATTGTAAAATTATTTAGTACTTGTAGAAAATACCAGTATTACCTTTTCAATGTTCAATTCAATATTTCTAAACAGCTCCTTTGCTCTCATTTTGTGTGAACCACGCA
AAATGCAGAAACCAAGGCTTAAGATCTCCTCAATATCATCTCGACAGTCAGCTGATTGACATGACTGCAAAAATCTTTCTACATGTGATACCAAATCCGTTCTGGCATCA
CAACGTCGACAATAATACTCAGCATCCAATCCAATGCTTCCTCCAACTGTCCCAGCTGTATATGATTTAAGACCACATTTTATATGAGCATGATGTCCACAAATATAACC
ATCACCCACCACTGCTTTACATTTTATGTAGCTATAACTTTCTGTGGTCGTGTCTATAATCTTGCTGCATAGTATACAGCAGCAATCACGGCAAAACCGAGGTTCGCTGC
AGCAAATATCACAGGACATGGATTTTAATGAAGATGGATTCTCTGCAACAGATAAACTATTACAGTTCTTATTTCCAGCCTTGCAACCCACTCTATCAATCTGGGACTCA
GATGCAGAGCATTCTTCCATCTCTTTTGAAGGTAGAGGGCATGAAATTTGTTTTACTTGAATACCTTGCGCTAAAGATGACTTTTTTGCTGGTATCTTCCAACTGAATGA
GGCAAAAAATGCATCAATGTCTGCATTGGGAAACTCAGACTGGATATATCTTTCAACTGAAAGCTTGCTTGCAAAACCATGCCCTTTACGAGCTGAGTTCTCAGAAGTGC
CAATACCACGAGGAGAATAAAGGTACCTATCCAGAAAATGGCCAGTTATAGCAACTCTCTTCCCCACCCTCCAACTCCAGTTATCACCAGGATTGGGCCAATTTTCAGGA
GCATATGGCAAGCCCTCCCCAGATTCATCTTGAGAAACTGGCCTAAGGATCAGTTCATTTTTCTTCGCCCTAGGTGTACAGCCATTTGTATCCTCAAGAACTTTAGTCTC
CACAGGATCCCCCGACATCTCGCTCGAAGAATTTAATAAGACATCCAGAGAACTTTAGGGTTTTCTCTGTCCAGTAGTCGTCTCTTAATGTTGTAGAGAAACTAGAGAGA
ATGAGAAAGTAGAGAGATTAAGCAGATGAATTTTTTCGCGGGAATGAGAAGATGCTGCACAAGAACGAATGCCGAAGCATAGGAAATCGAGAAAAAGGTTCTTTCCGCCG
CACAACGGGAAGAAAAAGGAGGCTCGACGCCTTTGATGGACCGACGAAGTACTGGACCTGAAGGCCCAATACGATAGGTCCAACTGATCAATGGATTTTTGTTTTTGTTT
TTTGTTTTTTAAATATTTACACACAAAAAAAAGTGAAC
Protein sequenceShow/hide protein sequence
MDLTVDLTSSHKGSPNNSIRSKWHCLNHLKSFSQKICTAKESLLRAVTLCIARNHCAPTYQVLLYHLIKHVASFSNLSTFCKPRNHGSRNNHVLGGNLFIEIPSQVHGAT
ARIHCDESTLLLELATISMSNNISFIPINHDAPSNDIPVRHFIKHQPSFHYPSTGCVHSNQTRLHKWVRPKSQLRREPMNLHPRSQGKIPRSSFQQRREAVPIQTQTLRP
HLLVYKKQTGLGLGNLIKHREGIVEARRESTRGKNKFKKKGIGITAFEFGAKDLSVDLFEVEEAGGGGESGGESGGSGGEIGMQSNGGGEVDGENGGKYQDEDVEFLYVA
EFSNRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGLLGGGVTLAVCSGRNMEAEGLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPN
NPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFSDIVHAQRTNDNDHNENVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALEFKPVLIKRGTDAATADPLNCIFGT
LAKEEEIQHSSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHV