; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC05G086860 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC05G086860
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionMitochondrial intermediate peptidase
Genome locationCicolChr05:4820026..4842920
RNA-Seq ExpressionCcUC05G086860
SyntenyCcUC05G086860
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582303.1 hypothetical protein SDJN03_22305, partial [Cucurbita argyrosperma subsp. sororia]2.0e-8567.78Show/hide
Query:  NRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSR
        +RSS M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG  +GGGVT A        V    +  A  L     F RSL+S VD ILALDGSR
Subjt:  NRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSR

Query:  MQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNA
        MQKELANI+VT+Y N+PRTMQ I KHF+YEEVFDDSTLDRPK   RYRNFFS DV HAQRT+ ND KDN+HGN HHDSSNRDS+  Q DSYG+PDDKGNA
Subjt:  MQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNA

Query:  LVFKPVLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM
          F PVL K G DAATADPLD IFGT+ +EEE QH SAS+PSPKSH RS+RYNRRHRR  +TMPT+FEH+
Subjt:  LVFKPVLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM

XP_022956077.1 uncharacterized protein LOC111457878 [Cucurbita moschata]6.7e-8668.68Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG  +GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP
        ANI+VT+Y N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS DV HAQRT+ ND KDN+HGN HHDSSNRDS+  QSDSYG+PDDKGNA  F P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP

Query:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM
        VL K G DAATADPLD IFGT+ +EEE QH SAS+PSPKSH RS+RYNRRHRR  +TMPT+FEH+
Subjt:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM

XP_022980008.1 uncharacterized protein LOC111479542 [Cucurbita maxima]1.7e-8468.3Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG  +GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP
        ANI+VT+  N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS DV HAQR + ND KDN+HGN HHDSSNRDS+  QSDSYGEPDDKGNA  F P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP

Query:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM
        VL K G DAATADPLD IFGT+ +EEE QH SAS+PSPKSH RS+RYNRRHRR  +TMPT+FEH+
Subjt:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM

XP_023527180.1 uncharacterized protein LOC111790494 isoform X1 [Cucurbita pepo subsp. pepo]6.7e-8668.68Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG  +GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP
        ANI+VT+Y N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS DV HAQRT+ ND KDN+HGN HHDSSNRDS+  QSDSYG+PDDKGNA  F P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP

Query:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM
        VL K G DAATADPLD IFGT+ +EEE QH SAS+PSPKSH RS+RYNRRHRR  +TMPT+FEH+
Subjt:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM

XP_038878005.1 uncharacterized protein LOC120070209 isoform X1 [Benincasa hispida]1.3e-8970.79Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A  HL  VL SKQN LTI+EANLLQTC SKAVRD+T GG +GGGVT A        +    +  A  L     F  SL S VD ILAL GSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP
        ANI+VTRY N+PR MQ I KHFYYEEVFDDSTLDRPK   R RNFFS DV HAQRT DND KDN+HGNSHHDSSNRDSSAYQSDSYG+PDDKGNAL  KP
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP

Query:  VLIKRGTDAATADPLDCIFGTVAKEEES--QHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM
        VL K GTDA T DPLDCIFGT+A+EEE   QH SAS+PSPKSHSRSRRYNRRHR+  +TMPTNFEH+
Subjt:  VLIKRGTDAATADPLDCIFGTVAKEEES--QHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM

TrEMBL top hitse value%identityAlignment
A0A0A0L4T1 Uncharacterized protein2.0e-8064.73Show/hide
Query:  VAEFSNRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLAVCSGRN--------MEAEDLHSATYFCRSLNSSVDDILA
        V  F ++SS M      L +VL SK N LTI+EA LLQTC SKAVRD+TFGG LGGG+T A     N        + A  L     F RSLNS VD ILA
Subjt:  VAEFSNRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLAVCSGRN--------MEAEDLHSATYFCRSLNSSVDDILA

Query:  LDGSRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPD
        LDGSRMQKELANI+VTRY N+P  MQ+I KHFYYEEVFDDST DRPK   RYRNFFS DV H+QRT+ ND  +NVH NSH     RDSSAYQ DSYG+PD
Subjt:  LDGSRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPD

Query:  DKGNALVFKPVLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM
        D GNA  FKPVL K GTDAATADPLDCIFGT+A++EE Q+ + S PSPK HSRSRRYNRRHR+D  T  TNFEH+
Subjt:  DKGNALVFKPVLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM

A0A1S3AWL2 uncharacterized protein LOC103483703 isoform X11.5e-7866.67Show/hide
Query:  LHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLAVCSGRNM--------EAEDLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTR
        L +VL SK N LTI+EA LLQTC SKAVRD+TFGG LGGG+T A     N          A  L     F RSLNS VD IL+LDGSRMQKELANI+VTR
Subjt:  LHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLAVCSGRNM--------EAEDLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTR

Query:  YPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKPVLIKRGT
        Y N+PR MQ+I KHF+YEEVFDDST DRPK   RYRNFFS DV H+QRT+ ND  +NVH NSH     RDSSA+Q DSYG+ DDKGNA  FKPVL K GT
Subjt:  YPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKPVLIKRGT

Query:  DAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM
        D+ATADPLDCIFGT+A+EEE QH + S PSPK HSRSRRYNRRHR+D +T PTNFE++
Subjt:  DAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM

A0A6J1C8I6 uncharacterized protein LOC111009363 isoform X11.6e-7765.76Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EA LLQTC SKAVRD+TFG   GGGVT A        +    +  A  L     F RSLNS VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP
        ANI+VT+Y N+PRTMQHI KHFYYE+VFDDSTLDRP+   RYRNFFS DV H QRT+DND K+N+HGNSHH SSN DS++ Q+ SY EPDDKGNAL FKP
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP

Query:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKT
        VL K GTD ATADPLDC+FG +AK EE QH ++S  + KSHSRSRRY+RRHRR  +T
Subjt:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKT

A0A6J1GVC2 uncharacterized protein LOC1114578783.2e-8668.68Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG  +GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP
        ANI+VT+Y N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS DV HAQRT+ ND KDN+HGN HHDSSNRDS+  QSDSYG+PDDKGNA  F P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP

Query:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM
        VL K G DAATADPLD IFGT+ +EEE QH SAS+PSPKSH RS+RYNRRHRR  +TMPT+FEH+
Subjt:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM

A0A6J1IXZ4 uncharacterized protein LOC1114795428.0e-8568.3Show/hide
Query:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL
        M  A   L  VL SKQN LTI+EAN+LQTC SKAVRD+TFG  +GGGVT A        V    +  A  L     F RSL+S VD ILALDGSRMQKEL
Subjt:  MAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLA--------VCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKEL

Query:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP
        ANI+VT+  N+PRTMQHI KHF+YEEVFDDSTLDRPK   RYRNFFS DV HAQR + ND KDN+HGN HHDSSNRDS+  QSDSYGEPDDKGNA  F P
Subjt:  ANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFS-DVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKP

Query:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM
        VL K G DAATADPLD IFGT+ +EEE QH SAS+PSPKSH RS+RYNRRHRR  +TMPT+FEH+
Subjt:  VLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05430.1 unknown protein2.1e-0528.04Show/hide
Query:  AAFIHLHHVLMSK--QNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLAVC----SGRNMEAEDLH---SATYFCRSLN--------SSVDDILALDG
        AA   L  VL SK  Q  +T +E+  + +C  KA+    F   +GGG+T  V       + +E   L    +A+ F  + N        SS+D IL+ D 
Subjt:  AAFIHLHHVLMSK--QNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLAVC----SGRNMEAEDLH---SATYFCRSLN--------SSVDDILALDG

Query:  SRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFSDVVHA-QRTNDNDHKDNVHG--NSHHDSSNRDSSAYQSDSYGEPDD
        +RMQKEL N++V          Q + KHFY E V+ D   D+P+   R R  F+++  +    N    + N +G  N  H   +  S A ++    + + 
Subjt:  SRMQKELANIIVTRYPNNPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFSDVVHA-QRTNDNDHKDNVHG--NSHHDSSNRDSSAYQSDSYGEPDD

Query:  KGNALVFKPVLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSR-SRRYNRRHRRDKKTMPTN
         GN+            + A  D LD +FG     E       S  + K+ +R  +R  RR R   +   TN
Subjt:  KGNALVFKPVLIKRGTDAATADPLDCIFGTVAKEEESQHFSASNPSPKSHSR-SRRYNRRHRRDKKTMPTN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTCACCGTGGATCTGACAAGCAGCCATAAGGGATCCCCAAATAATAGCATTAGGAGCAAATGGCATTGCCTCAATCACCTCAAGAGCTTCTCTCAGAAGATTTG
CGCGGCCAAAGAGAGCCTCTTGAGGGCAGTCACTCTCTGTATATCCAGAAATCATTGCGCTCCAACATATCAAGTCCTTCTCTACCATCTGATCAAACACATAGCGAGCT
TCTCCAATCTGTCCACCTTTTGCAAGCCCAGAAACCATGGCAGTCGAAACAACCATGTTCTTGGGGGAAATCTTTTCATAGAAATCCCAAGCCAAGTCCATGGAGCCACA
GCTCGCATACATTGTGATGAGAGCACTTTACTACTAAAATTGGCCACGATATCCATGAGCAACAACATTAGCTTCATACCCATCAATCATGATGCTCCAAGCAACGACAT
CCCTGTGAGACATTTTATCAAACACCAACCGAGCTTCCATTATCCGTCCACAGGCTGCGTACATTCTAACCAAACCCGTCTCCACAAATGGGTCCGACCCAAATCCCAAC
TTCGACGCGAGCCCATGAATCTCCATCCCTGTTCTCAAGGAAAGATTCCTCGAAGCAGCTTTCAACAGCGGAGGGAAGCAGTACCTATCCAAACTCAGACCCTCCGCCCT
CATCTTCTCGTATACGAAAAGCAGACGGGTCTTGGGCTGGGGAATTTGATCAAACACAGAGAGGGCATAGTCGAGGGTAGGCGAGAGAGCACAAGAGGAAAGAATAAGTT
CAAAAAGAAGGGAATTGGAATCATAGCGTTCGAGTTTGGAGCGAAGGATTTGAGCGTGGACTTGTTTGAGGTGGAAGAGGCTGGAGGCGGAGGAGAGAGCGGCGGAGAGA
GCGGTGGGTCTGGCGGGGAAATCGGAATGCAGAGCAATGGTGGTGGTGAGGTGGATGGGGAAAATGGAGGGAAATATCAGGACGAAGACGTTGAATTCTTATACGTCGCG
GAGTTTAGTAATCGGAGCTCCGCCATGGCCGCAGCTTTCATTCATCTTCATCATGTTCTCATGTCCAAACAGAACGATTTGACGATCGATGAAGCAAATTTGCTCCAAAC
GTGTGTGTCTAAGGCTGTTCGAGATTATACCTTTGGAGGATTCCTTGGAGGTGGCGTCACATTGGCAGTTTGCTCTGGCAGGAACATGGAGGCTGAAGATCTCCACTCGG
CTACATATTTTTGCAGGTCCCTAAATTCAAGTGTCGATGATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAATTGGCAAATATTATAGTGACGAGATATCCCAAT
AATCCTCGTACCATGCAGCACATATTCAAGCATTTTTATTATGAGGAAGTATTTGACGATTCAACCTTGGACCGGCCAAAAAGAATGTTGCGTTATCGAAATTTCTTTAG
TGATGTTGTTCATGCTCAGAGGACGAATGACAATGACCATAAAGACAACGTGCATGGAAACTCCCACCATGACTCATCCAACCGTGATTCCAGTGCCTACCAGAGTGATT
CCTATGGTGAGCCTGATGACAAAGGAAATGCCCTTGTGTTCAAGCCAGTCCTTATTAAGCGTGGCACCGATGCTGCGACCGCAGACCCTCTAGATTGTATTTTTGGTACT
GTGGCCAAAGAAGAAGAAAGTCAACACTTCAGTGCCTCTAATCCATCTCCCAAATCTCACTCTCGCAGCAGAAGATACAATCGTCGGCATCGAAGAGATAAGAAGACAAT
GCCAACAAACTTTGAACATATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCTCACCGTGGATCTGACAAGCAGCCATAAGGGATCCCCAAATAATAGCATTAGGAGCAAATGGCATTGCCTCAATCACCTCAAGAGCTTCTCTCAGAAGATTTG
CGCGGCCAAAGAGAGCCTCTTGAGGGCAGTCACTCTCTGTATATCCAGAAATCATTGCGCTCCAACATATCAAGTCCTTCTCTACCATCTGATCAAACACATAGCGAGCT
TCTCCAATCTGTCCACCTTTTGCAAGCCCAGAAACCATGGCAGTCGAAACAACCATGTTCTTGGGGGAAATCTTTTCATAGAAATCCCAAGCCAAGTCCATGGAGCCACA
GCTCGCATACATTGTGATGAGAGCACTTTACTACTAAAATTGGCCACGATATCCATGAGCAACAACATTAGCTTCATACCCATCAATCATGATGCTCCAAGCAACGACAT
CCCTGTGAGACATTTTATCAAACACCAACCGAGCTTCCATTATCCGTCCACAGGCTGCGTACATTCTAACCAAACCCGTCTCCACAAATGGGTCCGACCCAAATCCCAAC
TTCGACGCGAGCCCATGAATCTCCATCCCTGTTCTCAAGGAAAGATTCCTCGAAGCAGCTTTCAACAGCGGAGGGAAGCAGTACCTATCCAAACTCAGACCCTCCGCCCT
CATCTTCTCGTATACGAAAAGCAGACGGGTCTTGGGCTGGGGAATTTGATCAAACACAGAGAGGGCATAGTCGAGGGTAGGCGAGAGAGCACAAGAGGAAAGAATAAGTT
CAAAAAGAAGGGAATTGGAATCATAGCGTTCGAGTTTGGAGCGAAGGATTTGAGCGTGGACTTGTTTGAGGTGGAAGAGGCTGGAGGCGGAGGAGAGAGCGGCGGAGAGA
GCGGTGGGTCTGGCGGGGAAATCGGAATGCAGAGCAATGGTGGTGGTGAGGTGGATGGGGAAAATGGAGGGAAATATCAGGACGAAGACGTTGAATTCTTATACGTCGCG
GAGTTTAGTAATCGGAGCTCCGCCATGGCCGCAGCTTTCATTCATCTTCATCATGTTCTCATGTCCAAACAGAACGATTTGACGATCGATGAAGCAAATTTGCTCCAAAC
GTGTGTGTCTAAGGCTGTTCGAGATTATACCTTTGGAGGATTCCTTGGAGGTGGCGTCACATTGGCAGTTTGCTCTGGCAGGAACATGGAGGCTGAAGATCTCCACTCGG
CTACATATTTTTGCAGGTCCCTAAATTCAAGTGTCGATGATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAATTGGCAAATATTATAGTGACGAGATATCCCAAT
AATCCTCGTACCATGCAGCACATATTCAAGCATTTTTATTATGAGGAAGTATTTGACGATTCAACCTTGGACCGGCCAAAAAGAATGTTGCGTTATCGAAATTTCTTTAG
TGATGTTGTTCATGCTCAGAGGACGAATGACAATGACCATAAAGACAACGTGCATGGAAACTCCCACCATGACTCATCCAACCGTGATTCCAGTGCCTACCAGAGTGATT
CCTATGGTGAGCCTGATGACAAAGGAAATGCCCTTGTGTTCAAGCCAGTCCTTATTAAGCGTGGCACCGATGCTGCGACCGCAGACCCTCTAGATTGTATTTTTGGTACT
GTGGCCAAAGAAGAAGAAAGTCAACACTTCAGTGCCTCTAATCCATCTCCCAAATCTCACTCTCGCAGCAGAAGATACAATCGTCGGCATCGAAGAGATAAGAAGACAAT
GCCAACAAACTTTGAACATATGTAATTTCAGGTTCTCTATCAATGCTAAGAAGATTTTATTTTCTCATTTGAAGAAATAAAAAGAGACATTTTTGTACCTAACCATACAA
TTGAAGAAGGAATGTTAAGTGTCGGCATCGAAGAGATAAGAAGACAATGCCAACAAACTTTGAACATGTGTAATTTCAGGCTATACAAAGAAGCCATAGCCATCCATAGT
GCAGAGACAGTGCTCATGAGGCAGTTGTGTGGTGGTTGTTGAAATGGAAGAGCCATATTTCAGAGAACACAATAGTGGTTGAAGGTAGGACATTACAGTCTTTTCTCCAT
TTGCTTAATTCTCTAACATCCTCAGGAAGATTCGTGGGGAAAAGTCTTTCATCATCCAGACCGTGTGGCGTGTTGAAGGCCTTACAGAAGAAAGACGACTAGGAATACGC
GTCAAGAATGAACGAGCTCGAAATCAACGCCACCGCCGCCATGAAGGTCCACCCATTGCCGAGGAAGCGCAATATCGCCGTCAGGAATAACCCCACTTCGAGAAACTCTC
TTGAAGATCAATCCCTTCTGAACAACCACAAGAAACTCAGGAGATTACCTCATATCTTCAGTCGGGTCCTTGAGCTTCCGTTTCGATCTGATGCGGATGTTTTGGTGGAG
GAAAATCCCGATTGTTTCCGATTCATTGCTGAAACTGACGGTAACATTAGCGATGGAGTAAGAGCTCATGCGGTGGAAATCCATCCTGGGGTTATTAAGATCGTTGTCCG
TGAGAATGAATCGTTGGAAATGTCAATGGATGAGCTTGAATTGGACATGTGGAGGTTTCGGCTACCGGAGACGACGCGACCGGAGCTTGCGAGTGCGGCGTTTGTTGATG
GAGAGCTTATAGTTACTGTTCCAAAGGGGAATGAGGAAGAGAATTCTGAAGATGGTGGAGGAGATATCTGGGGAGATGGGAACGAGAGCTTCAGAGATGAAATGGAAGGT
CGGCTTGTTCTTTTTTAGGGACTCAATCCAATAATCAAAAGGGGTGGACATAGTCCATTCTGAGCTTATAATGGACTCTGAAGTGTCATGAGAACCTTCTGTAGAATCAG
CATTATCAGGCAAGAGTACGGAAGATTGATGGTGAAAATGGCTGTTGAACATCCTAAATAGTTGGCAGGTTATATACATTAACCACTTTAGAGACATTGTTTCCAAGAAG
AGTTTGTTCGAGTATGTAGACTAATCAGACCCTCAAGTGATGAAATGAAATCTTTCTCCATAGTTATCAAGGGAGGGATTTGACAAAAAAATATTGCCAGTGCAATTAAC
TGAGAACCTAATAAATGGTATTACAATTTCTCGTGCTCATACCATGTGGGAACTTATCCATACTATTTGATTTGGAGAAAAATTACCAGTGCAATTCGCTGAGCTGTCTC
CCTCCATCTTCCAAATCTCTTCCAAGCAAGTCCCAGATTTAAGCTGAAGCAAGCATTCAAAGTCAAATAGCATGGTAAAGTTCACAGAGCAGCATAGGATATTGTAATAT
TATTTAGTACTTGTAAAAAATACCAGTATTACCTTTTCAATGTTCACTTCAATATTTCTAAACAACTCCTTTGCTCTCATTTTGTGTGAACCACGCAAAATGCAGAAACC
AAGGCTTAAGATCTCCTCAATATCATCTCGACAGTCAGCTGATTGACATGACTGCAAAAATCTTTCTACATGTGATACCAAATCCGTTCTGGCATCACAACGTCGACAAT
AATACTCAGCATCCAATCCAATGCTTCCTCCAACTGTCCCAGCTGTATATGATTTAAGACCACATTTTATATGAGCATGATGTCCACAAATATAACCATCACCCACCACT
GCTTTACATTTTATGTAGCTATAACTTTCTGTGGTCGTGTCTATAATCTTGCTGCATAGTATACAGCAGCAATCACGGCAAAACCGAGGTTCGCTGCAGCAAATATCACA
GGACATGGATTTTAATGAAGATGGATTCTCTGCAACAGATAAACTATTACAGTTCTTATTTCCAGCCTTGCAACCCACTCTATCAGTCTGGGACTCAGATGCAGAGCATT
CTTCCATCTCTTTTGAGGGTAGAGGGCATGAAATTTGTTTTACTTGAATACCTTGCGCTAAAGATGACTTTTTTGCTGGTATCTTCCAACTGAATGAGGCAAAAAATGCA
TCAATGTCTGCATTGGGAAACTCAGACTGGATATATCTTTCAACTGAAAGCTTGCTTGCAAAACCATGCCCTTTACGAGCCGAGTTCTCAGAAGTGCCAATACCACGAGG
AGAATAAAGGTACCTATCCAGAAAATGGCCAGTTATAGCAACTCTCTTCCCCACCCTCCAACTCCAGTTATCACCAGGATTGGGCCAATTTTCAGGAGCATATGGCAAGC
CCTCCCCAGATTCATCTTGAGAAACTGGCCTAAGGATCAGTTCATTTTTATTCGCCCTAGGTGTGCAGCCATTTGTATCCTCAAGAACTTCAGTCTCCACAGGATCCCCC
GACATCTCGCTCGAAGAATTTAATAAGACATCCAGAGAACTTTAGGGTTTGCTCTGTCCAGTAGTCGTCTCTTAATATTGTAGAGAAACTAGAGAGAATGAGAAAGTAGA
GAGATTAAGCAGATGAATTTTTTCGCGGGAATGAGAAGATGCTGCACGAGAACGAATGCCGAAGCATAGCAAATCGAGAAAAAGGTTCTTTCCGCCGCACAACGGGAAGA
AAAAGGAGGCTCGCCTTTGATGGACCGACCAAGTACTGGACCTGAAGGCCCAATACGATAGGTCCAGCTGATCAATGGTTTTTTGTTTTTTAAATATTTACACAAAAAAA
AAGTGAAC
Protein sequenceShow/hide protein sequence
MDLTVDLTSSHKGSPNNSIRSKWHCLNHLKSFSQKICAAKESLLRAVTLCISRNHCAPTYQVLLYHLIKHIASFSNLSTFCKPRNHGSRNNHVLGGNLFIEIPSQVHGAT
ARIHCDESTLLLKLATISMSNNISFIPINHDAPSNDIPVRHFIKHQPSFHYPSTGCVHSNQTRLHKWVRPKSQLRREPMNLHPCSQGKIPRSSFQQRREAVPIQTQTLRP
HLLVYEKQTGLGLGNLIKHREGIVEGRRESTRGKNKFKKKGIGIIAFEFGAKDLSVDLFEVEEAGGGGESGGESGGSGGEIGMQSNGGGEVDGENGGKYQDEDVEFLYVA
EFSNRSSAMAAAFIHLHHVLMSKQNDLTIDEANLLQTCVSKAVRDYTFGGFLGGGVTLAVCSGRNMEAEDLHSATYFCRSLNSSVDDILALDGSRMQKELANIIVTRYPN
NPRTMQHIFKHFYYEEVFDDSTLDRPKRMLRYRNFFSDVVHAQRTNDNDHKDNVHGNSHHDSSNRDSSAYQSDSYGEPDDKGNALVFKPVLIKRGTDAATADPLDCIFGT
VAKEEESQHFSASNPSPKSHSRSRRYNRRHRRDKKTMPTNFEHM