; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G029560 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G029560
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionUnknown protein
Genome locationCma_Chr04:19442151..19442888
RNA-Seq ExpressionCmaCh04G029560
SyntenyCmaCh04G029560
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022955490.1 uncharacterized protein LOC111457498 isoform X1 [Cucurbita moschata]7.5e-10092.42Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
        MD GEAARRMLVGSN GK IINPITMRFRPIAPKPVAGASVSGDTSSI  GRRKRKYVRVQGYNRKKKTTRSN  TTEAAATTLQLLPERHVEIFGSGS 
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP

Query:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC
        EVGLVSDGKRGAAAEEVWVTVECAR ACIEG +EEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQR+RSSEAGEEIE+RLRT PRLGYLGRAFTC
Subjt:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC

Query:  QMKLHFKAGTT
        Q+KLHF AGTT
Subjt:  QMKLHFKAGTT

XP_022955497.1 uncharacterized protein LOC111457498 isoform X2 [Cucurbita moschata]7.5e-10092.42Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
        MD GEAARRMLVGSN GK IINPITMRFRPIAPKPVAGASVSGDTSSI  GRRKRKYVRVQGYNRKKKTTRSN  TTEAAATTLQLLPERHVEIFGSGS 
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP

Query:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC
        EVGLVSDGKRGAAAEEVWVTVECAR ACIEG +EEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQR+RSSEAGEEIE+RLRT PRLGYLGRAFTC
Subjt:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC

Query:  QMKLHFKAGTT
        Q+KLHF AGTT
Subjt:  QMKLHFKAGTT

XP_022955504.1 uncharacterized protein LOC111457498 isoform X3 [Cucurbita moschata]7.5e-10092.42Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
        MD GEAARRMLVGSN GK IINPITMRFRPIAPKPVAGASVSGDTSSI  GRRKRKYVRVQGYNRKKKTTRSN  TTEAAATTLQLLPERHVEIFGSGS 
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP

Query:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC
        EVGLVSDGKRGAAAEEVWVTVECAR ACIEG +EEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQR+RSSEAGEEIE+RLRT PRLGYLGRAFTC
Subjt:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC

Query:  QMKLHFKAGTT
        Q+KLHF AGTT
Subjt:  QMKLHFKAGTT

XP_022989872.1 uncharacterized protein LOC111486935 [Cucurbita maxima]9.4e-135100Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
        MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP

Query:  EVGLVSDGKRGAAAEEVWVTVECARGACIEGDEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTCQ
        EVGLVSDGKRGAAAEEVWVTVECARGACIEGDEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTCQ
Subjt:  EVGLVSDGKRGAAAEEVWVTVECARGACIEGDEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTCQ

Query:  MKLHFKAGTTDMVVPCDAWRIAGGGTAWKLDLNAALSLAPPLQQD
        MKLHFKAGTTDMVVPCDAWRIAGGGTAWKLDLNAALSLAPPLQQD
Subjt:  MKLHFKAGTTDMVVPCDAWRIAGGGTAWKLDLNAALSLAPPLQQD

XP_023543904.1 uncharacterized protein LOC111803631 [Cucurbita pepo subsp. pepo]1.3e-9690.82Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSN---TTTTEAAATTLQLLPERHVEIFGS
        MD GEAARRMLVGSN G  IINPI MRFRPIAPKPVAGASVSGDTSSI KGRRKRKYVRVQGYNRKKK TRSN   TTTTEAAATTLQLLPERHVEIFGS
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSN---TTTTEAAATTLQLLPERHVEIFGS

Query:  GSPEVGLVSDGKRGAAAEEVWVTVECARGACIEGDEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAF
        GSPE GLVSD KRGAAAEEVWVTVECAR ACIEG+EEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQR+RSSEAGEEIE+RLRTRP LGYLGR+F
Subjt:  GSPEVGLVSDGKRGAAAEEVWVTVECARGACIEGDEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAF

Query:  TCQMKLH
        TCQ+KLH
Subjt:  TCQMKLH

TrEMBL top hitse value%identityAlignment
A0A6J1GTT8 uncharacterized protein LOC111457498 isoform X23.6e-10092.42Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
        MD GEAARRMLVGSN GK IINPITMRFRPIAPKPVAGASVSGDTSSI  GRRKRKYVRVQGYNRKKKTTRSN  TTEAAATTLQLLPERHVEIFGSGS 
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP

Query:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC
        EVGLVSDGKRGAAAEEVWVTVECAR ACIEG +EEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQR+RSSEAGEEIE+RLRT PRLGYLGRAFTC
Subjt:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC

Query:  QMKLHFKAGTT
        Q+KLHF AGTT
Subjt:  QMKLHFKAGTT

A0A6J1GV91 uncharacterized protein LOC111457498 isoform X13.6e-10092.42Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
        MD GEAARRMLVGSN GK IINPITMRFRPIAPKPVAGASVSGDTSSI  GRRKRKYVRVQGYNRKKKTTRSN  TTEAAATTLQLLPERHVEIFGSGS 
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP

Query:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC
        EVGLVSDGKRGAAAEEVWVTVECAR ACIEG +EEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQR+RSSEAGEEIE+RLRT PRLGYLGRAFTC
Subjt:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC

Query:  QMKLHFKAGTT
        Q+KLHF AGTT
Subjt:  QMKLHFKAGTT

A0A6J1GWG6 uncharacterized protein LOC111457498 isoform X33.6e-10092.42Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
        MD GEAARRMLVGSN GK IINPITMRFRPIAPKPVAGASVSGDTSSI  GRRKRKYVRVQGYNRKKKTTRSN  TTEAAATTLQLLPERHVEIFGSGS 
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP

Query:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC
        EVGLVSDGKRGAAAEEVWVTVECAR ACIEG +EEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQR+RSSEAGEEIE+RLRT PRLGYLGRAFTC
Subjt:  EVGLVSDGKRGAAAEEVWVTVECARGACIEG-DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTC

Query:  QMKLHFKAGTT
        Q+KLHF AGTT
Subjt:  QMKLHFKAGTT

A0A6J1JH06 uncharacterized protein LOC1114869354.5e-135100Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
        MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSP

Query:  EVGLVSDGKRGAAAEEVWVTVECARGACIEGDEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTCQ
        EVGLVSDGKRGAAAEEVWVTVECARGACIEGDEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTCQ
Subjt:  EVGLVSDGKRGAAAEEVWVTVECARGACIEGDEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTCQ

Query:  MKLHFKAGTTDMVVPCDAWRIAGGGTAWKLDLNAALSLAPPLQQD
        MKLHFKAGTTDMVVPCDAWRIAGGGTAWKLDLNAALSLAPPLQQD
Subjt:  MKLHFKAGTTDMVVPCDAWRIAGGGTAWKLDLNAALSLAPPLQQD

A0A6J1JZR6 uncharacterized protein LOC1114889111.6e-6353.16Show/hide
Query:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVS-------GDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEA-------------A
        MD G AA R ++GS CGK II+PI MRFRPIAPKPV G SVS         TSSI K R KRKYVRV+ YNRKKKTTR+ T +T+               
Subjt:  MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVS-------GDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEA-------------A

Query:  ATTLQLLPERHVEIFGSG--------SPEVG-------LVSDGKRGAAAEEVWVTVECARGACI-----EGD----EEERMKKLEEDTCPGFVSDGVDRV
         TTLQLLPERH+E+ G G          EVG       LVSD KRGA   E+WVTVEC R AC+     EG+    +EER+K L+EDTCPGFVS+G++RV
Subjt:  ATTLQLLPERHVEIFGSG--------SPEVG-------LVSDGKRGAAAEEVWVTVECARGACI-----EGD----EEERMKKLEEDTCPGFVSDGVDRV

Query:  EWVNEALKRMVRQ-RQR------SSEAG-EEIEIRLRTRPRLGYLGRAFTCQMKLHFKAGT----TDMVVPCDAWRIAGGGTAWKLDLNAALSLAPPLQQ
        EW+N+A KRMVRQ RQR       SE+G  E+ + L + P+LG   R F+CQ+KL FK  T       VVPCDAWR+  GG AWKLD+ AAL+LAP L +
Subjt:  EWVNEALKRMVRQ-RQR------SSEAG-EEIEIRLRTRPRLGYLGRAFTCQMKLHFKAGT----TDMVVPCDAWRIAGGGTAWKLDLNAALSLAPPLQQ

Query:  D
        D
Subjt:  D

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27250.1 unknown protein1.9e-2432.7Show/hide
Query:  KAIINPITMRFRPIAPKPVAG-----ASVSGDTSSILKGRRKRKYVRV----QGYNRKKKTTRSNTTTTEAAATTLQLLPERH-----------------
        K  ++ + +R+RPIAPKP  G     A  + ++S  +  R KRKYVRV    +G  R K  +  +    +    TLQLLPE+                  
Subjt:  KAIINPITMRFRPIAPKPVAG-----ASVSGDTSSILKGRRKRKYVRV----QGYNRKKKTTRSNTTTTEAAATTLQLLPERH-----------------

Query:  -VEIFGSGSPEVGLVSDGKRGAAAE-EVWVTVECARGACIEGD---------EEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEI
           I G  + E            AE E WVTVE     C EG          + E +  L +DTCP FVSDG +RV WVNEA +R V     ++    E+
Subjt:  -VEIFGSGSPEVGLVSDGKRGAAAE-EVWVTVECARGACIEGD---------EEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEI

Query:  EIRLRTRPRLGYLG---RAFTCQMKLHF--KAGTTDMVVPCDAWRIAGGGTAWKLDLNAALSL
         + L        +    +AFTC++++ +  K       VPCD W++  GG AW+LD  AAL+L
Subjt:  EIRLRTRPRLGYLG---RAFTCQMKLHF--KAGTTDMVVPCDAWRIAGGGTAWKLDLNAALSL

AT3G48510.1 unknown protein1.6e-2333.96Show/hide
Query:  ITMRFRPIAPKPVAGASV-----------SGDTSSILKGRRKRK--YVRVQGYNRKKKTTRSNTTTT----EAAATTLQLLPERHVEIFGS-------GS
        I +RFRPIAPKP +   V           S D S     RRKRK   ++  G N K+ T R  +        A A TL LLPE+ +++  +       G 
Subjt:  ITMRFRPIAPKPVAGASV-----------SGDTSSILKGRRKRK--YVRVQGYNRKKKTTRSNTTTT----EAAATTLQLLPERHVEIFGS-------GS

Query:  PEVGLVSDGKRGAAAEEV------------WVTVECARGACIEG-----DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIE-
          +     G     A +              +TVE    A I+G      +EER   L  DTCPGF+SDG  RV W N+A ++M R      E   EI  
Subjt:  PEVGLVSDGKRGAAAEEV------------WVTVECARGACIEG-----DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIE-

Query:  -------IRLRTRPRLGYLGRAFTCQMKLHFKAGTTD---MVVPCDAWRI-AGGGTAWKLDLNAALSL
               +RL  R R       FTC+MKL +     +   + VPCD WR+  GGG AW+LD+ AAL L
Subjt:  -------IRLRTRPRLGYLGRAFTCQMKLHFKAGTTD---MVVPCDAWRI-AGGGTAWKLDLNAALSL

AT5G40790.1 unknown protein3.8e-2533.79Show/hide
Query:  KAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRR---KRKYVRVQ------GYNRKKKTT-------RSNTTT---------TEAAATTLQLLPERHV
        K +I+ I  RFRPIAPKP  G S     S    GR    KRKYVRV+      G N KK  T       R N  T               TLQLLPE+  
Subjt:  KAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRR---KRKYVRVQ------GYNRKKKTT-------RSNTTT---------TEAAATTLQLLPERHV

Query:  EIFGSGSPE--------------------VGLVSDGKRGAAAEEVWVTVECARGAC--------------IEGDEEERMKKLEEDTCPGFVSDGVDRVEW
        +I  +G                       +GL S   R     E W+TVEC    C              ++  EE  M+ LE DTCP  VSDG +RV W
Subjt:  EIFGSGSPE--------------------VGLVSDGKRGAAAEEVWVTVECARGAC--------------IEGDEEERMKKLEEDTCPGFVSDGVDRVEW

Query:  VNEALKRMVRQRQ---------RSSEAGEEIEIRLRTRPRLGYLGRAFTCQMKLHFKAGT-TDMVVPCDAWRIAGGGTAWKLDLNAALSL
        VN A +RM+              + +  EEI   +        L  A TC++++ ++  T   M VPCD WRI  GG AW+LD+ +AL L
Subjt:  VNEALKRMVRQRQ---------RSSEAGEEIEIRLRTRPRLGYLGRAFTCQMKLHFKAGT-TDMVVPCDAWRIAGGGTAWKLDLNAALSL

AT5G40800.1 unknown protein1.9e-2434.11Show/hide
Query:  KAIINPITMRFRPIAPKP-VAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAAT----------TLQLLPERHVEI---FGSGSPEVG
        K  ++ + +++RPIAPKP   G  + GDTSS    R KRKYVRV   N  K T RS T    +++T          TLQLLPER   +     +  P V 
Subjt:  KAIINPITMRFRPIAPKP-VAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAAT----------TLQLLPERHVEI---FGSGSPEVG

Query:  LVS----------------DGKRGAAAEEVWVTVECARGACIEG----DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIR
         ++                D        E WVTVE      +       +EE    L++DTCPGF+SDG +RV  VNEA +R+V         G E+ + 
Subjt:  LVS----------------DGKRGAAAEEVWVTVECARGACIEG----DEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIR

Query:  LRTRPRLGYLG-RAFTCQMKLHFKAGTT--DMVVPCDAWRIAGGGTAWKLDLNAALSL
        L       +   R FTC++++ +    T     +PCD W++  GG AW+LD  AAL+L
Subjt:  LRTRPRLGYLG-RAFTCQMKLHFKAGTT--DMVVPCDAWRIAGGGTAWKLDLNAALSL

AT5G63350.1 unknown protein6.5e-2534.36Show/hide
Query:  GSNCGKAIINPITMRFRPIAPKPVA-GASV-----SGDTS---------SILKGRRKRKYVR------VQGYNRKKKTTRSNTTTTEAAATTLQLLPERH
        G   G +  + I +RFRPIAPKP + G SV     SG T+         S   GR KRKY +       +  N+KK+    +T TT   A TL LLPE  
Subjt:  GSNCGKAIINPITMRFRPIAPKPVA-GASV-----SGDTS---------SILKGRRKRKYVR------VQGYNRKKKTTRSNTTTTEAAATTLQLLPERH

Query:  VE---------------------IFGSGSPEV---GLVSDGKRGAAAEEVWVTVECARGACIEG-----DEEERMKKLEEDTCPGFVSDGVDRVEWVNEA
         +                     +  +G  E+      ++  R        VTVE    A I+G       +ER   L EDTCPGF+SDGV RV W NEA
Subjt:  VE---------------------IFGSGSPEV---GLVSDGKRGAAAEEVWVTVECARGACIEG-----DEEERMKKLEEDTCPGFVSDGVDRVEWVNEA

Query:  LKRMVRQ----------RQRSSEAGEEIEIRL--RTRPRLGYLGRAFTCQMKLHFKAGTTD---MVVPCDAWRIAGGGTAWKLDLNAALSL
         K+M R+           +  S     + +RL  + RP L Y   AFTC+++L +     +   + VPCD WR+ GGG AW+LD+ AAL L
Subjt:  LKRMVRQ----------RQRSSEAGEEIEIRL--RTRPRLGYLGRAFTCQMKLHFKAGTTD---MVVPCDAWRIAGGGTAWKLDLNAALSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTGGTGAAGCGGCGCGTCGGATGCTTGTCGGATCTAATTGTGGTAAGGCGATTATCAATCCCATAACGATGAGATTCCGACCGATCGCGCCGAAGCCTGTCGC
CGGAGCCTCTGTTTCCGGCGACACCTCATCGATATTGAAAGGAAGGAGGAAGAGGAAGTACGTTAGGGTTCAGGGGTATAATAGGAAGAAGAAAACTACAAGATCAAACA
CCACCACCACCGAGGCCGCCGCGACGACGCTGCAACTGCTCCCGGAACGACACGTGGAGATCTTCGGAAGCGGATCTCCGGAGGTGGGGTTGGTATCAGATGGGAAGCGT
GGAGCGGCGGCGGAGGAGGTGTGGGTGACGGTAGAATGCGCGAGGGGGGCGTGCATTGAGGGAGACGAGGAGGAGAGGATGAAGAAGCTAGAAGAGGACACGTGTCCAGG
GTTCGTATCGGACGGTGTGGATAGAGTAGAGTGGGTAAACGAGGCGTTGAAAAGAATGGTGAGGCAAAGGCAGAGAAGCAGTGAGGCAGGTGAGGAAATAGAGATAAGGC
TGAGAACAAGGCCAAGATTGGGTTACTTGGGGAGAGCATTCACGTGCCAAATGAAGCTGCACTTCAAAGCTGGTACTACGGACATGGTGGTGCCCTGTGATGCGTGGAGA
ATAGCGGGCGGAGGGACTGCATGGAAACTCGACCTCAATGCCGCCCTCTCTCTAGCTCCTCCACTTCAACAAGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCCTGGTGAAGCGGCGCGTCGGATGCTTGTCGGATCTAATTGTGGTAAGGCGATTATCAATCCCATAACGATGAGATTCCGACCGATCGCGCCGAAGCCTGTCGC
CGGAGCCTCTGTTTCCGGCGACACCTCATCGATATTGAAAGGAAGGAGGAAGAGGAAGTACGTTAGGGTTCAGGGGTATAATAGGAAGAAGAAAACTACAAGATCAAACA
CCACCACCACCGAGGCCGCCGCGACGACGCTGCAACTGCTCCCGGAACGACACGTGGAGATCTTCGGAAGCGGATCTCCGGAGGTGGGGTTGGTATCAGATGGGAAGCGT
GGAGCGGCGGCGGAGGAGGTGTGGGTGACGGTAGAATGCGCGAGGGGGGCGTGCATTGAGGGAGACGAGGAGGAGAGGATGAAGAAGCTAGAAGAGGACACGTGTCCAGG
GTTCGTATCGGACGGTGTGGATAGAGTAGAGTGGGTAAACGAGGCGTTGAAAAGAATGGTGAGGCAAAGGCAGAGAAGCAGTGAGGCAGGTGAGGAAATAGAGATAAGGC
TGAGAACAAGGCCAAGATTGGGTTACTTGGGGAGAGCATTCACGTGCCAAATGAAGCTGCACTTCAAAGCTGGTACTACGGACATGGTGGTGCCCTGTGATGCGTGGAGA
ATAGCGGGCGGAGGGACTGCATGGAAACTCGACCTCAATGCCGCCCTCTCTCTAGCTCCTCCACTTCAACAAGATTAG
Protein sequenceShow/hide protein sequence
MDPGEAARRMLVGSNCGKAIINPITMRFRPIAPKPVAGASVSGDTSSILKGRRKRKYVRVQGYNRKKKTTRSNTTTTEAAATTLQLLPERHVEIFGSGSPEVGLVSDGKR
GAAAEEVWVTVECARGACIEGDEEERMKKLEEDTCPGFVSDGVDRVEWVNEALKRMVRQRQRSSEAGEEIEIRLRTRPRLGYLGRAFTCQMKLHFKAGTTDMVVPCDAWR
IAGGGTAWKLDLNAALSLAPPLQQD