CuGenDBv2

Clc09G15090 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G15090
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: 29 kDa ribonucleoprotein, chloroplastic
Genome location: ClcChr09:20588817..20591685
RNA-Seq Expression: Clc09G15090
Synteny: Clc09G15090
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
                  IPR035979 - RNA-binding domain superfamily
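
The genome location field can be split into sequence ID, start, and end for downstream use. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for gene records, but not stated on this page); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into its parts.

    Assumes 1-based, inclusive coordinates, so the span length
    is end - start + 1.
    """
    m = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location format: {loc!r}")
    start, end = int(m["start"]), int(m["end"])
    return m["seqid"], start, end, end - start + 1

seqid, start, end, span = parse_location("ClcChr09:20588817..20591685")
print(seqid, start, end, span)  # ClcChr09 20588817 20591685 2869
```

Under that assumption the gene spans 2869 bp of genomic sequence, which is longer than the 1158-nt CDS listed further down, consistent with introns and/or untranslated regions in the gene model.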


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022143616.1 uncharacterized protein LOC111013476 [Momordica charantia] | e-value: 2.3e-39 | % identity: 46.72
Query:  LNKFDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYRE
        L KF +D SQPH+ +YI YEIGTRFKDYR++LY++YKK+ D   ARQ P K +  E+W  LC+K ES  WKEKS K K +RSKL FNH  G K F  +RE
Subjt:  LNKFDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYRE

Query:  EKEKMVALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQ--KEMYSQDYVQSLEAGLAQTQELVESERWENERNRVIVEEALQSQRIQFE
        + E+M+ LK           +EEIM T+LGK  +YV GMGY PKP R++     YS +YV+SLEA L + +E  E  R   E+    +++AL++Q  +F 
Subjt:  EKEKMVALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQ--KEMYSQDYVQSLEAGLAQTQELVESERWENERNRVIVEEALQSQRIQFE

Query:  MQREENEKMLSQMNEILERFTEGSSSLSK
             N K+L      L     GSSS SK
Subjt:  MQREENEKMLSQMNEILERFTEGSSSLSK

XP_022148911.1 uncharacterized protein LOC111017461 [Momordica charantia] | e-value: 1.4e-36 | % identity: 46.19
Query:  DYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEKEKMV
        D SQ H+ +YI YEIGTRFKDYR++LY++YKK+ D   ARQ P K +  E+W  LC+K ES  WKEKS K K +RSKL FNH  G K F  +RE+ E+M+
Subjt:  DYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEKEKMV

Query:  ALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQ--KEMYSQDYVQSLEAGLAQTQELVESERWENERNRVIVEEALQSQRIQFEMQREEN
         LK           +EEIM T+LGK  +YV GMGY PKP R++     YS +YV+SLEA L + +E  E  R   E+    +++AL++Q  +F      N
Subjt:  ALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQ--KEMYSQDYVQSLEAGLAQTQELVESERWENERNRVIVEEALQSQRIQFEMQREEN

Query:  EKMLSQMNEILERFTEGSSSLSK
         K+L      L     GSSS SK
Subjt:  EKMLSQMNEILERFTEGSSSLSK

XP_038895319.1 uncharacterized protein LOC120083572 isoform X1 [Benincasa hispida] | e-value: 1.0e-47 | % identity: 41.28
Query:  SSSNEENIV--DTSNSPTNSQ--VRGVTRGVRLHRVIEASGGRIPISWDPFPGKPVGKVANLFSSEIGT----------------------------LNK
        S  N ++ V  +T+N  T ++  VRG +RGVRL++   A+ GRI ++W P  GKP+G +A+LF+ EIG                             LN+
Subjt:  SSSNEENIV--DTSNSPTNSQ--VRGVTRGVRLHRVIEASGGRIPISWDPFPGKPVGKVANLFSSEIGT----------------------------LNK

Query:  FDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEK-
        FD+D SQPHI++YI YEIG RFKDYR  LY++Y+K  DPVEAR++P K  T ++W  LC++WES +WKEKS + K SRSK+ FNHC G KSFLS R +K 
Subjt:  FDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEK-

Query:  ---------------------------------EKMVALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQK
                                         E M+ L++ ++H T    DEEI+  +LGK  SY+NG GY PKPPR ++
Subjt:  ---------------------------------EKMVALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQK

XP_038895320.1 uncharacterized protein LOC120083572 isoform X2 [Benincasa hispida] | e-value: 8.0e-40 | % identity: 46.77
Query:  SSSNEENIV--DTSNSPTNSQ--VRGVTRGVRLHRVIEASGGRIPISWDPFPGKPVGKVANLFSSEIGT----------------------------LNK
        S  N ++ V  +T+N  T ++  VRG +RGVRL++   A+ GRI ++W P  GKP+G +A+LF+ EIG                             LN+
Subjt:  SSSNEENIV--DTSNSPTNSQ--VRGVTRGVRLHRVIEASGGRIPISWDPFPGKPVGKVANLFSSEIGT----------------------------LNK

Query:  FDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEKE
        FD+D SQPHI++YI YEIG RFKDYR  LY++Y+K  DPVEAR++P K  T ++W  LC++WES +WKEKS + K SRSK+ FNHC G KSFLS R +K 
Subjt:  FDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEKE

Query:  K
        K
Subjt:  K

XP_038895321.1 uncharacterized protein LOC120083572 isoform X3 [Benincasa hispida] | e-value: 8.0e-40 | % identity: 46.77
Query:  SSSNEENIV--DTSNSPTNSQ--VRGVTRGVRLHRVIEASGGRIPISWDPFPGKPVGKVANLFSSEIGT----------------------------LNK
        S  N ++ V  +T+N  T ++  VRG +RGVRL++   A+ GRI ++W P  GKP+G +A+LF+ EIG                             LN+
Subjt:  SSSNEENIV--DTSNSPTNSQ--VRGVTRGVRLHRVIEASGGRIPISWDPFPGKPVGKVANLFSSEIGT----------------------------LNK

Query:  FDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEKE
        FD+D SQPHI++YI YEIG RFKDYR  LY++Y+K  DPVEAR++P K  T ++W  LC++WES +WKEKS + K SRSK+ FNHC G KSFLS R +K 
Subjt:  FDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEKE

Query:  K
        K
Subjt:  K

TrEMBL top hits (e-value, % identity, alignment)
A0A438CMH8 Uncharacterized protein | e-value: 3.0e-24 | % identity: 29.52
Query:  SPTNSQVRGVTRGVRLHRVIEASGGR-IPISWDPFPGKPVGKVANLFSSEIGTL------------------------------------NKFDMDYSQP
        +P    VRG TRGV L ++IEA+GG+ +PI+  P  GK  GK     S+EIG                                       KF +D +Q 
Subjt:  SPTNSQVRGVTRGVRLHRVIEASGGR-IPISWDPFPGKPVGKVANLFSSEIGTL------------------------------------NKFDMDYSQP

Query:  HIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVT-LEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSY------------
        H++K +E ++  RF+++R  L++++KK    VEA+++P + V+  E+W +LC+++ S  +K +S     +RSK+PF+H  G +SF+ +            
Subjt:  HIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVT-LEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSY------------

Query:  ----------------------REEKEKMVAL-KQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKP-PRSQKEMYSQDYVQSLEAGLAQTQELVES
                              R+  EKM+ L +Q        M + EI   +LG+   YV G+G+ PKP   S+    S ++   LE  L +TQ LVE+
Subjt:  ----------------------REEKEKMVAL-KQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKP-PRSQKEMYSQDYVQSLEAGLAQTQELVES

Query:  --ERWENERNRVIVEEAL-QSQRIQFEMQREE
          ++ E +++R+   EAL Q Q  Q   Q EE
Subjt:  --ERWENERNRVIVEEAL-QSQRIQFEMQREE

A0A6J1CQT5 uncharacterized protein LOC111013476 | e-value: 1.1e-39 | % identity: 46.72
Query:  LNKFDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYRE
        L KF +D SQPH+ +YI YEIGTRFKDYR++LY++YKK+ D   ARQ P K +  E+W  LC+K ES  WKEKS K K +RSKL FNH  G K F  +RE
Subjt:  LNKFDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYRE

Query:  EKEKMVALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQ--KEMYSQDYVQSLEAGLAQTQELVESERWENERNRVIVEEALQSQRIQFE
        + E+M+ LK           +EEIM T+LGK  +YV GMGY PKP R++     YS +YV+SLEA L + +E  E  R   E+    +++AL++Q  +F 
Subjt:  EKEKMVALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQ--KEMYSQDYVQSLEAGLAQTQELVESERWENERNRVIVEEALQSQRIQFE

Query:  MQREENEKMLSQMNEILERFTEGSSSLSK
             N K+L      L     GSSS SK
Subjt:  MQREENEKMLSQMNEILERFTEGSSSLSK

A0A6J1D6S9 uncharacterized protein LOC111017461 | e-value: 6.9e-37 | % identity: 46.19
Query:  DYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEKEKMV
        D SQ H+ +YI YEIGTRFKDYR++LY++YKK+ D   ARQ P K +  E+W  LC+K ES  WKEKS K K +RSKL FNH  G K F  +RE+ E+M+
Subjt:  DYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEKEKMV

Query:  ALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQ--KEMYSQDYVQSLEAGLAQTQELVESERWENERNRVIVEEALQSQRIQFEMQREEN
         LK           +EEIM T+LGK  +YV GMGY PKP R++     YS +YV+SLEA L + +E  E  R   E+    +++AL++Q  +F      N
Subjt:  ALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQ--KEMYSQDYVQSLEAGLAQTQELVESERWENERNRVIVEEALQSQRIQFEMQREEN

Query:  EKMLSQMNEILERFTEGSSSLSK
         K+L      L     GSSS SK
Subjt:  EKMLSQMNEILERFTEGSSSLSK

A0A6J1DLF1 uncharacterized protein LOC111021138 | e-value: 6.7e-32 | % identity: 49.66
Query:  LNKFDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYRE
        L KF +  SQPH+ +YI YEIGTRFKDYR++L+++YKK  D   ARQ P K +  E+W  LC++ E P WKEKS K K + SKL FNH    K F  +RE
Subjt:  LNKFDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYRE

Query:  EKEKMVALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKP
        + E+M+ LK           +EEIM T+LG+  +Y+ GMGY PKP
Subjt:  EKEKMVALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKP

A0A6J1DXU5 uncharacterized protein LOC111025525 | e-value: 6.2e-30 | % identity: 43.75
Query:  MSSSNEENIVDTSNSPTNSQVRGVTRGVRLHRVIEASGGRIPISWDPFPGKPVGKVANLFSSEIG----------------------------TLNKFDM
        MSS +E    DT    T +Q  G TRG  L RV+    G+I + W    G+PVG  +  F+SEIG                             L KF +
Subjt:  MSSSNEENIVDTSNSPTNSQVRGVTRGVRLHRVIEASGGRIPISWDPFPGKPVGKVANLFSSEIG----------------------------TLNKFDM

Query:  DYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSY
        D SQPH+ +YI YEIGTRFKDYR++L+++YKK  DP  ARQ P K +  E W  LC++WESP WKEKS + K +RSKL FNH  G K FL +
Subjt:  DYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVEARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSY

SwissProt top hits (e-value, % identity, alignment)
P49314 31 kDa ribonucleoprotein, chloroplastic | e-value: 3.5e-06 | % identity: 44.09
Query:  NEILERFTEGSSSLSKSSTTTCVKSSPSPR-----FVQNVAVSSDYNQEEDTLEADGEDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        N  L  F+  SSSL+ S +++ +  S S +     F   VA+ SD++Q ED +E   E   ++  LKLFVGNL FSVDSA L GLFE  G +E
Subjt:  NEILERFTEGSSSLSKSSTTTCVKSSPSPR-----FVQNVAVSSDYNQEEDTLEADGEDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE

Q08935 29 kDa ribonucleoprotein A, chloroplastic | e-value: 3.5e-06 | % identity: 44.44
Query:  SSSLSKSSTTTCVKSSPSPRFVQNVAVSSDYNQEEDTLEADG---EDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        S +LS SS+++C  S    RFV+ V + SD++Q ED  + D    E+ +++  LK+FVGNL FS DSA L  LFE  G +E
Subjt:  SSSLSKSSTTTCVKSSPSPRFVQNVAVSSDYNQEEDTLEADG---EDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE

Q08937 29 kDa ribonucleoprotein B, chloroplastic | e-value: 7.9e-06 | % identity: 45.65
Query:  NEILERFTEGSS----SLSKSSTTTCVKSSPSPRFVQNVAVSSDYNQEEDTLEADGEDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        N  L  F+  SS    SLS SST  C        F   VA+S  ++Q ED +E   E   ++  LKLFVGNL FSVDSA L GLFE  G +E
Subjt:  NEILERFTEGSS----SLSKSSTTTCVKSSPSPRFVQNVAVSSDYNQEEDTLEADGEDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE

Q43349 29 kDa ribonucleoprotein, chloroplastic | e-value: 5.6e-12 | % identity: 54.22
Query:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        S+  SS   C   +  P RFV+NVAVSSD+  EED + ADG+DS      S++  LKLFVGNL F+VDSAQL  LFES G +E
Subjt:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE

Q9ZUU4 RNA-binding protein CP29B, chloroplastic | e-value: 2.6e-09 | % identity: 51.52
Query:  SSPSPRFVQNVAVSSDYNQEEDTLE--ADGEDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        +SP+ RF +NVA++S++  EED     A  ++ S++  LKLFVGNL F+VDSAQL  LFES G +E
Subjt:  SSPSPRFVQNVAVSSDYNQEEDTLE--ADGEDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
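
The % identity values in these tables can be related to the alignment blocks. In BLAST-style output the middle row shows a letter where query and subject residues are identical and `+` for a conservative substitution. The sketch below assumes identity is reported as identical columns divided by total alignment columns (gaps included); `percent_identity` is a hypothetical helper, and the strings are copied from the Q9ZUU4 block above.

```python
def percent_identity(midline, aligned_length):
    """Percent identity from a BLAST-style midline.

    Letters in the midline mark identical columns; '+' and spaces
    do not count. The denominator is the alignment length in columns.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return round(100 * identical / aligned_length, 2)

# Query row and midline of the Q9ZUU4 hit (66 alignment columns).
query = "SSPSPRFVQNVAVSSDYNQEEDTLE--ADGEDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE"
midline = "+SP+ RF +NVA++S++  EED     A  ++ S++  LKLFVGNL F+VDSAQL  LFES G +E"

print(percent_identity(midline, len(query)))  # 51.52
```

Here 34 identical columns out of 66 give 51.52, matching the value listed for this hit.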

Arabidopsis top hits (e-value, % identity, alignment)
AT2G37220.1 RNA-binding (RRM/RBD/RNP motifs) family protein | e-value: 1.9e-10 | % identity: 51.52
Query:  SSPSPRFVQNVAVSSDYNQEEDTLE--ADGEDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        +SP+ RF +NVA++S++  EED     A  ++ S++  LKLFVGNL F+VDSAQL  LFES G +E
Subjt:  SSPSPRFVQNVAVSSDYNQEEDTLE--ADGEDSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE

AT3G53460.1 chloroplast RNA-binding protein 29 | e-value: 4.0e-13 | % identity: 54.22
Query:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        S+  SS   C   +  P RFV+NVAVSSD+  EED + ADG+DS      S++  LKLFVGNL F+VDSAQL  LFES G +E
Subjt:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE

AT3G53460.2 chloroplast RNA-binding protein 29 | e-value: 4.0e-13 | % identity: 54.22
Query:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        S+  SS   C   +  P RFV+NVAVSSD+  EED + ADG+DS      S++  LKLFVGNL F+VDSAQL  LFES G +E
Subjt:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE

AT3G53460.3 chloroplast RNA-binding protein 29 | e-value: 4.0e-13 | % identity: 54.22
Query:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        S+  SS   C   +  P RFV+NVAVSSD+  EED + ADG+DS      S++  LKLFVGNL F+VDSAQL  LFES G +E
Subjt:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE

AT3G53460.4 chloroplast RNA-binding protein 29 | e-value: 4.0e-13 | % identity: 54.22
Query:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE
        S+  SS   C   +  P RFV+NVAVSSD+  EED + ADG+DS      S++  LKLFVGNL F+VDSAQL  LFES G +E
Subjt:  SLSKSSTTTCVKSSPSP-RFVQNVAVSSDYNQEEDTLEADGEDS------SYALHLKLFVGNLHFSVDSAQLTGLFESVGQIE


Sequences
CDS sequence
ATGTCATCAAGTAATGAAGAGAACATTGTAGACACGAGTAACAGTCCAACTAATAGCCAAGTTCGTGGTGTTACACGAGGAGTCAGATTACATCGTGTGATTGAGGCAAG
TGGAGGAAGAATACCTATCTCATGGGATCCTTTCCCCGGGAAACCAGTTGGAAAAGTTGCGAATCTTTTTAGCAGTGAGATTGGTACACTGAATAAATTTGACATGGATT
ACTCTCAACCACACATCAGAAAGTACATTGAATACGAGATTGGTACTCGCTTTAAGGACTATAGATCAAGATTGTACCAGTATTATAAAAAATTGGGTGATCCGGTCGAA
GCTCGCCAACATCCATGTAAAGGGGTAACACTTGAAGAATGGCAATTTCTGTGTAATAAATGGGAGTCTCCTACATGGAAGGAAAAGTCAGAAAAAACAAAAAGTAGCAG
AAGTAAACTCCCTTTCAATCATTGTGCTGGAATGAAATCATTTCTGTCTTATCGAGAAGAAAAGGAAAAAATGGTTGCCTTAAAACAAAAGCAAGAACACTCAACCACAC
TAATGAATGATGAAGAAATTATGGCAACCATTTTAGGGAAAAATCCTTCGTATGTTAATGGGATGGGATACGAACCAAAACCTCCACGAAGTCAAAAAGAAATGTACTCA
CAAGATTATGTCCAATCACTAGAGGCAGGCCTTGCACAGACTCAAGAACTTGTGGAGAGTGAACGTTGGGAGAATGAGAGAAATAGAGTCATTGTGGAAGAAGCTCTACA
AAGTCAACGTATTCAATTTGAGATGCAGCGTGAAGAAAACGAGAAAATGCTCAGTCAAATGAATGAAATTCTAGAAAGGTTCACCGAGGGAAGTTCATCTCTTAGTAAGT
CTTCAACCACCACCTGCGTCAAGTCCTCACCTTCTCCTCGCTTTGTCCAAAATGTTGCTGTCTCATCCGATTACAACCAGGAAGAAGACACTCTTGAGGCTGATGGAGAA
GATTCTAGCTACGCTCTTCACCTCAAACTGTTCGTCGGAAATCTTCATTTCAGCGTTGACAGTGCTCAACTCACCGGCTTGTTTGAAAGTGTAGGACAAATTGAAGGGGC
CAAAAGCTTCAAGCTCCTTCGAGCAGCGTCATTCGAGCCCATAACGCCGTTGAAGTAG
mRNA sequence
ATGTCATCAAGTAATGAAGAGAACATTGTAGACACGAGTAACAGTCCAACTAATAGCCAAGTTCGTGGTGTTACACGAGGAGTCAGATTACATCGTGTGATTGAGGCAAG
TGGAGGAAGAATACCTATCTCATGGGATCCTTTCCCCGGGAAACCAGTTGGAAAAGTTGCGAATCTTTTTAGCAGTGAGATTGGTACACTGAATAAATTTGACATGGATT
ACTCTCAACCACACATCAGAAAGTACATTGAATACGAGATTGGTACTCGCTTTAAGGACTATAGATCAAGATTGTACCAGTATTATAAAAAATTGGGTGATCCGGTCGAA
GCTCGCCAACATCCATGTAAAGGGGTAACACTTGAAGAATGGCAATTTCTGTGTAATAAATGGGAGTCTCCTACATGGAAGGAAAAGTCAGAAAAAACAAAAAGTAGCAG
AAGTAAACTCCCTTTCAATCATTGTGCTGGAATGAAATCATTTCTGTCTTATCGAGAAGAAAAGGAAAAAATGGTTGCCTTAAAACAAAAGCAAGAACACTCAACCACAC
TAATGAATGATGAAGAAATTATGGCAACCATTTTAGGGAAAAATCCTTCGTATGTTAATGGGATGGGATACGAACCAAAACCTCCACGAAGTCAAAAAGAAATGTACTCA
CAAGATTATGTCCAATCACTAGAGGCAGGCCTTGCACAGACTCAAGAACTTGTGGAGAGTGAACGTTGGGAGAATGAGAGAAATAGAGTCATTGTGGAAGAAGCTCTACA
AAGTCAACGTATTCAATTTGAGATGCAGCGTGAAGAAAACGAGAAAATGCTCAGTCAAATGAATGAAATTCTAGAAAGGTTCACCGAGGGAAGTTCATCTCTTAGTAAGT
CTTCAACCACCACCTGCGTCAAGTCCTCACCTTCTCCTCGCTTTGTCCAAAATGTTGCTGTCTCATCCGATTACAACCAGGAAGAAGACACTCTTGAGGCTGATGGAGAA
GATTCTAGCTACGCTCTTCACCTCAAACTGTTCGTCGGAAATCTTCATTTCAGCGTTGACAGTGCTCAACTCACCGGCTTGTTTGAAAGTGTAGGACAAATTGAAGGGGC
CAAAAGCTTCAAGCTCCTTCGAGCAGCGTCATTCGAGCCCATAACGCCGTTGAAGTAG
Protein sequence
MSSSNEENIVDTSNSPTNSQVRGVTRGVRLHRVIEASGGRIPISWDPFPGKPVGKVANLFSSEIGTLNKFDMDYSQPHIRKYIEYEIGTRFKDYRSRLYQYYKKLGDPVE
ARQHPCKGVTLEEWQFLCNKWESPTWKEKSEKTKSSRSKLPFNHCAGMKSFLSYREEKEKMVALKQKQEHSTTLMNDEEIMATILGKNPSYVNGMGYEPKPPRSQKEMYS
QDYVQSLEAGLAQTQELVESERWENERNRVIVEEALQSQRIQFEMQREENEKMLSQMNEILERFTEGSSSLSKSSTTTCVKSSPSPRFVQNVAVSSDYNQEEDTLEADGE
DSSYALHLKLFVGNLHFSVDSAQLTGLFESVGQIEGAKSFKLLRAASFEPITPLK
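
As a consistency check between the CDS and protein sequences above, the CDS can be translated codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the Python standard library; `translate` is a hypothetical helper, and only the first 30 nucleotides and the last 12 nucleotides of the CDS are used to keep the example short.

```python
import itertools

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCATCAAGTAATGAAGAGAACATTGTA"))  # MSSSNEENIV
print(translate("CCGTTGAAGTAG"))                    # PLK
```

Both fragments agree with the start (MSSSNEENIV) and end (PLK) of the listed protein sequence. Passing the full 1158-nt CDS, with line breaks removed, should reproduce the 385-residue protein under the same assumption.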