; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G09030 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G09030
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationClcChr03:9694865..9704581
RNA-Seq ExpressionClc03G09030
SyntenyClc03G09030
Gene Ontology termsNA
InterPro domainsIPR011992 - EF-hand domain pair
IPR018247 - EF-Hand 1, calcium-binding site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]1.1e-3638.3Show/hide
Query:  VAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEI-----------------------------
        +A ED+  + +T++DY   +V      I++ PINANNFELKP+LI M + + F G+  +DP+ HL    EI                             
Subjt:  VAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEI-----------------------------

Query:  ---------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKT
                           FL+KFF  +KT+ LR +IG FKQ + E L+EAWERYK+L+R+CPQH   DWLQ+Q+FYNGL+G T++I++ A+GG++ SKT
Subjt:  ---------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKT

Query:  VDATRSLLEDMVVTSYNWPSKQS-TSKVAELYELD
         +   +LLE+M   +Y WP++++   KVA ++EL+
Subjt:  VDATRSLLEDMVVTSYNWPSKQS-TSKVAELYELD

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.2e-4037.01Show/hide
Query:  MRQSQRAKLLPFNPEIKLTLRKVSRNLKAEFVAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQ
        MR+++   ++P +PEI+ TLR + RN K   +A ED+  + +T++DY   +V      I++ PINANNFELKP+LI M + + F G+  +DP+ HL    
Subjt:  MRQSQRAKLLPFNPEIKLTLRKVSRNLKAEFVAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQ

Query:  EI--------------------------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLD
        EI                                                FL+KFF  +KT+ LR +IG FKQ + E L+EAWERYK+L+R+CPQH   D
Subjt:  EI--------------------------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLD

Query:  WLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQS-TSKVAELYELDEPPSGFTDSNPNAEKKSSMEDLLTEFIKESKNRT
        WLQ+Q+FYNGL+G T++I++ A+GG++ SKT +   +LLE+M   +Y WP++++   KVA +++L EP +    S   A     +  L T+ I +S   T
Subjt:  WLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQS-TSKVAELYELDEPPSGFTDSNPNAEKKSSMEDLLTEFIKESKNRT

Query:  NLLESTIM
          L ST M
Subjt:  NLLESTIM

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]2.0e-5448.85Show/hide
Query:  LLPFNPEIKLTLRKVSRNLKAEF-VAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYG---
        LLP +PEI  T R   RNL+A      E   E+ K IRDYF   +P  +P I+  PIN NNFELKP LIQMA++  FRG +NEDPHKHLRS  EI G   
Subjt:  LLPFNPEIKLTLRKVSRNLKAEF-VAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYG---

Query:  -----------------------------------------MAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLF
                                                  AFL+K+F  +K+  LR +IGTF+Q EDEQL+EAWERYK+LLR+CPQH Y DWLQIQLF
Subjt:  -----------------------------------------MAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLF

Query:  YNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQST---SKVAELYELDE
        YNGL+ +TKSIL+  AGGSIFSK      ++LED+  TSYNWP ++++    K A LYE+DE
Subjt:  YNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQST---SKVAELYELDE

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]3.1e-3928.12Show/hide
Query:  TIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYG---------------------------MAFLSKFF-LS
        TIRDY     P     I+  PINANN ELKP LIQM +++ FRG + EDP+ HL    ++ G                            AFL+ FF  +
Subjt:  TIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYG---------------------------MAFLSKFF-LS

Query:  KTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTS-KV
        KT+ LR +I +F++ + EQLFE WERYKELLRKCPQH  L+WLQIQ+FYNGL+G T++IL+ AAGG++ S+T +    LL+DM   S+ WPS++S + KV
Subjt:  KTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTS-KV

Query:  AELYELDEPPS------------------GFTDSNP-----------------------NAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKTIQNME
        A +YE+DE  S                  G + SN                         AEKKSS+EDLL  FI E ++R +            I+N  
Subjt:  AELYELDEPPS------------------GFTDSNP-----------------------NAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKTIQNME

Query:  EGEQAVVEPDLDERRENEEWNVQEKMEEVRMVVCRLHMPTILRTRPRGEPTPYAYHLEDKTEWGAGDIIIQDGIHLFSSLGRKTMLDTWSRQEMHDKGET
        EG +  +E +    +     N++ ++ ++         PT L T  +G+   +   +E K       + ++ G  L              +++M +   T
Subjt:  EGEQAVVEPDLDERRENEEWNVQEKMEEVRMVVCRLHMPTILRTRPRGEPTPYAYHLEDKTEWGAGDIIIQDGIHLFSSLGRKTMLDTWSRQEMHDKGET

Query:  HRRKSRKEIFEFLRERAKRRQKKEILASVVEVCGQEKIF----LTTQSSYVK--EKVLRLRGQPYVQDELNIQDERRRVQERSTHPKEEARRVKLHLASK
           +  KE  E ++E     Q  +  +S+V        +    L    +YV+  + ++  + +    + +N+ +E   + +R    K +         S 
Subjt:  HRRKSRKEIFEFLRERAKRRQKKEILASVVEVCGQEKIF----LTTQSSYVK--EKVLRLRGQPYVQDELNIQDERRRVQERSTHPKEEARRVKLHLASK

Query:  EQDALEQMSNYVKFMKDVLSRRSLR----MENVLIKVDKFIFPEDFVVLDMEE------------------------------------------SLKYA
                S++ K + D+ +  +L     +E+VL+KVD+ IFP DFVVL  EE                                          ++KY 
Subjt:  EQDALEQMSNYVKFMKDVLSRRSLR----MENVLIKVDKFIFPEDFVVLDMEE------------------------------------------SLKYA

Query:  DHDYTRHSIDIVDRNVAKFSELVLSTDQLSQSYHTVLFNTIDTDCEGKVSYQDLLTQL
        +   T H ID+VD  VA+    V   D   +         I    +    Y D L QL
Subjt:  DHDYTRHSIDIVDRNVAKFSELVLSTDQLSQSYHTVLFNTIDTDCEGKVSYQDLLTQL

XP_022883666.1 uncharacterized protein LOC111400483 [Olea europaea var. sylvestris]1.3e-3737.41Show/hide
Query:  GSIMCMRQSQRAKLLPFNPEIKLT---LRKVSRNLKAEFVARE----DQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTS
        G+ + MR+++   LLP +PE + T   LR + RN +     ++    ++    K I DY   +V      I +  I ANNFELKP LI M + + F G +
Subjt:  GSIMCMRQSQRAKLLPFNPEIKLT---LRKVSRNLKAEFVARE----DQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTS

Query:  NEDPHKHLRSLQEIYGMA-----FLSKFFL-SKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSI
         EDP+ HL S  EI         FL+K+F  SK++ L  +I  FKQ + E  +EAWER+K+LLR+CPQH +  W+QI++FYNGL+G T+ +++ AAGG +
Subjt:  NEDPHKHLRSLQEIYGMA-----FLSKFFL-SKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSI

Query:  FSKTVDATRSLLEDMVVTSYNWPSKQS-TSKVAELYELDEPPSGFTDSNPNAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKT
         +K  +A  +LL+D+   SY WPS++S   KVA L+E+D P +        A+  S    ++T  +    N+  ++ ST  SH++T
Subjt:  FSKTVDATRSLLEDMVVTSYNWPSKQS-TSKVAELYELDEPPSGFTDSNPNAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKT

TrEMBL top hitse value%identityAlignment
A0A061E3H0 RT_RNaseH domain-containing protein1.3e-3041.45Show/hide
Query:  QTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSI-FRGTSNEDPHKHLRSLQEI------YGMAFLSKFF-LSKTSNLRIKIGTFKQ
        Q E TK +RDY    V +L   I + PI ANNFE+KPS+I+M +    F G  N+D + H+ +  +I          F + FF L+KT+ +R  I  F Q
Subjt:  QTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSI-FRGTSNEDPHKHLRSLQEI------YGMAFLSKFF-LSKTSNLRIKIGTFKQ

Query:  KEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQ-STSKVAELYELD
         +   L+ AWERYK+L+R+CP H    WLQ+Q FYNGL G  ++ ++  AGG++ SK++D T  LL++M   +Y WPS++ ST K+A ++ LD
Subjt:  KEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQ-STSKVAELYELD

A0A3S3N117 Retrotrans_gag domain-containing protein3.0e-3233.46Show/hide
Query:  MRQSQRAKLLPFNPEIKLTLRKVSRNLK--AEFVAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSI-FRGTSNEDPHKHLR
        MR++Q   L+P +PEI+ TLR++ +  K  +EF   E + +  +++ DY   +V      I +  I ANNFE+KP++IQM   ++ F G  ++DP+ H+ 
Subjt:  MRQSQRAKLLPFNPEIKLTLRKVSRNLK--AEFVAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSI-FRGTSNEDPHKHLR

Query:  SLQEI--------------------------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHN
        +  E+                                                FL+KFF  +KT  +R  I TF Q E E L+EAWERYKELLRKCP H 
Subjt:  SLQEI--------------------------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHN

Query:  YLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQ-STSKVAELYELD
           W+Q+Q FYNGL   T++ ++ A GG++  K+ +    L+E+M   +Y WPS      K+  ++ELD
Subjt:  YLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQ-STSKVAELYELD

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129457.9e-3335.79Show/hide
Query:  MRQSQRAKLLPFNPEIKLTLRKVSR-NLKAEFVAREDQT-----------------EVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDS
        M++     L+PF+P+I+ T R+  R NL+   VA  +QT                 E  + +RDY   +V  L   I +  INANNFE+KP+ IQM + S
Subjt:  MRQSQRAKLLPFNPEIKLTLRKVSR-NLKAEFVAREDQT-----------------EVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDS

Query:  I-FRGTSNEDPHKHLRSLQEI--------------------------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFE
        + F G  ++DP+ HL +  EI                                                FL+KFF  +KT+ +R  I +F Q + E L+E
Subjt:  I-FRGTSNEDPHKHLRSLQEI--------------------------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFE

Query:  AWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTS-KVAELYELD
        AWER+KELLR+CP H   DWLQ+Q FYNGL G+ K+I++ AAGG++ SK      +LLE+M   +Y WPS++S S K    YE+D
Subjt:  AWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTS-KVAELYELD

A0A6J0ZYV0 uncharacterized protein LOC1104134131.0e-3235.79Show/hide
Query:  MRQSQRAKLLPFNPEIKLTLRKVSR-NLKAEFVAREDQT-----------------EVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDS
        M++     L+PF+P+I+ T R+  R NL+   VA  +QT                 E  + +RDY   +V  L   I +  INANNFE+KP+ IQM + S
Subjt:  MRQSQRAKLLPFNPEIKLTLRKVSR-NLKAEFVAREDQT-----------------EVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDS

Query:  I-FRGTSNEDPHKHLRSLQEI--------------------------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFE
        + F G  ++DP+ HL +  EI                                                FL+KFF  +KT+ +R  I +F Q + E L+E
Subjt:  I-FRGTSNEDPHKHLRSLQEI--------------------------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFE

Query:  AWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTS-KVAELYELD
        AWER+KELLR+CP H   DWLQ+Q FYNGL G+ K+I++ AAGG++ SK      +LLE+M   +Y WPS++S S K    YE+D
Subjt:  AWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTS-KVAELYELD

A0A6J1DU19 uncharacterized protein LOC1110243611.5e-3928.12Show/hide
Query:  TIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYG---------------------------MAFLSKFF-LS
        TIRDY     P     I+  PINANN ELKP LIQM +++ FRG + EDP+ HL    ++ G                            AFL+ FF  +
Subjt:  TIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYG---------------------------MAFLSKFF-LS

Query:  KTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTS-KV
        KT+ LR +I +F++ + EQLFE WERYKELLRKCPQH  L+WLQIQ+FYNGL+G T++IL+ AAGG++ S+T +    LL+DM   S+ WPS++S + KV
Subjt:  KTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLSGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTS-KV

Query:  AELYELDEPPS------------------GFTDSNP-----------------------NAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKTIQNME
        A +YE+DE  S                  G + SN                         AEKKSS+EDLL  FI E ++R +            I+N  
Subjt:  AELYELDEPPS------------------GFTDSNP-----------------------NAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKTIQNME

Query:  EGEQAVVEPDLDERRENEEWNVQEKMEEVRMVVCRLHMPTILRTRPRGEPTPYAYHLEDKTEWGAGDIIIQDGIHLFSSLGRKTMLDTWSRQEMHDKGET
        EG +  +E +    +     N++ ++ ++         PT L T  +G+   +   +E K       + ++ G  L              +++M +   T
Subjt:  EGEQAVVEPDLDERRENEEWNVQEKMEEVRMVVCRLHMPTILRTRPRGEPTPYAYHLEDKTEWGAGDIIIQDGIHLFSSLGRKTMLDTWSRQEMHDKGET

Query:  HRRKSRKEIFEFLRERAKRRQKKEILASVVEVCGQEKIF----LTTQSSYVK--EKVLRLRGQPYVQDELNIQDERRRVQERSTHPKEEARRVKLHLASK
           +  KE  E ++E     Q  +  +S+V        +    L    +YV+  + ++  + +    + +N+ +E   + +R    K +         S 
Subjt:  HRRKSRKEIFEFLRERAKRRQKKEILASVVEVCGQEKIF----LTTQSSYVK--EKVLRLRGQPYVQDELNIQDERRRVQERSTHPKEEARRVKLHLASK

Query:  EQDALEQMSNYVKFMKDVLSRRSLR----MENVLIKVDKFIFPEDFVVLDMEE------------------------------------------SLKYA
                S++ K + D+ +  +L     +E+VL+KVD+ IFP DFVVL  EE                                          ++KY 
Subjt:  EQDALEQMSNYVKFMKDVLSRRSLR----MENVLIKVDKFIFPEDFVVLDMEE------------------------------------------SLKYA

Query:  DHDYTRHSIDIVDRNVAKFSELVLSTDQLSQSYHTVLFNTIDTDCEGKVSYQDLLTQL
        +   T H ID+VD  VA+    V   D   +         I    +    Y D L QL
Subjt:  DHDYTRHSIDIVDRNVAKFSELVLSTDQLSQSYHTVLFNTIDTDCEGKVSYQDLLTQL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCCCATTACGCTTGAGACCCAAGAGCAAGTTTTGTGCACTGTTGTTGGTGACGATCCGAGGCCTGAAGATGAAGGTCAGAGGTCAAGATTGGGAGATGGC
AATGGTGAAGGGAGTATCATGTGTATGCGACAAAGCCAACGAGCTAAACTTCTGCCTTTCAATCCAGAAATCAAATTAACTTTAAGAAAAGTAAGCAGGAATTTG
AAAGCAGAATTTGTCGCCAGGGAAGATCAAACAGAAGTCACTAAGACTATTAGAGATTACTTCCACTTGATCGTCCCTACATTGCGACCTAGAATAGTAAAGGCT
CCAATTAATGCAAACAACTTTGAGCTGAAACCTAGTTTAATTCAAATGGCAAAGGACAGCATATTCAGAGGGACTTCAAACGAAGATCCACATAAACATCTTCGA
TCTCTTCAAGAAATTTATGGAATGGCTTTCCTTAGCAAATTTTTCCTATCTAAGACGAGCAACTTGAGAATAAAAATTGGAACATTCAAGCAGAAAGAGGATGAG
CAATTGTTCGAGGCTTGGGAGCGTTATAAAGAATTGTTGAGAAAGTGTCCCCAACACAATTATCTTGACTGGCTTCAGATTCAGTTGTTCTATAATGGGTTGTCA
GGAACGACGAAGTCCATCCTAAATGTTGCAGCTGGTGGATCAATCTTTTCTAAAACTGTTGATGCTACTCGATCACTTTTAGAAGACATGGTTGTCACCAGTTAC
AACTGGCCATCAAAACAATCAACATCCAAAGTGGCTGAGCTCTATGAGCTTGATGAGCCTCCTTCAGGTTTTACAGATTCTAATCCTAATGCTGAGAAGAAATCA
TCTATGGAGGACTTGCTAACTGAATTTATCAAAGAGTCGAAAAATCGAACCAACTTATTAGAGAGCACTATTATGAGTCATGAGAAGACCATTCAAAATATGGAG
GAAGGAGAGCAAGCGGTAGTGGAACCTGACTTGGACGAGAGAAGAGAAAATGAGGAGTGGAATGTTCAGGAGAAGATGGAAGAGGTGAGAATGGTCGTTTGCCGA
CTCCATATGCCTACCATCTTGAGGACAAGACCGAGGGGGGAGCCGACTCCATATGCCTACCATCTTGAGGACAAGACCGAATGGGGAGCTGGGGACATAATTATA
CAAGATGGAATTCACTTATTCTCATCTTTAGGTAGAAAGACCATGTTGGACACATGGTCAAGACAGGAAATGCATGATAAAGGTGAAACGCATAGGAGGAAATCA
AGGAAGGAGATTTTTGAATTCTTGAGAGAAAGGGCAAAGAGAAGGCAGAAGAAAGAAATCCTTGCGTCTGTAGTAGAAGTCTGCGGCCAAGAGAAGATATTCTTG
ACAACTCAGTCATCTTATGTTAAAGAAAAAGTCTTGCGTCTAAGGGGTCAGCCTTACGTTCAAGATGAACTCAACATTCAAGATGAACGCCGCCGCGTCCAAGAG
AGGTCGACGCATCCAAAAGAAGAAGCAAGAAGAGTTAAGCTGCACCTTGCGTCCAAGGAGCAAGATGCGCTAGAACAGATGTCGAACTATGTTAAGTTTATGAAG
GATGTTTTGTCGAGAAGAAGTTTGAGAATGGAAAATGTGTTGATTAAAGTAGATAAGTTCATCTTCCCGGAGGATTTTGTTGTGCTGGATATGGAAGAATCTCTG
AAATATGCCGACCATGATTATACGCGTCATAGTATAGATATTGTGGACAGAAATGTAGCTAAGTTTAGCGAGTTAGTTTTGTCTACAGATCAGTTGAGTCAAAGC
TACCACACAGTACTATTCAACACTATAGACACTGACTGTGAAGGAAAGGTCTCCTATCAGGATCTGCTCACCCAACTTTCAAAGGAGTTAATCGAGGCGAGGGGC
GCAACCGTTGTGGTTGCCTTCAATTCAAACGGAGACAAGATGTTAGGGATGGAGGAGTTTGGAAGGTTGGTGGAGGGGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCCCCATTACGCTTGAGACCCAAGAGCAAGTTTTGTGCACTGTTGTTGGTGACGATCCGAGGCCTGAAGATGAAGGTCAGAGGTCAAGATTGGGAGATGGC
AATGGTGAAGGGAGTATCATGTGTATGCGACAAAGCCAACGAGCTAAACTTCTGCCTTTCAATCCAGAAATCAAATTAACTTTAAGAAAAGTAAGCAGGAATTTG
AAAGCAGAATTTGTCGCCAGGGAAGATCAAACAGAAGTCACTAAGACTATTAGAGATTACTTCCACTTGATCGTCCCTACATTGCGACCTAGAATAGTAAAGGCT
CCAATTAATGCAAACAACTTTGAGCTGAAACCTAGTTTAATTCAAATGGCAAAGGACAGCATATTCAGAGGGACTTCAAACGAAGATCCACATAAACATCTTCGA
TCTCTTCAAGAAATTTATGGAATGGCTTTCCTTAGCAAATTTTTCCTATCTAAGACGAGCAACTTGAGAATAAAAATTGGAACATTCAAGCAGAAAGAGGATGAG
CAATTGTTCGAGGCTTGGGAGCGTTATAAAGAATTGTTGAGAAAGTGTCCCCAACACAATTATCTTGACTGGCTTCAGATTCAGTTGTTCTATAATGGGTTGTCA
GGAACGACGAAGTCCATCCTAAATGTTGCAGCTGGTGGATCAATCTTTTCTAAAACTGTTGATGCTACTCGATCACTTTTAGAAGACATGGTTGTCACCAGTTAC
AACTGGCCATCAAAACAATCAACATCCAAAGTGGCTGAGCTCTATGAGCTTGATGAGCCTCCTTCAGGTTTTACAGATTCTAATCCTAATGCTGAGAAGAAATCA
TCTATGGAGGACTTGCTAACTGAATTTATCAAAGAGTCGAAAAATCGAACCAACTTATTAGAGAGCACTATTATGAGTCATGAGAAGACCATTCAAAATATGGAG
GAAGGAGAGCAAGCGGTAGTGGAACCTGACTTGGACGAGAGAAGAGAAAATGAGGAGTGGAATGTTCAGGAGAAGATGGAAGAGGTGAGAATGGTCGTTTGCCGA
CTCCATATGCCTACCATCTTGAGGACAAGACCGAGGGGGGAGCCGACTCCATATGCCTACCATCTTGAGGACAAGACCGAATGGGGAGCTGGGGACATAATTATA
CAAGATGGAATTCACTTATTCTCATCTTTAGGTAGAAAGACCATGTTGGACACATGGTCAAGACAGGAAATGCATGATAAAGGTGAAACGCATAGGAGGAAATCA
AGGAAGGAGATTTTTGAATTCTTGAGAGAAAGGGCAAAGAGAAGGCAGAAGAAAGAAATCCTTGCGTCTGTAGTAGAAGTCTGCGGCCAAGAGAAGATATTCTTG
ACAACTCAGTCATCTTATGTTAAAGAAAAAGTCTTGCGTCTAAGGGGTCAGCCTTACGTTCAAGATGAACTCAACATTCAAGATGAACGCCGCCGCGTCCAAGAG
AGGTCGACGCATCCAAAAGAAGAAGCAAGAAGAGTTAAGCTGCACCTTGCGTCCAAGGAGCAAGATGCGCTAGAACAGATGTCGAACTATGTTAAGTTTATGAAG
GATGTTTTGTCGAGAAGAAGTTTGAGAATGGAAAATGTGTTGATTAAAGTAGATAAGTTCATCTTCCCGGAGGATTTTGTTGTGCTGGATATGGAAGAATCTCTG
AAATATGCCGACCATGATTATACGCGTCATAGTATAGATATTGTGGACAGAAATGTAGCTAAGTTTAGCGAGTTAGTTTTGTCTACAGATCAGTTGAGTCAAAGC
TACCACACAGTACTATTCAACACTATAGACACTGACTGTGAAGGAAAGGTCTCCTATCAGGATCTGCTCACCCAACTTTCAAAGGAGTTAATCGAGGCGAGGGGC
GCAACCGTTGTGGTTGCCTTCAATTCAAACGGAGACAAGATGTTAGGGATGGAGGAGTTTGGAAGGTTGGTGGAGGGGGAATGA
Protein sequenceShow/hide protein sequence
MLPITLETQEQVLCTVVGDDPRPEDEGQRSRLGDGNGEGSIMCMRQSQRAKLLPFNPEIKLTLRKVSRNLKAEFVAREDQTEVTKTIRDYFHLIVPTLRPRIVKA
PINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYGMAFLSKFFLSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLS
GTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTSKVAELYELDEPPSGFTDSNPNAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKTIQNME
EGEQAVVEPDLDERRENEEWNVQEKMEEVRMVVCRLHMPTILRTRPRGEPTPYAYHLEDKTEWGAGDIIIQDGIHLFSSLGRKTMLDTWSRQEMHDKGETHRRKS
RKEIFEFLRERAKRRQKKEILASVVEVCGQEKIFLTTQSSYVKEKVLRLRGQPYVQDELNIQDERRRVQERSTHPKEEARRVKLHLASKEQDALEQMSNYVKFMK
DVLSRRSLRMENVLIKVDKFIFPEDFVVLDMEESLKYADHDYTRHSIDIVDRNVAKFSELVLSTDQLSQSYHTVLFNTIDTDCEGKVSYQDLLTQLSKELIEARG
ATVVVAFNSNGDKMLGMEEFGRLVEGE