; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034741 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034741
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr3:10327673..10328443
RNA-Seq ExpressionLag0034741
SyntenyLag0034741
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7115099.1 hypothetical protein RHSIM_RhsimUnG0064500 [Rhododendron simsii]7.4e-5844.79Show/hide
Query:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL
        ME ++  MI L  +N+ +WK RM+D+LY  D++ PI G++ KP+  S EDW + NRK V  IR WVD +++++++ ET+AY LW KLE +YERKT ++K 
Subjt:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL

Query:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL
         L+R+L+NL+Y++G S+  H+S+ QG++N+LS+M++VLD+ELQALL+LSSLPD+   LV +++N++ S  +++DMVK  + NEEA R   G+ +++++ L
Subjt:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL

Query:  IMNNRGRNRQRKKGHHSDRGQSASKSS--------REIQCYYCKKMGHKKVECRKWKKE
        ++ NRGR+R R  G   ++ +  SKS          EI+C++C KMGH + ECR  +KE
Subjt:  IMNNRGRNRQRKKGHHSDRGQSASKSS--------REIQCYYCKKMGHKKVECRKWKKE

KAF7129225.1 hypothetical protein RHSIM_Rhsim10G0050800 [Rhododendron simsii]1.7e-5745.56Show/hide
Query:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL
        ME ++  MI L  +N+ +WK RM+D+LY  D++ PI G++ KP+  S EDW + NRK V  IR WVD +++++++ ET+AY LW KLE +YERKT ++K 
Subjt:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL

Query:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL
         L+R+L+NL+Y++G S+  H+S+ QG++N+LS+M++VLD+ELQALL+LSSLPD+   LV +++N++ S  +++DMVK  + NEEA R   G+ +++++ L
Subjt:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL

Query:  IMNNRGRNRQRKKG----HHSDRGQS----ASKSSREIQCYYCKKMGHKKVECRKWKKE
        ++ NRGR+R R  G       DR +S     SK   EI+C++C KMGH + ECR  +KE
Subjt:  IMNNRGRNRQRKKG----HHSDRGQS----ASKSSREIQCYYCKKMGHKKVECRKWKKE

KAF7129546.1 hypothetical protein RHSIM_Rhsim10G0154200 [Rhododendron simsii]2.2e-5745.56Show/hide
Query:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL
        ME ++  MI L  +N+ +WK RM+D+LY  D++ PI G++ KP+  S EDW + NRK V  IR WVD +++++++ ET+AY LW KLE +YERKT ++K 
Subjt:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL

Query:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL
         L+R+L+NL+Y++G S+  H+S+ QG++N+LS+M++VLD+ELQALL+LSSLPD+   LV +++N++ S  +++DMVK  + NEEA R   G+ +++++ L
Subjt:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL

Query:  IMNNRGRNRQRKKG----HHSDRGQS----ASKSSREIQCYYCKKMGHKKVECRKWKKE
        ++ NRGR+R R  G       DR +S     SK   EI+C++C KMGH + ECR  +KE
Subjt:  IMNNRGRNRQRKKG----HHSDRGQS----ASKSSREIQCYYCKKMGHKKVECRKWKKE

KAF7143526.1 hypothetical protein RHSIM_Rhsim05G0092400 [Rhododendron simsii]1.7e-6250.19Show/hide
Query:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL
        ME    RMIS NG+NW  WKI+M+DLLYC D+H P+ G+ +KP  M +EDW  LNRK VG IR W+DD++++++S ETSAY LWKKLE LY+RK+  +K 
Subjt:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL

Query:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL
        FL +KL+NL+Y+EG S+  HL+EM  I+N+L+SM+IV DDELQ L++LSSLP+    LV +++N++    +S+  V SSLLNEE  R +  S    S+ L
Subjt:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL

Query:  IMNNRGRNR-------QRKKGHHSDRGQS--ASKSSREIQCYYCKKMGHKKVECRKWKKEQRN
        ++N RGR R        R + H S RG S   S S ++I+C+YCKK GH K EC K K ++ N
Subjt:  IMNNRGRNR-------QRKKGHHSDRGQS--ASKSSREIQCYYCKKMGHKKVECRKWKKEQRN

KAG5549868.1 hypothetical protein RHGRI_014986 [Rhododendron griersonianum]1.2e-6350.96Show/hide
Query:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL
        ME +  RMIS NG+NW  WKI+M+DLLYC D+H P+ G+++KP  M +EDW +LNRK VG IR W+DD++++++S ETSAY LWKKLE LY+RK+  +K 
Subjt:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL

Query:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL
        FL +KL+NL+++EG S+  HL+EM  I+N+L+SM+IV DDELQAL++LSSLP+    LV +++N++    +S   V SSLLNEE  R + GS    S+ L
Subjt:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL

Query:  IMNNRGRNRQRKKGHH-------SDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRN
        ++N RGR R R    H       S RG+S SK  ++I+C+YCKK GH K EC K K ++ N
Subjt:  IMNNRGRNRQRKKGHH-------SDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRN

TrEMBL top hitse value%identityAlignment
A0A4Y1QYG0 Uncharacterized protein2.6e-5645.56Show/hide
Query:  TSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVR
        ++ M+ L  SN+ +W  RM+D+LYC D++ P+     KPE  S + W +LNRKVVG IR WVD ++++++S ET AY LW KL  +YERKT ++K  ++R
Subjt:  TSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVR

Query:  KLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVLIMNN
        +L+NL+Y +G S+  HLS+ QG++N L++M++VLDDELQAL++LSSLPD+   LV SL+N++    L++D+VK S+ NEEA R   G +A ES+ L+   
Subjt:  KLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVLIMNN

Query:  RGRNRQRK-------KGHHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRNDN
        RGR   RK       KG   D  +  SK+ ++++CY+C  +GH K ECR +K+EQ   N
Subjt:  RGRNRQRK-------KGHHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRNDN

A0A4Y1RJM3 CCHC-type domain-containing protein1.1e-5444.79Show/hide
Query:  TSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVR
        ++ M+ L  SN+ +W  RM+D+LYC D++ P+     KP   S + W +LNRKVV  IR WVD ++++++S ET AY LW KL  +YERKT ++K  ++R
Subjt:  TSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVR

Query:  KLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVLIMNN
        +L+NL+Y +G S+  HLS+ QG++N L++M++VLDDELQAL++LSSLPD+   LV SL+N++    L++D+VK S+ NEEA R   G +A ES+ L+   
Subjt:  KLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVLIMNN

Query:  RGRNRQRK-------KGHHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRNDN
        RGR   RK       KG   D  +  SK+ ++++CY+C  +GH K ECR +K+EQ   N
Subjt:  RGRNRQRK-------KGHHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRNDN

A0A5C7HIC1 Uncharacterized protein8.3e-5544.14Show/hide
Query:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL
        ME + SRMI+LNGSN+ +WK +M+DLLY  D + P+  E  KPE  + ++W +L+R+V G+IR WVDDN+YN++S ET A SLW KLE+LY RKT  +KL
Subjt:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL

Query:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL
        FL++K++ L+Y +GT +  HL+  QGI+N+L+ M I  +DE+Q L +L +LPD+      S+ N++ +  +++D+ KSS+LNEE  R + GS   +S+VL
Subjt:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL

Query:  IMNNRGRNRQRKKG-HHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRND
        +   RGR++ + +G  + DR +S S     ++CY+C + GH K  CR+ K++ +N+
Subjt:  IMNNRGRNRQRKKG-HHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRND

A0A5C7IZ12 CCHC-type domain-containing protein7.0e-5444.31Show/hide
Query:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL
        ME + SRMI+LNGSN+ +WK +M+DLLY  D + P+  E  KPE  + ++W +L+R+V G+IR WVDDN+YN++S ET A SLW KLE+LY RKT  +KL
Subjt:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL

Query:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL
        FL++K++ L+Y +GT +  HL+  QGI+N+L+ M I  +DE+Q L +L +LPD+      S+ N++ +  +++D+ KSS+LNEE  R + GS   +S+VL
Subjt:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL

Query:  IMNNRGRNRQRKKGHHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRND
        +   RGR++ R    + DR +S S     ++CY+C + GH K  CR+ K++ +N+
Subjt:  IMNNRGRNRQRKKGHHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRND

A0A5J5B7H2 CCHC-type domain-containing protein4.4e-5645.17Show/hide
Query:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL
        ME   S M  LN  NW +WK +M+D++YC D++ PI G+ +KP+ M  E W  L+RK +G+IR W+DD++++++SNET A  LWKKLE  YE+KT  +K 
Subjt:  MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKL

Query:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL
        FL+RKL+N++++EG S+  HL+E Q ++N+L++M++V++DELQA L+LSSLPD+   LV +++N++   KLS+  V SSL NEE  R   G+    +  L
Subjt:  FLVRKLINLRYEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVL

Query:  IMNNRGRNRQR-KKGHHSDRG--QSASKSSREIQCYYCKKMGHKKVECRKWKKEQRNDN
        +  NR R++    KGH   +G  QS  KS+   +CY+C K GH K  C  WK+EQ+ +N
Subjt:  IMNNRGRNRQR-KKGHHSDRG--QSASKSSREIQCYYCKKMGHKKVECRKWKKEQRNDN

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-0722.57Show/hide
Query:  NGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVRKLINLRY
        +G  + +WK R++ LL   D+   +  +   P  +  + W    R     I  ++ D+  N+ +++ +A  + + L+ +YERK+   +L L ++L++L+ 
Subjt:  NGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVRKLINLRY

Query:  EEGTSMGSHLSEMQGIMNRL--SSMQIVLDDELQALLV-LSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEE---------AMRIALGSLAIESDV
            S+ SH      +++ L  +  +I   D++  LL+ L S  D  +  +E+L+     E L++  VK+ LL++E           +  + ++   ++ 
Subjt:  EEGTSMGSHLSEMQGIMNRL--SSMQIVLDDELQALLV-LSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEE---------AMRIALGSLAIESDV

Query:  LIMNNRGRNRQRKKGHHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRNDN
           NN  +NR  K      +      S  +++C++C + GH K +C  +K+   N N
Subjt:  LIMNNRGRNRQRKKGHHSDRGQSASKSSREIQCYYCKKMGHKKVECRKWKKEQRNDN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.3e-2835.55Show/hide
Query:  NGSN-WQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVRKLINLR
        NG N +  W+ RM+DLL    +H  +  +S KP+TM  EDW  L+ +    IRL + D++ N I +E +A  +W +LE LY  KT  +KL+L ++L  L 
Subjt:  NGSN-WQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVRKLINLR

Query:  YEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVLIMNNRGRNRQ
          EGT+  SHL+   G++ +L+++ + +++E +A+L+L+SLP +   L  ++ +  G   + +  V S+LL  E MR    +   +   LI   RGR+ Q
Subjt:  YEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVLIMNNRGRNRQ

Query:  RKK---GHHSDRGQSASKS-SREIQCYYCKKMGHKKVEC---RKWKKE---QRNDN
        R     G    RG+S ++S SR   CY C + GH K +C   RK K E   Q+ND+
Subjt:  RKK---GHHSDRGQSASKS-SREIQCYYCKKMGHKKVEC---RKWKKE---QRNDN

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein8.9e-0927.27Show/hide
Query:  LNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVRKLINL
        ++G+++   +++++D LY   +H P+     K ETMS++DW++L R+V+  IRL +  N+ + ++ E S   L K L ++Y++ +  + +    + I++
Subjt:  LNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVRKLINL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTCGACACAAGTAGAATGATCAGTTTGAATGGCTCAAATTGGCAGTTATGGAAGATTAGAATGAAAGATCTTCTTTACTGTAACGATATGCATGCTCCTATTCT
GGGTGAGTCAAGCAAACCAGAAACTATGAGTAAAGAAGATTGGGACTTGTTGAATAGAAAAGTTGTTGGGCATATACGTTTGTGGGTGGATGACAATCTGTACAACTACA
TCTCAAATGAGACATCGGCATATTCCTTGTGGAAGAAATTGGAAGAGTTGTATGAGAGAAAAACGAACGAGGATAAACTTTTCTTGGTAAGAAAGCTTATCAACCTAAGA
TACGAGGAGGGTACTTCAATGGGAAGTCATTTGAGTGAGATGCAAGGCATAATGAATCGGCTTTCATCTATGCAGATAGTTTTAGATGATGAGTTGCAGGCGTTACTGGT
TCTTAGTTCTTTGCCTGACAATAGGGTAAAGTTGGTAGAATCGTTGAATAATAACTCTGGTAGTGAAAAGTTGAGTGTAGATATGGTCAAAAGTAGTCTGTTGAATGAAG
AAGCAATGAGGATAGCATTGGGTTCTTTAGCTATAGAGTCAGATGTTCTGATCATGAATAATCGAGGGAGAAATCGGCAACGAAAGAAGGGGCATCATAGTGATCGTGGT
CAATCAGCAAGCAAAAGCAGTAGAGAAATTCAATGCTATTATTGCAAGAAGATGGGCCACAAGAAAGTTGAATGTAGAAAATGGAAAAAGGAGCAACGAAATGATAATTA
G
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTCGACACAAGTAGAATGATCAGTTTGAATGGCTCAAATTGGCAGTTATGGAAGATTAGAATGAAAGATCTTCTTTACTGTAACGATATGCATGCTCCTATTCT
GGGTGAGTCAAGCAAACCAGAAACTATGAGTAAAGAAGATTGGGACTTGTTGAATAGAAAAGTTGTTGGGCATATACGTTTGTGGGTGGATGACAATCTGTACAACTACA
TCTCAAATGAGACATCGGCATATTCCTTGTGGAAGAAATTGGAAGAGTTGTATGAGAGAAAAACGAACGAGGATAAACTTTTCTTGGTAAGAAAGCTTATCAACCTAAGA
TACGAGGAGGGTACTTCAATGGGAAGTCATTTGAGTGAGATGCAAGGCATAATGAATCGGCTTTCATCTATGCAGATAGTTTTAGATGATGAGTTGCAGGCGTTACTGGT
TCTTAGTTCTTTGCCTGACAATAGGGTAAAGTTGGTAGAATCGTTGAATAATAACTCTGGTAGTGAAAAGTTGAGTGTAGATATGGTCAAAAGTAGTCTGTTGAATGAAG
AAGCAATGAGGATAGCATTGGGTTCTTTAGCTATAGAGTCAGATGTTCTGATCATGAATAATCGAGGGAGAAATCGGCAACGAAAGAAGGGGCATCATAGTGATCGTGGT
CAATCAGCAAGCAAAAGCAGTAGAGAAATTCAATGCTATTATTGCAAGAAGATGGGCCACAAGAAAGTTGAATGTAGAAAATGGAAAAAGGAGCAACGAAATGATAATTA
G
Protein sequenceShow/hide protein sequence
MEFDTSRMISLNGSNWQLWKIRMKDLLYCNDMHAPILGESSKPETMSKEDWDLLNRKVVGHIRLWVDDNLYNYISNETSAYSLWKKLEELYERKTNEDKLFLVRKLINLR
YEEGTSMGSHLSEMQGIMNRLSSMQIVLDDELQALLVLSSLPDNRVKLVESLNNNSGSEKLSVDMVKSSLLNEEAMRIALGSLAIESDVLIMNNRGRNRQRKKGHHSDRG
QSASKSSREIQCYYCKKMGHKKVECRKWKKEQRNDN