CuGenDBv2

Lag0039160 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039160
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr2:37586759..37588358
RNA-Seq Expression: Lag0039160
Synteny: Lag0039160
Gene Ontology terms:
GO:0010467 - gene expression (biological process)
GO:0034645 - cellular macromolecule biosynthetic process (biological process)
GO:0044267 - cellular protein metabolic process (biological process)
GO:0044271 - cellular nitrogen compound biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
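
The genome location field above is a `seqid:start..end` string. A minimal sketch of splitting it into its parts follows. It assumes the coordinates are 1-based and inclusive, which this page does not state; `GenomeLocation` and `parse_location` are hypothetical names used for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Sequence name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'chr2:37586759..37588358'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("chr2:37586759..37588358")
print(loc.seqid, loc.start, loc.end, loc.length)
# prints: chr2 37586759 37588358 1600
```

Under the inclusive-coordinate assumption the gene spans 1600 bp on chr2.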


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAE8717380.1 hypothetical protein F3Y22_tig00110050pilonHSYRG00143 [Hibiscus syriacus]; e-value: 4.5e-77; % identity: 61.37
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P    +   EG   +    SS    KS MS+E+WEE+D+RAAS IRL LAKN+L NV    + K
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK

Query:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE
        ELWEKLE MYQA+S+SNRLYLKE+F+ L+MEE TKIS HLS LNGI+SELE I V+I+DEDKALRLI SLP+SYEHM+ +LMYGKE +NF++VTSKL+SE
Subjt:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE

Query:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDA-DVVSLVRGDSKFL
        ERRLK+    S E  AL V  N KK K S +   CW CG  GH+KKDC N  A S+ GS SDA +VV     D +F+
Subjt:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDA-DVVSLVRGDSKFL

KAF5758504.1 putative RNA-directed DNA polymerase [Helianthus annuus]; e-value: 1.2e-82; % identity: 62.68
Query:  SPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGIL
        SP++  VEK+DGR+NFGLWQVQVKDVLIQSGLHKAL+G+P+  +S+   G               S   DE+WE++DLRAASAIRL LAKN+L NVHGI 
Subjt:  SPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGIL

Query:  TAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKL
        TAK+LWEKLE +YQ + ISNRLYLKEQF+TLRM+  TKIS HLSVLN I+SELE I VK+EDEDKALRLILSL +SYEHMKPILMYGKETL +ADVT KL
Subjt:  TAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKL

Query:  LSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKGCWECGHSGHMKKDCPNRAGSSKGSGSDADVVSLVRGDSKF
        LSEE+RL S G TS E + L+  N  KKK   +   CW+CG SGH+K++CP  A S+  S   A+ V++V GD  F
Subjt:  LSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKGCWECGHSGHMKKDCPNRAGSSKGSGSDADVVSLVRGDSKF

KAF5765959.1 putative RNA-directed DNA polymerase [Helianthus annuus]; e-value: 5.4e-83; % identity: 62.68
Query:  SPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGIL
        SP++ DVEK+DGR+NFGLWQVQVKDVLIQSGLHKAL+G+P+  +S+   G               S   DE+WE++DLRAASAIRL LAKN+L NVHGI 
Subjt:  SPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGIL

Query:  TAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKL
        TAK+LWEKLE +YQ + I NRLYLKEQF+TLRM+  TKIS HLSVLN I+SELE I VK+EDEDKALRLILSL +SYEHMKPILMYGKETL +ADVT KL
Subjt:  TAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKL

Query:  LSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKGCWECGHSGHMKKDCPNRAGSSKGSGSDADVVSLVRGDSKF
        LSEE+RL S G TS E + L+  N  KKK   +   CW+CG SGH+K++CP  A S+  S   A+ V++V GD  F
Subjt:  LSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKGCWECGHSGHMKKDCPNRAGSSKGSGSDADVVSLVRGDSKF

QHN81458.1 Retrovirus-related Pol polyprotein [Arachis hypogaea]; e-value: 5.3e-78; % identity: 58.76
Query:  MSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTN
        MS + S VK+++EKFDGR+NFGLWQ+QVKDVLIQSGLHKALK R                          S M DE+WEE+DLRAASAIRL LAKN+L N
Subjt:  MSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTN

Query:  VHGILTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFAD
        V GI TAKELW+KLE +YQ++ ISNRL LKEQF+ LRM+   KIS HLS +NGI+SELE I VKI+DEDKALRLILSLP+SYE++KP+LMYGKETLNF +
Subjt:  VHGILTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFAD

Query:  VTSKLLSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKGCWECGHSGHMKKDCPNRAGSSKGSGSDADVVSL
        V SKL++EERR+K+EG TS + + +  S+   K    ++  CW+CG SGH+K++CP  A S K S SD   ++L
Subjt:  VTSKLLSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKGCWECGHSGHMKKDCPNRAGSSKGSGSDADVVSL

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia]; e-value: 7.3e-112; % identity: 80.58
Query:  NQINGDCFVGLARSEAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASE--SGEGGPVESNGGSSRGSKKSSMSDEDWEEM
        N I G+C       EA+MS FMSPVK+DVEKFDG +NFGLWQVQVKDVLIQS LHKALKGRPS+GASE  S +GGP+ES+GGSSRGSKKSSMS EDWEEM
Subjt:  NQINGDCFVGLARSEAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASE--SGEGGPVESNGGSSRGSKKSSMSDEDWEEM

Query:  DLRAASAIRLNLAKNILTNVHGILTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTS
        DLRAASAIR +LAKNIL NVH I TAKELWEKLEA+YQA+ ISNRLYLKEQF+TL+MEE  KIS HLS LN II ELE IEVKI+DEDKALRLILSLP S
Subjt:  DLRAASAIRLNLAKNILTNVHGILTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTS

Query:  YEHMKPILMYGKETLNFADVTSKLLSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKG-CWECGHSGHMKKDCPNR
        YEHMKPILMYGK+TLNFA+VTSKLLSEERRLKSEGRTS EDSALV SNWKKKK+S+Q K  CW CG SGHMKKDCPNR
Subjt:  YEHMKPILMYGKETLNFADVTSKLLSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKG-CWECGHSGHMKKDCPNR

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A6A2YS90 Transcription initiation factor IIA subunit 2; e-value: 1.8e-76; % identity: 61.01
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P    +   EG   +    SS    KS MS+E+WEE+D+RAAS IRL LAKN+L NV    + K
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK

Query:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE
        ELWEKLE MYQA+S+SNRLYLKE+F+ L+MEE TKIS HLS LNGI+SELE I V+I+DEDKALRLI SL +SYEHM+ +LMYGKE +NF++VTSKL+SE
Subjt:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE

Query:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDA-DVVSLVRGDSKFL
        ERRLK+    S E  AL V  N KK K S +   CW CG  GH+KKDC N  A S+ GS SDA +VV     D +F+
Subjt:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDA-DVVSLVRGDSKFL

A0A6A3BK59 CCHC-type domain-containing protein; e-value: 2.2e-77; % identity: 61.37
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P    +   EG   +    SS    KS MS+E+WEE+D+RAAS IRL LAKN+L NV    + K
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK

Query:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE
        ELWEKLE MYQA+S+SNRLYLKE+F+ L+MEE TKIS HLS LNGI+SELE I V+I+DEDKALRLI SLP+SYEHM+ +LMYGKE +NF++VTSKL+SE
Subjt:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE

Query:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDA-DVVSLVRGDSKFL
        ERRLK+    S E  AL V  N KK K S +   CW CG  GH+KKDC N  A S+ GS SDA +VV     D +F+
Subjt:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDA-DVVSLVRGDSKFL

A0A6A3CWI3 CCHC-type domain-containing protein; e-value: 1.1e-76; % identity: 61.19
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P    +   EG   +    SS    KS MS+E+WEE+D+RAAS IRL LAKN+L NV    + K
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK

Query:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE
        ELWEKLE MYQA+S+SNRLYLKE+F+ L+MEE TKIS HLS LNGI+SELE I V I+DEDKALRLI SLP+SYEHM+ +LMYGKE +NF++VTSKL+SE
Subjt:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE

Query:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPNRAGSSKGSGSDADVVSLV
        ERRLK+    S E  AL V  N KK K S +   CW CG  GH+KKDC N  G++  +GS +D  ++V
Subjt:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPNRAGSSKGSGSDADVVSLV

A0A6A3DA47 CCHC-type domain-containing protein; e-value: 1.1e-76; % identity: 61.59
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P    +   EG   +    SS    KS MS+E+WEE+D+RAAS IRL LAKN+L NV    + K
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAK

Query:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE
        ELWEKLE MYQA+S+SNRLYLKE+F+ L+MEE TKIS HLS LNGI+SELE I V+I+DEDKALRLI SLP+SYEHM+ +LMYGKE +NF++VTSKL+SE
Subjt:  ELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSE

Query:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDA-DVVSLVRGDSKF
        ERRLK+    S E  AL V  N KK K S +   CW CG  GH+KKDC N  A S+ GS SDA +VV     D +F
Subjt:  ERRLKSEGRTSLEDSAL-VASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDA-DVVSLVRGDSKF

A0A6J1CG82 uncharacterized protein LOC111010521; e-value: 3.5e-112; % identity: 80.58
Query:  NQINGDCFVGLARSEAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASE--SGEGGPVESNGGSSRGSKKSSMSDEDWEEM
        N I G+C       EA+MS FMSPVK+DVEKFDG +NFGLWQVQVKDVLIQS LHKALKGRPS+GASE  S +GGP+ES+GGSSRGSKKSSMS EDWEEM
Subjt:  NQINGDCFVGLARSEAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASE--SGEGGPVESNGGSSRGSKKSSMSDEDWEEM

Query:  DLRAASAIRLNLAKNILTNVHGILTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTS
        DLRAASAIR +LAKNIL NVH I TAKELWEKLEA+YQA+ ISNRLYLKEQF+TL+MEE  KIS HLS LN II ELE IEVKI+DEDKALRLILSLP S
Subjt:  DLRAASAIRLNLAKNILTNVHGILTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTS

Query:  YEHMKPILMYGKETLNFADVTSKLLSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKG-CWECGHSGHMKKDCPNR
        YEHMKPILMYGK+TLNFA+VTSKLLSEERRLKSEGRTS EDSALV SNWKKKK+S+Q K  CW CG SGHMKKDCPNR
Subjt:  YEHMKPILMYGKETLNFADVTSKLLSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKG-CWECGHSGHMKKDCPNR

SwissProt top hits (columns: e-value, % identity, alignment)
P04146 Copia protein; e-value: 6.6e-15; % identity: 24.26
Query:  MSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKG-RPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHG
        M   K +++ FDG   + +W+ +++ +L +  + K + G  P++                           D+ W++ +  A S I   L+ + L     
Subjt:  MSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKG-RPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHG

Query:  ILTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEH-MKPILMYGKETLNFADVT
         +TA+++ E L+A+Y+ +S++++L L+++  +L++     +  H  + + +ISEL     KIE+ DK   L+++LP+ Y+  +  I    +E L  A V 
Subjt:  ILTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEH-MKPILMYGKETLNFADVT

Query:  SKLLSEERRLKSEGRTSLED--SALVASN-------------WKKKK----ESMQNKGCWECGHSGHMKKDC
        ++LL +E ++K++   + +   +A+V +N              K KK     S     C  CG  GH+KKDC
Subjt:  SKLLSEERRLKSEGRTSLED--SALVASN-------------WKKKK----ESMQNKGCWECGHSGHMKKDC

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value: 1.3e-42; % identity: 37.46
Query:  MSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGI
        MS VK +V KF+G   F  WQ +++D+LIQ GLHK L                V+S        K  +M  EDW ++D RAASAIRL+L+ +++ N+   
Subjt:  MSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGI

Query:  LTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSK
         TA+ +W +LE++Y +++++N+LYLK+Q Y L M E T    HL+V NG+I++L  + VKIE+EDKA+ L+ SLP+SY+++   +++GK T+   DVTS 
Subjt:  LTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISYHLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSK

Query:  LLSEER----------------RLKSEGRTSLEDSALVASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDAD
        LL  E+                R +S  R+S       A    K +   + + C+ C   GH K+DCPN R G  + SG   D
Subjt:  LLSEER----------------RLKSEGRTSLEDSALVASNWKKKKESMQNKGCWECGHSGHMKKDCPN-RAGSSKGSGSDAD

Arabidopsis top hits (columns: e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTTTCCCAAAGAAGAAGACGTCGTGAAACCTACAAAATTGATGCTGAATTTTGGAACGGTGTTCGACATAATTTGGCCGAGATACGTCAAGCCATGAATGATTACAT
TGAAATATGCAAAGAGATGTATTTACGCGACCAACCCAAGGAGGCTCAAGCTGCACAATCTAATCAAATTAATGGAGATTGTTTTGTTGGATTAGCTAGGAGTGAAGCAG
AAATGTCAAGCTTCATGAGTCCAGTGAAGATGGACGTGGAGAAATTTGATGGAAGGATGAACTTCGGCTTGTGGCAAGTGCAAGTAAAGGATGTGCTGATACAATCTGGG
TTACACAAGGCTTTGAAGGGAAGACCAAGCGATGGTGCTTCTGAAAGCGGTGAAGGTGGTCCAGTGGAGTCCAATGGCGGTTCCAGCAGAGGTTCGAAGAAGTCCAGCAT
GAGTGATGAAGATTGGGAGGAAATGGATTTGAGAGCTGCAAGTGCAATCAGATTAAATTTGGCTAAGAACATTCTTACAAATGTGCATGGAATTTTGACAGCCAAAGAGC
TTTGGGAGAAGCTTGAAGCAATGTATCAGGCAAGGAGCATCTCGAATCGGTTGTACCTGAAGGAGCAGTTTTACACGCTACGAATGGAGGAATGTACGAAAATCTCATAT
CATCTGAGTGTTCTCAATGGCATCATTTCGGAGCTGGAGGTGATCGAAGTTAAGATAGAGGATGAGGATAAGGCACTTAGGCTTATCTTGTCACTTCCAACTTCTTATGA
ACACATGAAGCCAATCTTGATGTACGGGAAGGAAACTTTAAATTTTGCTGATGTTACTAGTAAACTCTTATCAGAAGAAAGAAGGCTGAAGAGTGAAGGGCGTACTTCAC
TGGAGGATTCAGCACTAGTAGCTAGCAATTGGAAGAAGAAGAAAGAGTCCATGCAGAATAAAGGTTGCTGGGAATGCGGACATTCTGGACACATGAAAAAGGATTGTCCT
AACAGAGCAGGTTCGTCAAAGGGCTCTGGGTCGGATGCTGACGTTGTCTCTCTCGTCAGGGGAGACAGTAAATTCCTTTGA
mRNA sequence
ATGTTTTCCCAAAGAAGAAGACGTCGTGAAACCTACAAAATTGATGCTGAATTTTGGAACGGTGTTCGACATAATTTGGCCGAGATACGTCAAGCCATGAATGATTACAT
TGAAATATGCAAAGAGATGTATTTACGCGACCAACCCAAGGAGGCTCAAGCTGCACAATCTAATCAAATTAATGGAGATTGTTTTGTTGGATTAGCTAGGAGTGAAGCAG
AAATGTCAAGCTTCATGAGTCCAGTGAAGATGGACGTGGAGAAATTTGATGGAAGGATGAACTTCGGCTTGTGGCAAGTGCAAGTAAAGGATGTGCTGATACAATCTGGG
TTACACAAGGCTTTGAAGGGAAGACCAAGCGATGGTGCTTCTGAAAGCGGTGAAGGTGGTCCAGTGGAGTCCAATGGCGGTTCCAGCAGAGGTTCGAAGAAGTCCAGCAT
GAGTGATGAAGATTGGGAGGAAATGGATTTGAGAGCTGCAAGTGCAATCAGATTAAATTTGGCTAAGAACATTCTTACAAATGTGCATGGAATTTTGACAGCCAAAGAGC
TTTGGGAGAAGCTTGAAGCAATGTATCAGGCAAGGAGCATCTCGAATCGGTTGTACCTGAAGGAGCAGTTTTACACGCTACGAATGGAGGAATGTACGAAAATCTCATAT
CATCTGAGTGTTCTCAATGGCATCATTTCGGAGCTGGAGGTGATCGAAGTTAAGATAGAGGATGAGGATAAGGCACTTAGGCTTATCTTGTCACTTCCAACTTCTTATGA
ACACATGAAGCCAATCTTGATGTACGGGAAGGAAACTTTAAATTTTGCTGATGTTACTAGTAAACTCTTATCAGAAGAAAGAAGGCTGAAGAGTGAAGGGCGTACTTCAC
TGGAGGATTCAGCACTAGTAGCTAGCAATTGGAAGAAGAAGAAAGAGTCCATGCAGAATAAAGGTTGCTGGGAATGCGGACATTCTGGACACATGAAAAAGGATTGTCCT
AACAGAGCAGGTTCGTCAAAGGGCTCTGGGTCGGATGCTGACGTTGTCTCTCTCGTCAGGGGAGACAGTAAATTCCTTTGA
Protein sequence
MFSQRRRRRETYKIDAEFWNGVRHNLAEIRQAMNDYIEICKEMYLRDQPKEAQAAQSNQINGDCFVGLARSEAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSG
LHKALKGRPSDGASESGEGGPVESNGGSSRGSKKSSMSDEDWEEMDLRAASAIRLNLAKNILTNVHGILTAKELWEKLEAMYQARSISNRLYLKEQFYTLRMEECTKISY
HLSVLNGIISELEVIEVKIEDEDKALRLILSLPTSYEHMKPILMYGKETLNFADVTSKLLSEERRLKSEGRTSLEDSALVASNWKKKKESMQNKGCWECGHSGHMKKDCP
NRAGSSKGSGSDADVVSLVRGDSKFL