; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032663 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032663
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr11:35788087..35789727
RNA-Seq ExpressionLag0032663
SyntenyLag0032663
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]2.7e-5743.78Show/hide
Query:  MALDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIP-TGLTVEKLGPNLFLFSLRSEEEQARILRQGP
        MA  ++L  W+NF LT++E+   VD+D  A E T + L  SLI KLLS R I+  V++   K AW +     +V+ +G N+FLF+     ++ RILR GP
Subjt:  MALDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIP-TGLTVEKLGPNLFLFSLRSEEEQARILRQGP

Query:  WLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCW
        W FD+ L+++  P+ + KP  M+F+ V+ WVHF++L +   N +MA RLGNAIG FE+ ++   N  W   LRVRV  D+ KP+ R IK+NL+ P+G CW
Subjt:  WLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCW

Query:  TPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQSKQYGMWLQYTG
         PI+YE+LPD   +CG + H +KDCS       S S++ QYG WL++ G
Subjt:  TPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQSKQYGMWLQYTG

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]4.3e-3936.69Show/hide
Query:  LDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLF
        +D+I   WE+F  T DE  T V +DR    +T+ ++   ++ KL + + I+ + +R   K+ W +      E LG N+++   +S  E++R+L  GPW F
Subjt:  LDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLF

Query:  DKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWK-ESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTP
        +K LLVL+ P    +P  M F F AFW+  + +P E  +  MA  LG  +G  EE +  G + GW    +RVRV +D++KP+RR IK+   +     W P
Subjt:  DKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWK-ESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTP

Query:  IRYEKLPDLCGYCGIIGHGVKDCS--AYYLAAGSPSQSKQYGMWLQYT
        +RYEKLPD C  CG IGH  ++C   +  +   SP   +QYG WL+ T
Subjt:  IRYEKLPDLCGYCGIIGHGVKDCS--AYYLAAGSPSQSKQYGMWLQYT

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]3.9e-6446.34Show/hide
Query:  DDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLFD
        +++L+ W+ F LT++E+   +DVD  A ++  + L +SL+GKLL+ R I+ DV+ +    AW +   LTVE +G NLFLF    E +  R+++ GPW FD
Subjt:  DDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLFD

Query:  KFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTPIR
        K L+VL KP      + +EF  VAFW+H ++LPM   N +MA RLGNAIG+F + D   +   W  SLR+RV +DITKP+RR IK+N++ P+G CW PI+
Subjt:  KFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTPIR

Query:  YEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQ-SKQYGMWLQYTG
        YE+LPD C +CG+IGH   DC A YLAA   S+ + +YG WL++ G
Subjt:  YEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQ-SKQYGMWLQYTG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.7e-5135.2Show/hide
Query:  MALDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIP-TGLTVEKLGPNLFLFSLRSEEEQARILRQGP
        MA  D+L  W+NF LT++EE T +DVD  A   T   L   L+GKL   R I   VM+   + AW +      V+ LG NLFLFS     ++ +I + GP
Subjt:  MALDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIP-TGLTVEKLGPNLFLFSLRSEEEQARILRQGP

Query:  WLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCW
        W FD+ L++++KP+ ++ P+ ++F  +  WV F++LP+      MA RLGNA+G FEE D    N  W  +LRVRV LDI+KP+RR IK+NL+ P+G  W
Subjt:  WLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCW

Query:  TPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQSKQYGMWLQYTG------------RTTTIYRSPSNSPLSNSRMVVDHATALQATPIAGNTS--
         PI+YE+LPD C +CG+                S  +  QYG WL+Y G            +   + +S +NS  S++  V   +  +Q+ P  G  +  
Subjt:  TPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQSKQYGMWLQYTG------------RTTTIYRSPSNSPLSNSRMVVDHATALQATPIAGNTS--

Query:  --SPCLLNQNGGS--SGTGIGSQMTDSGTKSMDISPVPVELPNAPPAVKNNAPPPTPN
          SP       G+  S  G    + D G + +++     E+ N  P +K+ AP   P+
Subjt:  --SPCLLNQNGGS--SGTGIGSQMTDSGTKSMDISPVPVELPNAPPAVKNNAPPPTPN

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]3.8e-3535.89Show/hide
Query:  DDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFS---LIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPW
        D +L      +LT++E+     V R   E TS  +G S   L+GKLL+ R    + M+    + W    G+ V  +G NLF+F      ++ R+L  GPW
Subjt:  DDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFS---LIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPW

Query:  LFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLE--EPLGSC
         FDK LL+L +    V+P+ ++   V FWVH   LP+ L N  +   +GNA+G F + D     + W  ++R+RV LD+ KP+RR +K+ L   EP+   
Subjt:  LFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLE--EPLGSC

Query:  WTPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAA-GSPSQSKQYGMWLQ
        W   +YE+LP  C +CG +GH  ++C     +A GS   S QYG WL+
Subjt:  WTPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAA-GSPSQSKQYGMWLQ

TrEMBL top hitse value%identityAlignment
A0A2N9GWE9 Uncharacterized protein8.2e-3631.46Show/hide
Query:  DDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAW-NIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLF
        D+++  W  F+LT + EG  V +   A EV+       L+GKL + ++   + ++      W  +  G+T   +G NLF+F  R + E+ R++   PWLF
Subjt:  DDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAW-NIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLF

Query:  DKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTPI
        D  LL+L +       + ++F +  FWV FY +P+         ++G+  G  EE D     +GW   LRVR+ LDITKPI R   V     LG  W   
Subjt:  DKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTPI

Query:  RYEKLPDLCGYCGIIGHGVKDC-SAYYLAAGSPSQSKQYGMWLQYTGRTTTIYRSPSNSPLSNSRMVVDHATALQATPIAGNTSSPCLLN-----QNGGS
        +YE+LP LC +CG+IGH  +DC S     + SP   +QYG WL    R + +        +  S  V   A   + T   GN +    LN     Q GGS
Subjt:  RYEKLPDLCGYCGIIGHGVKDC-SAYYLAAGSPSQSKQYGMWLQYTGRTTTIYRSPSNSPLSNSRMVVDHATALQATPIAGNTSSPCLLN-----QNGGS

Query:  SGTGIGSQMTDSGTKSMDISP
          T +  ++ ++    ++I+P
Subjt:  SGTGIGSQMTDSGTKSMDISP

A0A6J1BSZ1 uncharacterized protein LOC1110054811.3e-5743.78Show/hide
Query:  MALDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIP-TGLTVEKLGPNLFLFSLRSEEEQARILRQGP
        MA  ++L  W+NF LT++E+   VD+D  A E T + L  SLI KLLS R I+  V++   K AW +     +V+ +G N+FLF+     ++ RILR GP
Subjt:  MALDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIP-TGLTVEKLGPNLFLFSLRSEEEQARILRQGP

Query:  WLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCW
        W FD+ L+++  P+ + KP  M+F+ V+ WVHF++L +   N +MA RLGNAIG FE+ ++   N  W   LRVRV  D+ KP+ R IK+NL+ P+G CW
Subjt:  WLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCW

Query:  TPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQSKQYGMWLQYTG
         PI+YE+LPD   +CG + H +KDCS       S S++ QYG WL++ G
Subjt:  TPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQSKQYGMWLQYTG

A0A6J1D765 uncharacterized protein LOC1110179022.1e-3936.69Show/hide
Query:  LDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLF
        +D+I   WE+F  T DE  T V +DR    +T+ ++   ++ KL + + I+ + +R   K+ W +      E LG N+++   +S  E++R+L  GPW F
Subjt:  LDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLF

Query:  DKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWK-ESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTP
        +K LLVL+ P    +P  M F F AFW+  + +P E  +  MA  LG  +G  EE +  G + GW    +RVRV +D++KP+RR IK+   +     W P
Subjt:  DKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWK-ESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTP

Query:  IRYEKLPDLCGYCGIIGHGVKDCS--AYYLAAGSPSQSKQYGMWLQYT
        +RYEKLPD C  CG IGH  ++C   +  +   SP   +QYG WL+ T
Subjt:  IRYEKLPDLCGYCGIIGHGVKDCS--AYYLAAGSPSQSKQYGMWLQYT

A0A6J1DU55 uncharacterized protein LOC1110231351.9e-6446.34Show/hide
Query:  DDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLFD
        +++L+ W+ F LT++E+   +DVD  A ++  + L +SL+GKLL+ R I+ DV+ +    AW +   LTVE +G NLFLF    E +  R+++ GPW FD
Subjt:  DDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLFD

Query:  KFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTPIR
        K L+VL KP      + +EF  VAFW+H ++LPM   N +MA RLGNAIG+F + D   +   W  SLR+RV +DITKP+RR IK+N++ P+G CW PI+
Subjt:  KFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTPIR

Query:  YEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQ-SKQYGMWLQYTG
        YE+LPD C +CG+IGH   DC A YLAA   S+ + +YG WL++ G
Subjt:  YEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQ-SKQYGMWLQYTG

A0A6J1DX30 uncharacterized protein LOC1110248748.2e-5235.2Show/hide
Query:  MALDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIP-TGLTVEKLGPNLFLFSLRSEEEQARILRQGP
        MA  D+L  W+NF LT++EE T +DVD  A   T   L   L+GKL   R I   VM+   + AW +      V+ LG NLFLFS     ++ +I + GP
Subjt:  MALDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIP-TGLTVEKLGPNLFLFSLRSEEEQARILRQGP

Query:  WLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCW
        W FD+ L++++KP+ ++ P+ ++F  +  WV F++LP+      MA RLGNA+G FEE D    N  W  +LRVRV LDI+KP+RR IK+NL+ P+G  W
Subjt:  WLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCW

Query:  TPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQSKQYGMWLQYTG------------RTTTIYRSPSNSPLSNSRMVVDHATALQATPIAGNTS--
         PI+YE+LPD C +CG+                S  +  QYG WL+Y G            +   + +S +NS  S++  V   +  +Q+ P  G  +  
Subjt:  TPIRYEKLPDLCGYCGIIGHGVKDCSAYYLAAGSPSQSKQYGMWLQYTG------------RTTTIYRSPSNSPLSNSRMVVDHATALQATPIAGNTS--

Query:  --SPCLLNQNGGS--SGTGIGSQMTDSGTKSMDISPVPVELPNAPPAVKNNAPPPTPN
          SP       G+  S  G    + D G + +++     E+ N  P +K+ AP   P+
Subjt:  --SPCLLNQNGGS--SGTGIGSQMTDSGTKSMDISPVPVELPNAPPAVKNNAPPPTPN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein6.3e-1228.28Show/hide
Query:  FLFSLRSEEEQARILRQGPWLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDIT
        F+F+L  EE    +LR+GPW F+ ++++L +     +P    F F+ FWV    +P +  N  +   +G A+G   + D     +   +  RV +  DIT
Subjt:  FLFSLRSEEEQARILRQGPWLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDIT

Query:  KPIRRCIKVNLEEPLG-SCWTPIRYEKLPDLCGYCGIIGHGVKDC
         P+R   + + +   G +     RYE+L   C  CG++ H    C
Subjt:  KPIRRCIKVNLEEPLG-SCWTPIRYEKLPDLCGYCGIIGHGVKDC

AT3G42140.1 zinc ion binding;nucleic acid binding1.9e-0825.35Show/hide
Query:  FSLRSEEEQARILRQGPWLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKP
        F  +SEE    ILR+GPW F+ ++ V+ +  ++   +  EFK + FW+    +P+    A +   +G  +G F E      NLG   S+           
Subjt:  FSLRSEEEQARILRQGPWLFDKFLLVLSKPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKP

Query:  IRRCIKVNLEEPLGSCWTPIRYEKLPDLCGYCGIIGHGVKDC
                            +YEKL + C  CG++ H   +C
Subjt:  IRRCIKVNLEEPLGSCWTPIRYEKLPDLCGYCGIIGHGVKDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTTGACGATATCCTTAGCCACTGGGAGAACTTCAACCTCACAGCTGACGAAGAAGGGACTGAGGTTGATGTCGACCGGCAAGCAGCGGAAGTCACCAGCAGATC
ATTGGGTTTCAGCCTCATTGGTAAACTCCTCTCGCCTCGTTTCATCGCCGGAGACGTAATGAGGAAAAATTTCAAAAATGCGTGGAATATCCCGACGGGCCTTACAGTGG
AGAAACTAGGGCCAAATCTCTTCCTTTTCTCTCTGAGATCAGAGGAGGAACAGGCTCGTATCCTCCGACAGGGACCCTGGCTCTTTGACAAGTTCTTACTTGTTCTCTCC
AAACCTATCCGTATGGTCAAACCAACGGCAATGGAGTTCAAGTTCGTGGCCTTTTGGGTTCATTTTTACGAACTCCCTATGGAACTATTTAATGCCTCCATGGCGGCTCG
ACTGGGAAATGCGATAGGACACTTCGAAGAATATGACAATGGTGGGCGCAATCTTGGTTGGAAGGAGAGTTTGCGTGTTCGTGTTACGCTCGATATCACGAAACCTATTC
GGCGATGTATCAAAGTAAATCTTGAGGAACCATTAGGGAGTTGCTGGACTCCAATCCGTTACGAAAAACTTCCTGACCTATGCGGATACTGTGGAATAATTGGCCATGGT
GTGAAAGATTGCAGTGCTTACTATCTTGCTGCAGGTTCACCATCTCAAAGTAAGCAATATGGCATGTGGCTTCAATACACGGGACGCACAACTACCATTTATCGATCTCC
AAGCAACAGTCCCCTCAGCAATAGTAGAATGGTTGTCGATCATGCGACTGCTCTCCAAGCCACACCGATAGCAGGTAACACAAGTTCCCCTTGCTTGCTGAACCAAAACG
GGGGGTCTTCAGGAACCGGCATAGGATCTCAGATGACGGACTCGGGCACGAAGTCCATGGACATATCGCCGGTGCCGGTAGAGTTGCCGAACGCGCCACCAGCAGTAAAA
AATAACGCGCCTCCGCCAACACCTAATGGCGGCTTTAATCCTGACTTCAATGCATTCAAGGCGGACGTTAATGCGGCTGAGGTAAATAAGGTAAAAAAGAAGCTTGAGTA
TAACGATTTTTTCGTGATTCCTAACTCACAGATCCGTGTGGATCCAATTGTGGAGAAATTAGGGGCAAAAAATCAGGAAATCAGTAACGGATCAAACCCATTAATGCGTC
AGCTAAATGGAAAGTTTCCAGAAGATTTACACCTCCCCCATTCTAACGGGCTGAAGTGTGAATTTAAAAATGGTGATGTGGTGGGCCAAATGGGCCCATTCAAATTAGGC
CAAAAACAAAATGAGATGAAGAATGAGCCTATTTCCCAAAATGGGCCGATTCACCCACAACAAGCCCAATCGTTTGCTGATCTTATTCCTAATCAGAGACAAGTTGTGCT
GGGTCTTCCTAATTCCAAGACTTGGAAACGTATGGCACGGCATAACAATATGGAATTTGGGGCTTCGTCTGGCGAAGAAGTTTACAAAAAGAGATTGGCGGAAGGTCTGC
TTAAAGGTACTAAGAAGCGTGCTCGAACCGAGGATGGGGAATCGTCTGATCATGAAGAACCAGCGGTGGAGGCTGTTGAGCAGCCCCGCCGAGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCTTGACGATATCCTTAGCCACTGGGAGAACTTCAACCTCACAGCTGACGAAGAAGGGACTGAGGTTGATGTCGACCGGCAAGCAGCGGAAGTCACCAGCAGATC
ATTGGGTTTCAGCCTCATTGGTAAACTCCTCTCGCCTCGTTTCATCGCCGGAGACGTAATGAGGAAAAATTTCAAAAATGCGTGGAATATCCCGACGGGCCTTACAGTGG
AGAAACTAGGGCCAAATCTCTTCCTTTTCTCTCTGAGATCAGAGGAGGAACAGGCTCGTATCCTCCGACAGGGACCCTGGCTCTTTGACAAGTTCTTACTTGTTCTCTCC
AAACCTATCCGTATGGTCAAACCAACGGCAATGGAGTTCAAGTTCGTGGCCTTTTGGGTTCATTTTTACGAACTCCCTATGGAACTATTTAATGCCTCCATGGCGGCTCG
ACTGGGAAATGCGATAGGACACTTCGAAGAATATGACAATGGTGGGCGCAATCTTGGTTGGAAGGAGAGTTTGCGTGTTCGTGTTACGCTCGATATCACGAAACCTATTC
GGCGATGTATCAAAGTAAATCTTGAGGAACCATTAGGGAGTTGCTGGACTCCAATCCGTTACGAAAAACTTCCTGACCTATGCGGATACTGTGGAATAATTGGCCATGGT
GTGAAAGATTGCAGTGCTTACTATCTTGCTGCAGGTTCACCATCTCAAAGTAAGCAATATGGCATGTGGCTTCAATACACGGGACGCACAACTACCATTTATCGATCTCC
AAGCAACAGTCCCCTCAGCAATAGTAGAATGGTTGTCGATCATGCGACTGCTCTCCAAGCCACACCGATAGCAGGTAACACAAGTTCCCCTTGCTTGCTGAACCAAAACG
GGGGGTCTTCAGGAACCGGCATAGGATCTCAGATGACGGACTCGGGCACGAAGTCCATGGACATATCGCCGGTGCCGGTAGAGTTGCCGAACGCGCCACCAGCAGTAAAA
AATAACGCGCCTCCGCCAACACCTAATGGCGGCTTTAATCCTGACTTCAATGCATTCAAGGCGGACGTTAATGCGGCTGAGGTAAATAAGGTAAAAAAGAAGCTTGAGTA
TAACGATTTTTTCGTGATTCCTAACTCACAGATCCGTGTGGATCCAATTGTGGAGAAATTAGGGGCAAAAAATCAGGAAATCAGTAACGGATCAAACCCATTAATGCGTC
AGCTAAATGGAAAGTTTCCAGAAGATTTACACCTCCCCCATTCTAACGGGCTGAAGTGTGAATTTAAAAATGGTGATGTGGTGGGCCAAATGGGCCCATTCAAATTAGGC
CAAAAACAAAATGAGATGAAGAATGAGCCTATTTCCCAAAATGGGCCGATTCACCCACAACAAGCCCAATCGTTTGCTGATCTTATTCCTAATCAGAGACAAGTTGTGCT
GGGTCTTCCTAATTCCAAGACTTGGAAACGTATGGCACGGCATAACAATATGGAATTTGGGGCTTCGTCTGGCGAAGAAGTTTACAAAAAGAGATTGGCGGAAGGTCTGC
TTAAAGGTACTAAGAAGCGTGCTCGAACCGAGGATGGGGAATCGTCTGATCATGAAGAACCAGCGGTGGAGGCTGTTGAGCAGCCCCGCCGAGAGCCATGA
Protein sequenceShow/hide protein sequence
MALDDILSHWENFNLTADEEGTEVDVDRQAAEVTSRSLGFSLIGKLLSPRFIAGDVMRKNFKNAWNIPTGLTVEKLGPNLFLFSLRSEEEQARILRQGPWLFDKFLLVLS
KPIRMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLGNAIGHFEEYDNGGRNLGWKESLRVRVTLDITKPIRRCIKVNLEEPLGSCWTPIRYEKLPDLCGYCGIIGHG
VKDCSAYYLAAGSPSQSKQYGMWLQYTGRTTTIYRSPSNSPLSNSRMVVDHATALQATPIAGNTSSPCLLNQNGGSSGTGIGSQMTDSGTKSMDISPVPVELPNAPPAVK
NNAPPPTPNGGFNPDFNAFKADVNAAEVNKVKKKLEYNDFFVIPNSQIRVDPIVEKLGAKNQEISNGSNPLMRQLNGKFPEDLHLPHSNGLKCEFKNGDVVGQMGPFKLG
QKQNEMKNEPISQNGPIHPQQAQSFADLIPNQRQVVLGLPNSKTWKRMARHNNMEFGASSGEEVYKKRLAEGLLKGTKKRARTEDGESSDHEEPAVEAVEQPRREP