; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021965 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021965
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr7:14946359..14950506
RNA-Seq ExpressionLag0021965
SyntenyLag0021965
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48431.1 hypothetical protein EZV62_027725 [Acer yangbiense]4.8e-4242.04Show/hide
Query:  LCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNY
        LC+ GK+LS + +N +AFR ++  +W V     IE    NI+   F S  +++R+   GPW+FD +L+VL  P+G    + L F+  +FWVHIT VP+ Y
Subjt:  LCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNY

Query:  HTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYERLPDFCFKCGRIGHSHRECSEDGE-GVGADGKFL
         T  + + LGS+IG+V EV     +D LG  MRVRV + +  PLRR +R+ V GDG     P+QYERLP FCF CG +GH   EC+  GE G+      +
Subjt:  HTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYERLPDFCFKCGRIGHSHRECSEDGE-GVGADGKFL

Query:  FGDWLRAVPFRRAVASASEEGGGNPG
        +G WL      RA A   + G GN G
Subjt:  FGDWLRAVPFRRAVASASEEGGGNPG

TXG66222.1 hypothetical protein EZV62_007497 [Acer yangbiense]9.7e-4339.26Show/hide
Query:  MGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPQ
        M L E E     + AD+       +   +VGKVLS KPVN D FR VM  +W       IE   DNIF   F +  +KR+I+S GPW+F+ +L+V+  P+
Subjt:  MGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPQ

Query:  GSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYERLPDFCFK
        G  D   + F++ EFW+ I  VP+   T  +   LG+++G+VVEV       +    +RVRV+L +  PLRR +R+ V GDG      L+YERLPDFCF+
Subjt:  GSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYERLPDFCFK

Query:  CGRIGHSHRECSEDG--EGVGADGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQGGRGAGLSGRGRGRG
        CG IGHS ++C E     G G    F+FG W+R  P  R + S     G     +GG G+ L    +G G
Subjt:  CGRIGHSHRECSEDG--EGVGADGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQGGRGAGLSGRGRGRG

TXG69172.1 hypothetical protein EZV62_004107 [Acer yangbiense]1.4e-4138.26Show/hide
Query:  DLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKS
        +LV   E + L++ E     +  +   +  + V LC+VGKVLS+K VN +AF  V+  +W+      IE   DNIFV  F++   +  I + GPW FD++
Subjt:  DLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKS

Query:  LLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYE
        L+VL  P+G+ D S L FS  E WV I  +PL       A+A+   IG V+E+P E   +  G  +RV+V +N+++PL+R +RL VK     +  PL YE
Subjt:  LLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYE

Query:  RLPDFCFKCGRIGHSHRECSEDGEGVGA--DGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQG
        RLP+FC+ CGRIGH  REC++D     A       FG W+RA    R+  +  + GG N    G
Subjt:  RLPDFCFKCGRIGHSHRECSEDGEGVGA--DGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQG

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]3.0e-6039.88Show/hide
Query:  MDDLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFD
        MD++   WE    +  E     +    P+L    V+LCVV K+ +SK ++A+A R VM SVW VH STR EP G NI+VI F S++EK R++S GPWTF+
Subjt:  MDDLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFD

Query:  KSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRLVKGDGSVLWCPLQY
        KSLLVL SP  ++ P  ++F+ C FW+ I  +P    +  MA  LG+ +G V E+ G+G + W G  +RVRV ++++ PLRR ++L   DG  +WCPL+Y
Subjt:  KSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRLVKGDGSVLWCPLQY

Query:  ERLPDFCFKCGRIGHSHRECSEDGEGVGADGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQGGRGAGLSGRGRGRG-----QVDQSEVVGVERPDVVSV
        E+LPDFC++CG+IGHS REC +  + V  +    +GDWLRA   +++V+   EE     G + GRG  ++G   GRG       +  ++ G E     +V
Subjt:  ERLPDFCFKCGRIGHSHRECSEDGEGVGADGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQGGRGAGLSGRGRGRG-----QVDQSEVVGVERPDVVSV

Query:  SDLV-ADPAVASVVVSEVGTVSQAPSGTAHTVAPDV
         + V   P   SV  +E+ T+ QA   + H V   +
Subjt:  SDLV-ADPAVASVVVSEVGTVSQAPSGTAHTVAPDV

XP_022156711.1 uncharacterized protein LOC111023555 [Momordica charantia]4.3e-5146.12Show/hide
Query:  MDDLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFD
        MDD+   WE   L + E     +    P+L    +QLC VGK+  SK +  +AF  VM  VW +H STRIE  G NI+VI F ++ EK R+ SLGPWTFD
Subjt:  MDDLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFD

Query:  KSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRLVKGDGSVLWCPLQY
        KSLL+LV    ++ P  +D S C FWV I  +     T  MA+ LG+ +G+V EV G    DW+   + VRV +N+  PLRR +++   DG  +WCPL+Y
Subjt:  KSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRLVKGDGSVLWCPLQY

Query:  ERLPDFCFKCGRIGHSHRE
        ERLPDFC+ CG +GHS RE
Subjt:  ERLPDFCFKCGRIGHSHRE

TrEMBL top hitse value%identityAlignment
A0A5C7GUN1 Uncharacterized protein2.3e-4242.04Show/hide
Query:  LCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNY
        LC+ GK+LS + +N +AFR ++  +W V     IE    NI+   F S  +++R+   GPW+FD +L+VL  P+G    + L F+  +FWVHIT VP+ Y
Subjt:  LCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNY

Query:  HTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYERLPDFCFKCGRIGHSHRECSEDGE-GVGADGKFL
         T  + + LGS+IG+V EV     +D LG  MRVRV + +  PLRR +R+ V GDG     P+QYERLP FCF CG +GH   EC+  GE G+      +
Subjt:  HTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYERLPDFCFKCGRIGHSHRECSEDGE-GVGADGKFL

Query:  FGDWLRAVPFRRAVASASEEGGGNPG
        +G WL      RA A   + G GN G
Subjt:  FGDWLRAVPFRRAVASASEEGGGNPG

A0A5C7IBW4 CCHC-type domain-containing protein4.7e-4339.26Show/hide
Query:  MGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPQ
        M L E E     + AD+       +   +VGKVLS KPVN D FR VM  +W       IE   DNIF   F +  +KR+I+S GPW+F+ +L+V+  P+
Subjt:  MGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPQ

Query:  GSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYERLPDFCFK
        G  D   + F++ EFW+ I  VP+   T  +   LG+++G+VVEV       +    +RVRV+L +  PLRR +R+ V GDG      L+YERLPDFCF+
Subjt:  GSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYERLPDFCFK

Query:  CGRIGHSHRECSEDG--EGVGADGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQGGRGAGLSGRGRGRG
        CG IGHS ++C E     G G    F+FG W+R  P  R + S     G     +GG G+ L    +G G
Subjt:  CGRIGHSHRECSEDG--EGVGADGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQGGRGAGLSGRGRGRG

A0A5C7IKP2 CCHC-type domain-containing protein6.8e-4238.26Show/hide
Query:  DLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKS
        +LV   E + L++ E     +  +   +  + V LC+VGKVLS+K VN +AF  V+  +W+      IE   DNIFV  F++   +  I + GPW FD++
Subjt:  DLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFDKS

Query:  LLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYE
        L+VL  P+G+ D S L FS  E WV I  +PL       A+A+   IG V+E+P E   +  G  +RV+V +N+++PL+R +RL VK     +  PL YE
Subjt:  LLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRL-VKGDGSVLWCPLQYE

Query:  RLPDFCFKCGRIGHSHRECSEDGEGVGA--DGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQG
        RLP+FC+ CGRIGH  REC++D     A       FG W+RA    R+  +  + GG N    G
Subjt:  RLPDFCFKCGRIGHSHRECSEDGEGVGA--DGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQG

A0A6J1D765 uncharacterized protein LOC1110179021.5e-6039.88Show/hide
Query:  MDDLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFD
        MD++   WE    +  E     +    P+L    V+LCVV K+ +SK ++A+A R VM SVW VH STR EP G NI+VI F S++EK R++S GPWTF+
Subjt:  MDDLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFD

Query:  KSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRLVKGDGSVLWCPLQY
        KSLLVL SP  ++ P  ++F+ C FW+ I  +P    +  MA  LG+ +G V E+ G+G + W G  +RVRV ++++ PLRR ++L   DG  +WCPL+Y
Subjt:  KSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRLVKGDGSVLWCPLQY

Query:  ERLPDFCFKCGRIGHSHRECSEDGEGVGADGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQGGRGAGLSGRGRGRG-----QVDQSEVVGVERPDVVSV
        E+LPDFC++CG+IGHS REC +  + V  +    +GDWLRA   +++V+   EE     G + GRG  ++G   GRG       +  ++ G E     +V
Subjt:  ERLPDFCFKCGRIGHSHRECSEDGEGVGADGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQGGRGAGLSGRGRGRG-----QVDQSEVVGVERPDVVSV

Query:  SDLV-ADPAVASVVVSEVGTVSQAPSGTAHTVAPDV
         + V   P   SV  +E+ T+ QA   + H V   +
Subjt:  SDLV-ADPAVASVVVSEVGTVSQAPSGTAHTVAPDV

A0A6J1DVS4 uncharacterized protein LOC1110235552.1e-5146.12Show/hide
Query:  MDDLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFD
        MDD+   WE   L + E     +    P+L    +QLC VGK+  SK +  +AF  VM  VW +H STRIE  G NI+VI F ++ EK R+ SLGPWTFD
Subjt:  MDDLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMSLGPWTFD

Query:  KSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRLVKGDGSVLWCPLQY
        KSLL+LV    ++ P  +D S C FWV I  +     T  MA+ LG+ +G+V EV G    DW+   + VRV +N+  PLRR +++   DG  +WCPL+Y
Subjt:  KSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRLVKGDGSVLWCPLQY

Query:  ERLPDFCFKCGRIGHSHRE
        ERLPDFC+ CG +GHS RE
Subjt:  ERLPDFCFKCGRIGHSHRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G22300.1 signal responsive 11.5e-0469.23Show/hide
Query:  DSGRIPFYVSCRNRLSYSEVREFEYR
        ++GR+PFYV+C NRL+ SEVREFEY+
Subjt:  DSGRIPFYVSCRNRLSYSEVREFEYR

AT2G22300.2 signal responsive 11.5e-0469.23Show/hide
Query:  DSGRIPFYVSCRNRLSYSEVREFEYR
        ++GR+PFYV+C NRL+ SEVREFEY+
Subjt:  DSGRIPFYVSCRNRLSYSEVREFEYR

AT3G42140.1 zinc ion binding;nucleic acid binding1.8e-0726.67Show/hide
Query:  FHSVTEKRRIMSLGPWTFDKSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLR
        F S      I+  GPW+F+  + V+   + +   S  +F R  FW+ I  +PL + TA +  ++G  +G  +E                         L 
Subjt:  FHSVTEKRRIMSLGPWTFDKSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLR

Query:  RVVRLVKGDGSVLWCPLQYERLPDFCFKCGRIGHSHRECSEDG-EGVGAD
        R V ++K          QYE+L +FC  CG + H   EC   G +G  AD
Subjt:  RVVRLVKGDGSVLWCPLQYERLPDFCFKCGRIGHSHRECSEDG-EGVGAD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACATATAAAGTTTAACAATGAAAACCTTTCATTTTCTCTTGGAGAACTCAGAAAACTTGGCAGTTTTAGCCGATGGATGGATAAGGAAATTGGAAGAGATTGTGA
TGATTCTTTGATGACTTTGGACTCTGGGCGTATTCCCTTCTATGTTTCATGCCGTAATAGGCTATCCTACAGTGAAGTGAGAGAGTTTGAATATCGTGAAAAGCCTCCAA
CCCTTTCTGCAGCTAATGCTATCAAGTGTGCACTAGAAGACGAAGACAATGTCAAAGAAAATGTTAATGTTGATGAGATTATTCACACTGCAGATGTTGTACCATCACAA
TTGACGGAAGATGAACTCCTATCCCTGAAGGCCTCTCTCGCTGTTGTCCGTGTATGCTGCTGCCTTAATCCATGCGGCTTTCCGTGCTTGTTCATTTCGTCGTCTCTCTG
CGTGAGAGTGGGTTCTAGGGCTTGCTGGGGTTTGCGGCTGGGGTTTGCGGCGTGGGTCTTCTTCAGCGGCGGCGTGGGTCTTCGTGCAGGTGGTGGGTCGTCTTCAGCGG
CGGCGTGGGTCCTTCGTGCAGGTGGTGGGTCGTCTTCAGCGGCGGCTGCCACGGGTTTCGACGGCGCTCCGGTCTCTCCCGAAGCTGCGGGTCGTCTTCAGCGGCGTCTC
CCACGGGGTCTTTCGGTTCGACGGCGTTTCGTTTGCAGCATTTGGTTTCGGTTCGACGGCGTGGGTCTTTCGTGCAGGGTCTGGTTAGGTTGTCGCGGCTTCGTTTGTAG
TGTTGATGCGCCGGCAACTGTGGGGTTCGGTTTGCAATTCGGGATTTCATCTGCTAAGTTGTGCTATGCTTCTTGGTGTGTGCTTCGATTGATTTTGTTGTTTCTCGGTG
TAGTTGTTTGGTGGTGTAGTTCTGTGTTTCGGTATCTGTTTGGGTTCTGTGGTTTGTGCTTGGCTGTGGTTTTGTGTAGGCTGTGGCTTGTGCGGTTTCTGTGGTTGTGT
GACTCGCTGTGTTTGGTGCGGTTAGTTGTTAGAATACTTCGCAGTGCACTTATGGATGATCTTGTGATTCAATGGGAACAGATGGGGTTATCAGAGGCAGAGACGACGGC
ATTCCCTGTGCCAGCTGACATTCCTCTTCTGGATGAATCAACTGTTCAACTGTGTGTTGTTGGCAAGGTTCTTTCTTCCAAACCGGTAAACGCTGATGCCTTTCGTCGAG
TAATGTTATCGGTTTGGAGTGTCCATCGGTCTACCCGGATTGAGCCATGGGGGGATAATATCTTCGTAATTCGGTTCCATTCCGTCACTGAAAAACGAAGAATTATGAGT
TTGGGCCCTTGGACCTTCGATAAGTCTCTGCTGGTTCTGGTGTCTCCTCAGGGGTCGGATGACCCATCTCTTCTGGATTTTTCACGTTGTGAGTTTTGGGTTCATATCAC
GAAAGTTCCGTTGAATTACCATACAGCGGCTATGGCTCGTGCTCTTGGTAGTGTGATTGGCCAGGTGGTTGAGGTGCCAGGGGAAGGCCACAATGACTGGCTTGGTTCAG
TAATGAGAGTTCGTGTTGTCCTCAACATGGCTCATCCCCTCCGTCGTGTTGTCCGGCTTGTTAAAGGGGATGGGTCTGTTCTTTGGTGCCCGTTGCAATATGAGCGATTG
CCAGACTTCTGTTTTAAATGTGGGCGTATTGGGCATTCGCATAGGGAATGTTCTGAGGATGGGGAAGGTGTGGGGGCTGATGGTAAGTTTCTGTTTGGTGATTGGTTGCG
GGCTGTTCCATTTCGGCGTGCTGTTGCTAGTGCTTCAGAAGAGGGTGGTGGGAATCCAGGTAGTCAGGGGGGGCGGGGAGCAGGTTTGTCTGGTAGAGGGAGAGGTCGGG
GCCAGGTGGACCAGTCTGAGGTGGTGGGGGTAGAGAGGCCTGATGTGGTGTCGGTGTCTGACCTGGTGGCCGATCCGGCTGTTGCTTCAGTAGTTGTGTCTGAGGTGGGT
ACGGTATCTCAGGCTCCTTCTGGTACCGCCCATACAGTCGCTCCTGATGTTGGGTTGGTTTCTGCAGAAAACGGTAAGGAGGTGGCAGACTTGGCTGTTGATTCAGTAGC
TGTTGCTAAGGTGGATACGGTTCCTTTGGCCCCTTCAGTTGGCACTAGTATAGTCACTTCTGTTGCTGGGTTGGTTTCTGCAGATAAAGGTAAGGCTGTGGCTAATGAGA
GCTTTGAGGTCGCTATGACTGATGTGCATGCTGTTCCGATCAAGAAGAGTTGGAAGCGACTGGCCAGAGGCTCTTTAAAGGACATTACTAATGAATCACCCTCCCCAGTT
ATTAGTAGGCATAAGCGACCAGCCCAGGGGGACCCGCCTGATAAGGTGGGGTCAGCTCCCAAGCGGCCGAAAGAGGGGGGATCGGGTGTTGATCTGTGTGGGGCTGATGT
GATGGATGTGGCGGTGGCTGGGTCCCAGCCCCGCCCGGGATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCACATATAAAGTTTAACAATGAAAACCTTTCATTTTCTCTTGGAGAACTCAGAAAACTTGGCAGTTTTAGCCGATGGATGGATAAGGAAATTGGAAGAGATTGTGA
TGATTCTTTGATGACTTTGGACTCTGGGCGTATTCCCTTCTATGTTTCATGCCGTAATAGGCTATCCTACAGTGAAGTGAGAGAGTTTGAATATCGTGAAAAGCCTCCAA
CCCTTTCTGCAGCTAATGCTATCAAGTGTGCACTAGAAGACGAAGACAATGTCAAAGAAAATGTTAATGTTGATGAGATTATTCACACTGCAGATGTTGTACCATCACAA
TTGACGGAAGATGAACTCCTATCCCTGAAGGCCTCTCTCGCTGTTGTCCGTGTATGCTGCTGCCTTAATCCATGCGGCTTTCCGTGCTTGTTCATTTCGTCGTCTCTCTG
CGTGAGAGTGGGTTCTAGGGCTTGCTGGGGTTTGCGGCTGGGGTTTGCGGCGTGGGTCTTCTTCAGCGGCGGCGTGGGTCTTCGTGCAGGTGGTGGGTCGTCTTCAGCGG
CGGCGTGGGTCCTTCGTGCAGGTGGTGGGTCGTCTTCAGCGGCGGCTGCCACGGGTTTCGACGGCGCTCCGGTCTCTCCCGAAGCTGCGGGTCGTCTTCAGCGGCGTCTC
CCACGGGGTCTTTCGGTTCGACGGCGTTTCGTTTGCAGCATTTGGTTTCGGTTCGACGGCGTGGGTCTTTCGTGCAGGGTCTGGTTAGGTTGTCGCGGCTTCGTTTGTAG
TGTTGATGCGCCGGCAACTGTGGGGTTCGGTTTGCAATTCGGGATTTCATCTGCTAAGTTGTGCTATGCTTCTTGGTGTGTGCTTCGATTGATTTTGTTGTTTCTCGGTG
TAGTTGTTTGGTGGTGTAGTTCTGTGTTTCGGTATCTGTTTGGGTTCTGTGGTTTGTGCTTGGCTGTGGTTTTGTGTAGGCTGTGGCTTGTGCGGTTTCTGTGGTTGTGT
GACTCGCTGTGTTTGGTGCGGTTAGTTGTTAGAATACTTCGCAGTGCACTTATGGATGATCTTGTGATTCAATGGGAACAGATGGGGTTATCAGAGGCAGAGACGACGGC
ATTCCCTGTGCCAGCTGACATTCCTCTTCTGGATGAATCAACTGTTCAACTGTGTGTTGTTGGCAAGGTTCTTTCTTCCAAACCGGTAAACGCTGATGCCTTTCGTCGAG
TAATGTTATCGGTTTGGAGTGTCCATCGGTCTACCCGGATTGAGCCATGGGGGGATAATATCTTCGTAATTCGGTTCCATTCCGTCACTGAAAAACGAAGAATTATGAGT
TTGGGCCCTTGGACCTTCGATAAGTCTCTGCTGGTTCTGGTGTCTCCTCAGGGGTCGGATGACCCATCTCTTCTGGATTTTTCACGTTGTGAGTTTTGGGTTCATATCAC
GAAAGTTCCGTTGAATTACCATACAGCGGCTATGGCTCGTGCTCTTGGTAGTGTGATTGGCCAGGTGGTTGAGGTGCCAGGGGAAGGCCACAATGACTGGCTTGGTTCAG
TAATGAGAGTTCGTGTTGTCCTCAACATGGCTCATCCCCTCCGTCGTGTTGTCCGGCTTGTTAAAGGGGATGGGTCTGTTCTTTGGTGCCCGTTGCAATATGAGCGATTG
CCAGACTTCTGTTTTAAATGTGGGCGTATTGGGCATTCGCATAGGGAATGTTCTGAGGATGGGGAAGGTGTGGGGGCTGATGGTAAGTTTCTGTTTGGTGATTGGTTGCG
GGCTGTTCCATTTCGGCGTGCTGTTGCTAGTGCTTCAGAAGAGGGTGGTGGGAATCCAGGTAGTCAGGGGGGGCGGGGAGCAGGTTTGTCTGGTAGAGGGAGAGGTCGGG
GCCAGGTGGACCAGTCTGAGGTGGTGGGGGTAGAGAGGCCTGATGTGGTGTCGGTGTCTGACCTGGTGGCCGATCCGGCTGTTGCTTCAGTAGTTGTGTCTGAGGTGGGT
ACGGTATCTCAGGCTCCTTCTGGTACCGCCCATACAGTCGCTCCTGATGTTGGGTTGGTTTCTGCAGAAAACGGTAAGGAGGTGGCAGACTTGGCTGTTGATTCAGTAGC
TGTTGCTAAGGTGGATACGGTTCCTTTGGCCCCTTCAGTTGGCACTAGTATAGTCACTTCTGTTGCTGGGTTGGTTTCTGCAGATAAAGGTAAGGCTGTGGCTAATGAGA
GCTTTGAGGTCGCTATGACTGATGTGCATGCTGTTCCGATCAAGAAGAGTTGGAAGCGACTGGCCAGAGGCTCTTTAAAGGACATTACTAATGAATCACCCTCCCCAGTT
ATTAGTAGGCATAAGCGACCAGCCCAGGGGGACCCGCCTGATAAGGTGGGGTCAGCTCCCAAGCGGCCGAAAGAGGGGGGATCGGGTGTTGATCTGTGTGGGGCTGATGT
GATGGATGTGGCGGTGGCTGGGTCCCAGCCCCGCCCGGGATTATGA
Protein sequenceShow/hide protein sequence
MPHIKFNNENLSFSLGELRKLGSFSRWMDKEIGRDCDDSLMTLDSGRIPFYVSCRNRLSYSEVREFEYREKPPTLSAANAIKCALEDEDNVKENVNVDEIIHTADVVPSQ
LTEDELLSLKASLAVVRVCCCLNPCGFPCLFISSSLCVRVGSRACWGLRLGFAAWVFFSGGVGLRAGGGSSSAAAWVLRAGGGSSSAAAATGFDGAPVSPEAAGRLQRRL
PRGLSVRRRFVCSIWFRFDGVGLSCRVWLGCRGFVCSVDAPATVGFGLQFGISSAKLCYASWCVLRLILLFLGVVVWWCSSVFRYLFGFCGLCLAVVLCRLWLVRFLWLC
DSLCLVRLVVRILRSALMDDLVIQWEQMGLSEAETTAFPVPADIPLLDESTVQLCVVGKVLSSKPVNADAFRRVMLSVWSVHRSTRIEPWGDNIFVIRFHSVTEKRRIMS
LGPWTFDKSLLVLVSPQGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVIGQVVEVPGEGHNDWLGSVMRVRVVLNMAHPLRRVVRLVKGDGSVLWCPLQYERL
PDFCFKCGRIGHSHRECSEDGEGVGADGKFLFGDWLRAVPFRRAVASASEEGGGNPGSQGGRGAGLSGRGRGRGQVDQSEVVGVERPDVVSVSDLVADPAVASVVVSEVG
TVSQAPSGTAHTVAPDVGLVSAENGKEVADLAVDSVAVAKVDTVPLAPSVGTSIVTSVAGLVSADKGKAVANESFEVAMTDVHAVPIKKSWKRLARGSLKDITNESPSPV
ISRHKRPAQGDPPDKVGSAPKRPKEGGSGVDLCGADVMDVAVAGSQPRPGL