CuGenDBv2

Lag0030730 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030730
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr11:871603..874724
RNA-Seq Expression: Lag0030730
Synteny: Lag0030730
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR036875 - Zinc finger, CCHC-type superfamily
  IPR040256 - Uncharacterized protein At4g02000-like
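
The InterPro name "Zinc knuckle CX2CX4HX4C" spells out a residue spacing that can be checked directly against the protein sequence. The following is a minimal sketch, assuming the name is read literally as a cysteine, any two residues, a cysteine, any four residues, a histidine, any four residues, and a cysteine. The function name `find_zinc_knuckles` is hypothetical, and this plain regular expression is not the profile InterPro itself uses to assign the domain.

```python
import re

# Literal reading of "CX2CX4HX4C": C, 2 residues, C, 4 residues, H, 4 residues, C.
# Assumption: a simple regex stands in for the real InterPro signature.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein):
    """Return (1-based start position, matched residues) for each non-overlapping hit."""
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


# Fragment copied from the protein sequence of Lag0030730.
fragment = "LYERLPDFCFQCGRIGHSHRECPEVGSEG"
print(find_zinc_knuckles(fragment))  # [(9, 'CFQCGRIGHSHREC')]
```

Positions are relative to whatever string is passed in, so scanning the full protein sequence rather than the fragment gives coordinates in the full-length protein.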


Homology
GenBank top hits (e-value, % identity, alignment)
TXG50019.1 hypothetical protein EZV62_025894 [Acer yangbiense] (e-value: 1.7e-40, identity: 41.63%)
Query:  LDLTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPG
        L L E E + + + +G        + L + GK+LS K VN +AF +V+  +W   +   IE L  N+F   F    ++  +L+ GPWSFD+ALLVL  P 
Subjt:  LDLTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPG

Query:  VSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRL-VKDDGSSLWCPLLYERLPDFCFQ
               + F + AFW+QI ++P   +T  + R LGS++G V+E+   G GD +G  +RVRVV+DVTKPLRR++R+ V  DG      L YERLPD C++
Subjt:  VSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRL-VKDDGSSLWCPLLYERLPDFCFQ

Query:  CGRIGHSHRECPEV-GSEGASDARFPFGDWLRA
        CGRIGH  R+C  V  S    D    FG WLRA
Subjt:  CGRIGHSHRECPEV-GSEGASDARFPFGDWLRA

XP_015380691.1 uncharacterized protein LOC107174364 [Citrus sinensis] (e-value: 7.5e-41, identity: 38.24%)
Query:  CVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPGVSDSPTMLDFSRCAFWVQISQIPFRYL
        C+VGK+L ++ VN + F++ +  VW   +  +IESLG N F+ +F+   +K R+L  GPW FDRALLVL  P      T   F+  AFW+QI  +P   +
Subjt:  CVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPGVSDSPTMLDFSRCAFWVQISQIPFRYL

Query:  TPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQCGRIGHSHRECPEVGSEGASDARFPFGD
           + + LG ++G VEE+  +  G+ +G   R+RV++++T PL++++ L ++  S +  P++YERLPDFC+ CG IGH ++EC +   +G    + P+G 
Subjt:  TPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQCGRIGHSHRECPEVGSEGASDARFPFGD

Query:  WLRA
        W++A
Subjt:  WLRA

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia] (e-value: 9.4e-60, identity: 41.69%)
Query:  TEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPGVSD
        T +E+  V +  G P+L   +++LCVV K+ +SK ++ +A R+VM SVW VH +TR E LG N++VI F S+ EK R+L +GPW+F+++LLVL SP  ++
Subjt:  TEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPGVSD

Query:  SPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQCGRI
         P  ++F+ CAFW+QI  IPF  ++  +A  LG+ +G VEE+ G+G   W G  +RVRV +DV+KPLRR ++L   DG  +WCPL YE+LPDFC++CG+I
Subjt:  SPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQCGRI

Query:  GHSHRECPEVGSEGASDARFPFGDWLRASPLRRMGSGGSGDGGRRSDGFGSRSSGWAGRGRGRG---RSGVSEVEVEGVSGFARTTESPSVDPLPSVHES
        GHS REC +      +++   +GDWLRA+ L++  S    +   R   FG       GRG GRG   R   +  +++G     R      VD +P+    
Subjt:  GHSHRECPEVGSEGASDARFPFGDWLRASPLRRMGSGGSGDGGRRSDGFGSRSSGWAGRGRGRG---RSGVSEVEVEGVSGFARTTESPSVDPLPSVHES

Query:  ETVVVSE
        E+V  +E
Subjt:  ETVVVSE

XP_022156711.1 uncharacterized protein LOC111023555 [Momordica charantia] (e-value: 1.8e-50, identity: 46.19%)
Query:  NLDLTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISP
        N  L +EE   + +    P+L   +IQLC VGK+  SK + V+AF +VM  VW +H +TRIE+ G N++VI F ++ EK+R+   GPW+FD++LL+L+  
Subjt:  NLDLTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISP

Query:  GVSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQ
          ++ P  +D S CAFWVQI  I F  +T  +A+ LG+ +G VEEV G    DW+   + VRV ++V KPLRR +++   DG  +WCPL YERLPDFC+ 
Subjt:  GVSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQ

Query:  CGRIGHSHRE
        CG +GHS RE
Subjt:  CGRIGHSHRE

XP_024955847.1 uncharacterized protein LOC112498636 [Citrus sinensis] (e-value: 9.8e-41, identity: 39.32%)
Query:  CVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISP-GVSDSPTMLDFSRCAFWVQISQIPFRY
        C+VGKI+  + VN++  R+ ML +W  ++  RIESLG N+F+ +F+   +K R++  GPW FD AL+VL  P G+ D      F+   FWV +  +P  +
Subjt:  CVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISP-GVSDSPTMLDFSRCAFWVQISQIPFRY

Query:  LTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQCGRIGHSHRECPEVGSEGASDARFPFG
        +   + + LG  +G VEE+  +  G+ +G + R+R+ +D+TKPLR+ + +  +D  S+  P+LYERLPDFCF CG IGH ++EC E   +   D   P+G
Subjt:  LTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQCGRIGHSHRECPEVGSEGASDARFPFG

Query:  DWLRAS
         W++A+
Subjt:  DWLRAS

TrEMBL top hits (e-value, % identity, alignment)
A0A1S8AC25 CCHC-type domain-containing protein (Fragment) (e-value: 8.1e-41, identity: 38.52%)
Query:  LTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISP-GV
        L++EE   V       +  E  +  C+VGK+L ++ V+++  +  M  VW   R  +IE LGENVF+ +F S  +K  I+  GPW FDRAL+ L  P G+
Subjt:  LTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISP-GV

Query:  SDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRL--VKDDGSSLWCPLLYERLPDFCFQ
         D     DFS  +FWVQI  +P   ++  +A  LG V+G VEEV  +  G+  G  +R+R+ +D+TKPL++++ L   ++D   +   ++YERLPDFCF 
Subjt:  SDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRL--VKDDGSSLWCPLLYERLPDFCFQ

Query:  CGRIGHSHRECPEVGSEGASDARFPFGDWLRASPLRRMGSGGSG
        CGRIGH +REC    S+  S     +G WL+A+ +      G G
Subjt:  CGRIGHSHRECPEVGSEGASDARFPFGDWLRASPLRRMGSGGSG

A0A5C7GZQ4 CCHC-type domain-containing protein (e-value: 8.1e-41, identity: 41.63%)
Query:  LDLTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPG
        L L E E + + + +G        + L + GK+LS K VN +AF +V+  +W   +   IE L  N+F   F    ++  +L+ GPWSFD+ALLVL  P 
Subjt:  LDLTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPG

Query:  VSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRL-VKDDGSSLWCPLLYERLPDFCFQ
               + F + AFW+QI ++P   +T  + R LGS++G V+E+   G GD +G  +RVRVV+DVTKPLRR++R+ V  DG      L YERLPD C++
Subjt:  VSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRL-VKDDGSSLWCPLLYERLPDFCFQ

Query:  CGRIGHSHRECPEV-GSEGASDARFPFGDWLRA
        CGRIGH  R+C  V  S    D    FG WLRA
Subjt:  CGRIGHSHRECPEV-GSEGASDARFPFGDWLRA

A0A6J1D765 uncharacterized protein LOC111017902 (e-value: 4.6e-60, identity: 41.69%)
Query:  TEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPGVSD
        T +E+  V +  G P+L   +++LCVV K+ +SK ++ +A R+VM SVW VH +TR E LG N++VI F S+ EK R+L +GPW+F+++LLVL SP  ++
Subjt:  TEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPGVSD

Query:  SPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQCGRI
         P  ++F+ CAFW+QI  IPF  ++  +A  LG+ +G VEE+ G+G   W G  +RVRV +DV+KPLRR ++L   DG  +WCPL YE+LPDFC++CG+I
Subjt:  SPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQCGRI

Query:  GHSHRECPEVGSEGASDARFPFGDWLRASPLRRMGSGGSGDGGRRSDGFGSRSSGWAGRGRGRG---RSGVSEVEVEGVSGFARTTESPSVDPLPSVHES
        GHS REC +      +++   +GDWLRA+ L++  S    +   R   FG       GRG GRG   R   +  +++G     R      VD +P+    
Subjt:  GHSHRECPEVGSEGASDARFPFGDWLRASPLRRMGSGGSGDGGRRSDGFGSRSSGWAGRGRGRG---RSGVSEVEVEGVSGFARTTESPSVDPLPSVHES

Query:  ETVVVSE
        E+V  +E
Subjt:  ETVVVSE

A0A6J1DU55 uncharacterized protein LOC111023135 (e-value: 1.7e-38, identity: 35.04%)
Query:  TEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPGVSD
        +EE+ I + V   A  + E  +   +VGK+L+ ++++ D    V+L  W V     +ES+G+N+F+  F    +  R++KTGPW FD+AL+VL  P  S 
Subjt:  TEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPGVSD

Query:  SPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRL-VKDDGSSLWCPLLYERLPDFCFQCGR
        + + L+F+R AFW+ +  +P  +L  T+A  LG+ +G   +V     G   GA +R+RV++D+TKPLRR +++ +       W P+ YERLPDFC+ CG 
Subjt:  SPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRL-VKDDGSSLWCPLLYERLPDFCFQCGR

Query:  IGHSHRECPE--VGSEGASDARFPFGDWLR-ASPLRRMGSGGSGDGGRRSDGFGSRSSGWAGRGRGRGRSGVSE
        IGHS  +C    + ++  S A   +G WLR          G  G    R D  GS S     RG    +  +SE
Subjt:  IGHSHRECPE--VGSEGASDARFPFGDWLR-ASPLRRMGSGGSGDGGRRSDGFGSRSSGWAGRGRGRGRSGVSE

A0A6J1DVS4 uncharacterized protein LOC111023555 (e-value: 8.6e-51, identity: 46.19%)
Query:  NLDLTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISP
        N  L +EE   + +    P+L   +IQLC VGK+  SK + V+AF +VM  VW +H +TRIE+ G N++VI F ++ EK+R+   GPW+FD++LL+L+  
Subjt:  NLDLTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISP

Query:  GVSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQ
          ++ P  +D S CAFWVQI  I F  +T  +A+ LG+ +G VEEV G    DW+   + VRV ++V KPLRR +++   DG  +WCPL YERLPDFC+ 
Subjt:  GVSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQ

Query:  CGRIGHSHRE
        CG +GHS RE
Subjt:  CGRIGHSHRE

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G42140.1 zinc ion binding;nucleic acid binding (e-value: 4.3e-10, identity: 29.45%)
Query:  FSSIGEKLRILKTGPWSFDRALLVLISPGVSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLR
        F S      IL+ GPWSF+  + V+       S    +F R  FW+QI  IP R+LT  +  ++G  +GL  E                         L 
Subjt:  FSSIGEKLRILKTGPWSFDRALLVLISPGVSDSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLR

Query:  RVVRLVKDDGSSLWCPLLYERLPDFCFQCGRIGHSHRECPEVGSEG
        R V ++K           YE+L +FC  CG + H   ECP  G++G
Subjt:  RVVRLVKDDGSSLWCPLLYERLPDFCFQCGRIGHSHRECPEVGSEG
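
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. The percent identity reported for each hit is the share of identical columns over the whole alignment. The sketch below shows that calculation for a single segment, assuming the midline follows this standard BLAST convention and that leading label padding has already been stripped. The function name `alignment_stats` is hypothetical.

```python
def alignment_stats(query, midline):
    """Return (identity, positives) as fractions of the aligned length.

    query   -- the aligned query segment (gap characters count as columns)
    midline -- the BLAST match line for the same columns
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment segment")
    # Trailing spaces in the midline are often lost in copied text; pad them back.
    columns = midline.ljust(length)[:length]
    identical = sum(1 for ch in columns if ch.isalpha())
    positives = identical + columns.count("+")
    return identical / length, positives / length


# Last segment of the XP_022156711.1 alignment.
identity, positives = alignment_stats("CGRIGHSHRE", "CG +GHS RE")
print(identity, positives)  # 0.7 0.8
```

To reproduce the figure reported for a whole hit, the counts would need to be summed across every segment of that hit before dividing, rather than averaging the per-segment fractions.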


Sequences
CDS sequence
ATGGAAAAAAAACATTGGTGGCCGGAAAAGTCATCGGCGATCGCCGGAAAAGTCGTCGGGGTCGCCAGAAATTCGTCGGAAACTTCGCCGGCTGACGGCGGCCGGCGGTC
GTCGAAAAGTCAAGGAGTCGCATGGAAGAATCGACGGCGGTTACGTGCTTCTCTTGATTTCCAGCGGTTGGGCTTCGATTTTCCGGCGGTTTCTCCGGCGGTTGGGCTTC
GATCTTCCGGCGGCGTGGTGTTGATTTTCCGGCGATTTCTCTCGGCTGCGGCGGCGGTTTTCAGGCTTCTCTTCACGGCGTTTTCTCTTGTTTGCGGCGGCGTTTTTAGG
TATCGATTTCCGGCGATTTCCCTTGTCTGCGGCGGCGTTTCCAGGCTTCGATTTCCGGCGGGTTTCAGTTTGGTGTACAGGGTTCGAGTTGCGGTGGCGGGGTCTCGATT
GGGCGGTTGTTCCCGTCGGTTCTCATCTGCGTGGGATCGTCTTCAGTGTGTGTTTGTGGTTTGTGTTTGGGTGTGTGTGGTTTGTGCTTGTGTGCTTGTGGTTTGTGATT
GGTGGTTGTTCTCGTCGCTTCTTCAGTGTGTGCTTGTGGTTTGTGTGTTGGTTTGGGATTGTGCAGGTTTTGTTATTGTGCTGTTTTGTGCTGGTTGGTGTTCTGGGTTG
TGCTGGTTTGTGATTGTACTGAATTTGGATCTAACGGAGGAAGAATCGATTGGGGTTCCGGTGCCGGCGGGCGCTCCCTTGCTTAATGAGTCTTCGATTCAGTTATGCGT
AGTTGGCAAGATTCTTTCGTCTAAGGTTGTGAATGTTGATGCATTCCGGAACGTTATGCTGTCGGTTTGGAGCGTTCATCGGGCTACTCGGATTGAATCTTTGGGCGAGA
ATGTATTTGTGATTCGGTTCTCGTCCATTGGGGAAAAGCTCCGCATTTTGAAGACAGGGCCTTGGTCTTTTGATAGGGCACTTCTTGTTCTGATTTCGCCAGGAGTCTCA
GATAGTCCAACTATGTTGGATTTCTCTCGTTGTGCTTTTTGGGTCCAAATCTCTCAAATTCCTTTTCGGTACCTTACTCCGACGGTTGCCCGTGCTCTGGGTAGTGTGGT
GGGCTTGGTTGAGGAAGTTGCTGGGGAGGGTTATGGTGATTGGATGGGGGCGGTGATGAGGGTTCGGGTTGTTCTTGATGTCACCAAGCCGCTCCGGCGGGTTGTTCGTT
TGGTGAAAGATGATGGGTCGTCTTTGTGGTGCCCTCTTCTGTATGAGCGGTTACCGGATTTTTGTTTTCAGTGTGGGCGTATTGGGCATTCACACAGAGAGTGTCCTGAA
GTAGGTTCAGAGGGTGCTTCAGATGCTCGGTTCCCCTTTGGTGATTGGTTGCGTGCGTCCCCCTTACGTCGTATGGGTTCTGGAGGTTCAGGAGATGGGGGTCGGCGTTC
TGACGGGTTTGGTAGTCGTTCTTCTGGTTGGGCTGGTAGAGGAAGGGGGCGGGGTCGATCAGGGGTGTCAGAGGTAGAGGTTGAGGGGGTGTCTGGGTTTGCTCGGACTA
CTGAATCACCTTCTGTTGATCCACTGCCGAGTGTACATGAGTCTGAGACTGTGGTTGTGTCTGAGGATGTGCCTGTGTTGGTGGCTGAGAAGGGGCCTGTGTTGGTGATT
GAGGATGAGCCTGTGACGGTGGCTGAGAACGTGTTGGGGGATGTGGATGTGTCTCAGGCACTGGACCCGCTTGATGATGGGAATGGTATTGTGGGTACTCGGGATAAGGG
GAAAGCGGTGACTGTTGAAAGGGCAGTGGTGGTGGGGAAGGAGAATATTCTGGGTATGGCGAGGAAAGGTTGGAAGTGGCTAGCCAGGGGCAATCTGAACGATATTACTT
CAGTGAGTGTTAGCCAGGGGAAGCGTCCGGGTGGGATGGATTGTGTTTCTGAGGATATTGGGGTGGTTAAACGTCAGAAAGTTATGGGTGATGATATGATTCTAGGTGCG
TCAGGTGGGCAGGATATGGCGGTGGCTGGGTCTCAGCCCCGCCAAGGATTATGA
mRNA sequence
ATGGAAAAAAAACATTGGTGGCCGGAAAAGTCATCGGCGATCGCCGGAAAAGTCGTCGGGGTCGCCAGAAATTCGTCGGAAACTTCGCCGGCTGACGGCGGCCGGCGGTC
GTCGAAAAGTCAAGGAGTCGCATGGAAGAATCGACGGCGGTTACGTGCTTCTCTTGATTTCCAGCGGTTGGGCTTCGATTTTCCGGCGGTTTCTCCGGCGGTTGGGCTTC
GATCTTCCGGCGGCGTGGTGTTGATTTTCCGGCGATTTCTCTCGGCTGCGGCGGCGGTTTTCAGGCTTCTCTTCACGGCGTTTTCTCTTGTTTGCGGCGGCGTTTTTAGG
TATCGATTTCCGGCGATTTCCCTTGTCTGCGGCGGCGTTTCCAGGCTTCGATTTCCGGCGGGTTTCAGTTTGGTGTACAGGGTTCGAGTTGCGGTGGCGGGGTCTCGATT
GGGCGGTTGTTCCCGTCGGTTCTCATCTGCGTGGGATCGTCTTCAGTGTGTGTTTGTGGTTTGTGTTTGGGTGTGTGTGGTTTGTGCTTGTGTGCTTGTGGTTTGTGATT
GGTGGTTGTTCTCGTCGCTTCTTCAGTGTGTGCTTGTGGTTTGTGTGTTGGTTTGGGATTGTGCAGGTTTTGTTATTGTGCTGTTTTGTGCTGGTTGGTGTTCTGGGTTG
TGCTGGTTTGTGATTGTACTGAATTTGGATCTAACGGAGGAAGAATCGATTGGGGTTCCGGTGCCGGCGGGCGCTCCCTTGCTTAATGAGTCTTCGATTCAGTTATGCGT
AGTTGGCAAGATTCTTTCGTCTAAGGTTGTGAATGTTGATGCATTCCGGAACGTTATGCTGTCGGTTTGGAGCGTTCATCGGGCTACTCGGATTGAATCTTTGGGCGAGA
ATGTATTTGTGATTCGGTTCTCGTCCATTGGGGAAAAGCTCCGCATTTTGAAGACAGGGCCTTGGTCTTTTGATAGGGCACTTCTTGTTCTGATTTCGCCAGGAGTCTCA
GATAGTCCAACTATGTTGGATTTCTCTCGTTGTGCTTTTTGGGTCCAAATCTCTCAAATTCCTTTTCGGTACCTTACTCCGACGGTTGCCCGTGCTCTGGGTAGTGTGGT
GGGCTTGGTTGAGGAAGTTGCTGGGGAGGGTTATGGTGATTGGATGGGGGCGGTGATGAGGGTTCGGGTTGTTCTTGATGTCACCAAGCCGCTCCGGCGGGTTGTTCGTT
TGGTGAAAGATGATGGGTCGTCTTTGTGGTGCCCTCTTCTGTATGAGCGGTTACCGGATTTTTGTTTTCAGTGTGGGCGTATTGGGCATTCACACAGAGAGTGTCCTGAA
GTAGGTTCAGAGGGTGCTTCAGATGCTCGGTTCCCCTTTGGTGATTGGTTGCGTGCGTCCCCCTTACGTCGTATGGGTTCTGGAGGTTCAGGAGATGGGGGTCGGCGTTC
TGACGGGTTTGGTAGTCGTTCTTCTGGTTGGGCTGGTAGAGGAAGGGGGCGGGGTCGATCAGGGGTGTCAGAGGTAGAGGTTGAGGGGGTGTCTGGGTTTGCTCGGACTA
CTGAATCACCTTCTGTTGATCCACTGCCGAGTGTACATGAGTCTGAGACTGTGGTTGTGTCTGAGGATGTGCCTGTGTTGGTGGCTGAGAAGGGGCCTGTGTTGGTGATT
GAGGATGAGCCTGTGACGGTGGCTGAGAACGTGTTGGGGGATGTGGATGTGTCTCAGGCACTGGACCCGCTTGATGATGGGAATGGTATTGTGGGTACTCGGGATAAGGG
GAAAGCGGTGACTGTTGAAAGGGCAGTGGTGGTGGGGAAGGAGAATATTCTGGGTATGGCGAGGAAAGGTTGGAAGTGGCTAGCCAGGGGCAATCTGAACGATATTACTT
CAGTGAGTGTTAGCCAGGGGAAGCGTCCGGGTGGGATGGATTGTGTTTCTGAGGATATTGGGGTGGTTAAACGTCAGAAAGTTATGGGTGATGATATGATTCTAGGTGCG
TCAGGTGGGCAGGATATGGCGGTGGCTGGGTCTCAGCCCCGCCAAGGATTATGA
Protein sequence
MEKKHWWPEKSSAIAGKVVGVARNSSETSPADGGRRSSKSQGVAWKNRRRLRASLDFQRLGFDFPAVSPAVGLRSSGGVVLIFRRFLSAAAAVFRLLFTAFSLVCGGVFR
YRFPAISLVCGGVSRLRFPAGFSLVYRVRVAVAGSRLGGCSRRFSSAWDRLQCVFVVCVWVCVVCACVLVVCDWWLFSSLLQCVLVVCVLVWDCAGFVIVLFCAGWCSGL
CWFVIVLNLDLTEEESIGVPVPAGAPLLNESSIQLCVVGKILSSKVVNVDAFRNVMLSVWSVHRATRIESLGENVFVIRFSSIGEKLRILKTGPWSFDRALLVLISPGVS
DSPTMLDFSRCAFWVQISQIPFRYLTPTVARALGSVVGLVEEVAGEGYGDWMGAVMRVRVVLDVTKPLRRVVRLVKDDGSSLWCPLLYERLPDFCFQCGRIGHSHRECPE
VGSEGASDARFPFGDWLRASPLRRMGSGGSGDGGRRSDGFGSRSSGWAGRGRGRGRSGVSEVEVEGVSGFARTTESPSVDPLPSVHESETVVVSEDVPVLVAEKGPVLVI
EDEPVTVAENVLGDVDVSQALDPLDDGNGIVGTRDKGKAVTVERAVVVGKENILGMARKGWKWLARGNLNDITSVSVSQGKRPGGMDCVSEDIGVVKRQKVMGDDMILGA
SGGQDMAVAGSQPRQGL
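
The protein sequence appears to be the CDS translated codon by codon with the standard genetic code: the CDS is 2034 nt long, which is 677 codons plus a stop codon, and the protein is 677 residues long. The sketch below shows how that translation could be reproduced. It assumes the standard nuclear code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are illustrative rather than part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the Lag0030730 CDS.
print(translate("ATGGAAAAAAAACATTGGTGGCCGGAAAAG"))  # MEKKHWWPEK
print(translate("CCCCGCCAAGGATTATGA"))              # PRQGL
```

Passing the full CDS text, line breaks included, should return a string that can be compared directly with the protein sequence once its own line breaks are removed.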