CuGenDBv2

Lag0028688 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028688
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr8:28480109..28481391
RNA-Seq Expression: Lag0028688
Synteny: Lag0028688
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
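
The genome location field above uses a `chromosome:start..end` string. As a minimal sketch, it can be split into its parts and the length of the annotated span computed as follows. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates (the usual GFF convention), which this page does not state explicitly.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr8:28480109..28481391")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model covers 1283 bp on chr8.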


Homology
GenBank top hits
TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] (e value: 9.6e-44; % identity: 28.37)
Query:  DDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFDK
        DD++   E++ L ++   +  ++  L    E ++ +  +GK ++++  N EAF+  + ++W   +   +E +G N F  RF +  ++++I+ GGPW FDK
Subjt:  DDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFDK

Query:  SLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISEN-KYQWCPVGY
         LL L    G +    L F Y  FW+Q+ N+   CL   +   LG LVG V+E+     GE +G  + +RVLID+  PL+RG+R+ + ++ K     + Y
Subjt:  SLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISEN-KYQWCPVGY

Query:  EKLPDFCYQCGRIGHLHRDCELVLSELGAASPLQYGEWMRAISMRQISGL------------YGTSDDLSRSR---GMRGRRGSDDD-------------
        E+LP+FCY CG+IGHL RDC L   E+ ++S  ++G WMRA+S  +  G              G+SD L   R     +   G D               
Subjt:  EKLPDFCYQCGRIGHLHRDCELVLSELGAASPLQYGEWMRAISMRQISGL------------YGTSDDLSRSR---GMRGRRGSDDD-------------

Query:  --------------------SPPEHLDIPHSAQREESIPLLPEVGKCFKGKTVVEPQQMGVSITDSLAGVGSNLSPVSIAARKNWKRLARGALMDVTNSV
                            S  E L +  S  +E+      +  K  +    V    +G ++++    +GS  +   I  +K WKRLAR    +   SV
Subjt:  --------------------SPPEHLDIPHSAQREESIPLLPEVGKCFKGKTVVEPQQMGVSITDSLAGVGSNLSPVSIAARKNWKRLARGALMDVTNSV

Query:  LDTGSSFGKRSGSENLEGENVTMSKKKVKV
         +   S GK+ G  ++E  +    +KK+ V
Subjt:  LDTGSSFGKRSGSENLEGENVTMSKKKVKV

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia] (e value: 1.2e-65; % identity: 42.86)
Query:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD
        MD++  LWE    T +E   V ++ G  +L+   V++C V KL +S++ + EA R V+ SVW VH+STR E LG N +VI F SLSEK ++++ GPW+F+
Subjt:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD

Query:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY
        KSLL L SP   + PL ++F + AFW+QI N+ F C+++ MA  LG+ +G VEE+ G+G   W G  + VRV ID+SKPLRRG++LK S+ K  WCP+ Y
Subjt:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY

Query:  EKLPDFCYQCGRIGHLHRDCELVLSELGAASPLQYGEWMRAISMRQI-----------SGLYGTSDDLSRSRGMRGRRGSDDDSPPEHLDIPHSAQR---
        EKLPDFCY+CG+IGH  R+CE     +   SP QYG+W+RA  +++             G +G    ++  RG RG     D++  + +D P S+ R   
Subjt:  EKLPDFCYQCGRIGHLHRDCELVLSELGAASPLQYGEWMRAISMRQI-----------SGLYGTSDDLSRSRGMRGRRGSDDDSPPEHLDIPHSAQR---

Query:  EESIPLLP
        EE +  +P
Subjt:  EESIPLLP

XP_022155933.1 uncharacterized protein LOC111022932 [Momordica charantia] (e value: 7.4e-44; % identity: 43.28)
Query:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD
        MD++  LWE   LT+E+   ++++    L++   VQ  AVGKL +S++ +VEAFR V+ S+W VH ST IE  G N +VI F S++EK +++  GPWSF+
Subjt:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD

Query:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY
         SLL L SP   D P  ++F + A W+QI  + F C+   MA+ LG+ +G VEE+   G  EW G  + VRV ID+SKP +RG++++  + K  WCP+ Y
Subjt:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY

Query:  E
        E
Subjt:  E

XP_022156711.1 uncharacterized protein LOC111023555 [Momordica charantia] (e value: 3.3e-52; % identity: 46.61)
Query:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD
        MDD+  LWE   L +EE   +V++    +L+   +Q+CAVGKL  S++  VEAF  V+  VW +H+STRIE  G N +VI F +++EK ++ + GPW+FD
Subjt:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD

Query:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY
        KSLL LV     + PL +D +  AFWVQI  ++F C+T  MA+ LG+ +G VEEV G    +W+   + VRV I++ KPLRRG+++K S+ K  WCP+ Y
Subjt:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY

Query:  EKLPDFCYQCGRIGHLHRDCE
        E+LPDFCY CG +GH  R+ E
Subjt:  EKLPDFCYQCGRIGHLHRDCE

XP_024035600.1 uncharacterized protein LOC112096408 [Citrus clementina] (e value: 1.1e-42; % identity: 34.16)
Query:  ERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFDKSLLALVS
        E + L  EE  ++     + +  E  V  C VGK+L +R    E  R  +   W     T++E+LGDN F+ +F S SEK++I+ GGPW FD+SL+ +  
Subjt:  ERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFDKSLLALVS

Query:  PRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGYEKLPDFCY
        P GI D    DFT+  FWVQI N+   C+   + + +G  +G V+EV  +  GE +GS + VR+++++++PL + + LK+ +       + YEKLPDFC+
Subjt:  PRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGYEKLPDFCY

Query:  QCGRIGHLHRDCELVLSELG-AASPLQYGEWMRAISMRQISGLYGTSDDLSRSRGMRGRRGSDDDSPPEHLDIPHSAQREE
         CG IGH +R+C   L   G +   L YG WMRA +  +            R+R  R +   + D  P +  +    Q+E+
Subjt:  QCGRIGHLHRDCELVLSELG-AASPLQYGEWMRAISMRQISGLYGTSDDLSRSRGMRGRRGSDDDSPPEHLDIPHSAQREE

TrEMBL top hits
A0A1S8AC25 CCHC-type domain-containing protein (Fragment) (e value: 7.4e-42; % identity: 38.53)
Query:  LTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFDKSLLALVSPRGI
        L++EE   V  +  + +  E  +  C VGK+L +R+ ++E  +  +  VW      +IE LG+N F+ +F S  +KR IM GGPW FD++L+ L  P GI
Subjt:  LTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFDKSLLALVSPRGI

Query:  DDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCP--VGYEKLPDFCYQC
         D    DF++ +FWVQI +V   C++  MA  LG ++G VEEV  +  GE  G  + +R+ +D++KPL++ + L+  E      P  V YE+LPDFC+ C
Subjt:  DDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCP--VGYEKLPDFCYQC

Query:  GRIGHLHRDCELVLSELGAASPLQYGEWMRA
        GRIGH +R+C    S+  +   L YG W++A
Subjt:  GRIGHLHRDCELVLSELGAASPLQYGEWMRA

A0A5C7H9Y2 CCHC-type domain-containing protein (e value: 4.6e-44; % identity: 28.37)
Query:  DDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFDK
        DD++   E++ L ++   +  ++  L    E ++ +  +GK ++++  N EAF+  + ++W   +   +E +G N F  RF +  ++++I+ GGPW FDK
Subjt:  DDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFDK

Query:  SLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISEN-KYQWCPVGY
         LL L    G +    L F Y  FW+Q+ N+   CL   +   LG LVG V+E+     GE +G  + +RVLID+  PL+RG+R+ + ++ K     + Y
Subjt:  SLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISEN-KYQWCPVGY

Query:  EKLPDFCYQCGRIGHLHRDCELVLSELGAASPLQYGEWMRAISMRQISGL------------YGTSDDLSRSR---GMRGRRGSDDD-------------
        E+LP+FCY CG+IGHL RDC L   E+ ++S  ++G WMRA+S  +  G              G+SD L   R     +   G D               
Subjt:  EKLPDFCYQCGRIGHLHRDCELVLSELGAASPLQYGEWMRAISMRQISGL------------YGTSDDLSRSR---GMRGRRGSDDD-------------

Query:  --------------------SPPEHLDIPHSAQREESIPLLPEVGKCFKGKTVVEPQQMGVSITDSLAGVGSNLSPVSIAARKNWKRLARGALMDVTNSV
                            S  E L +  S  +E+      +  K  +    V    +G ++++    +GS  +   I  +K WKRLAR    +   SV
Subjt:  --------------------SPPEHLDIPHSAQREESIPLLPEVGKCFKGKTVVEPQQMGVSITDSLAGVGSNLSPVSIAARKNWKRLARGALMDVTNSV

Query:  LDTGSSFGKRSGSENLEGENVTMSKKKVKV
         +   S GK+ G  ++E  +    +KK+ V
Subjt:  LDTGSSFGKRSGSENLEGENVTMSKKKVKV

A0A6J1D765 uncharacterized protein LOC111017902 (e value: 5.6e-66; % identity: 42.86)
Query:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD
        MD++  LWE    T +E   V ++ G  +L+   V++C V KL +S++ + EA R V+ SVW VH+STR E LG N +VI F SLSEK ++++ GPW+F+
Subjt:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD

Query:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY
        KSLL L SP   + PL ++F + AFW+QI N+ F C+++ MA  LG+ +G VEE+ G+G   W G  + VRV ID+SKPLRRG++LK S+ K  WCP+ Y
Subjt:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY

Query:  EKLPDFCYQCGRIGHLHRDCELVLSELGAASPLQYGEWMRAISMRQI-----------SGLYGTSDDLSRSRGMRGRRGSDDDSPPEHLDIPHSAQR---
        EKLPDFCY+CG+IGH  R+CE     +   SP QYG+W+RA  +++             G +G    ++  RG RG     D++  + +D P S+ R   
Subjt:  EKLPDFCYQCGRIGHLHRDCELVLSELGAASPLQYGEWMRAISMRQI-----------SGLYGTSDDLSRSRGMRGRRGSDDDSPPEHLDIPHSAQR---

Query:  EESIPLLP
        EE +  +P
Subjt:  EESIPLLP

A0A6J1DP89 uncharacterized protein LOC111022932 (e value: 3.6e-44; % identity: 43.28)
Query:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD
        MD++  LWE   LT+E+   ++++    L++   VQ  AVGKL +S++ +VEAFR V+ S+W VH ST IE  G N +VI F S++EK +++  GPWSF+
Subjt:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD

Query:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY
         SLL L SP   D P  ++F + A W+QI  + F C+   MA+ LG+ +G VEE+   G  EW G  + VRV ID+SKP +RG++++  + K  WCP+ Y
Subjt:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY

Query:  E
        E
Subjt:  E

A0A6J1DVS4 uncharacterized protein LOC111023555 (e value: 1.6e-52; % identity: 46.61)
Query:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD
        MDD+  LWE   L +EE   +V++    +L+   +Q+CAVGKL  S++  VEAF  V+  VW +H+STRIE  G N +VI F +++EK ++ + GPW+FD
Subjt:  MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFD

Query:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY
        KSLL LV     + PL +D +  AFWVQI  ++F C+T  MA+ LG+ +G VEEV G    +W+   + VRV I++ KPLRRG+++K S+ K  WCP+ Y
Subjt:  KSLLALVSPRGIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGY

Query:  EKLPDFCYQCGRIGHLHRDCE
        E+LPDFCY CG +GH  R+ E
Subjt:  EKLPDFCYQCGRIGHLHRDCE

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGACGATTTGGCTGTGCTCTGGGAGCGTATGGTGTTGACTGAGGAAGAAACGATGGTTGTTGTCGTGGAGGAGGGTTTGTCGTTGCTGTCGGAGGCCACAGTCCAGGT
ATGTGCTGTTGGTAAGTTGCTTTCTTCTCGGAAAACGAATGTTGAGGCTTTTCGGAAAGTGCTCATTTCAGTTTGGAATGTCCATAGTTCTACTCGAATTGAGGCGCTGG
GTGATAATTTCTTTGTTATTCGGTTTTGTTCTTTGTCTGAGAAGCGCCAGATTATGAATGGGGGCCCTTGGTCTTTTGATAAATCATTGTTGGCTCTTGTCTCTCCTCGT
GGAATTGATGATCCTTTGCTGTTGGATTTTACTTATAACGCATTTTGGGTCCAGATTTTAAATGTTTCGTTCCACTGCTTGACTTCGACCATGGCTCGTCGGCTGGGCTC
TCTGGTGGGTATGGTTGAAGAGGTTCATGGTGAGGGACATGGAGAATGGCTGGGTTCGGTTATGCATGTTCGTGTTCTAATTGATTTGTCAAAACCTCTCCGTCGTGGGG
TGAGGTTGAAGATTAGTGAGAATAAGTACCAATGGTGCCCTGTTGGGTATGAAAAACTGCCTGATTTTTGTTACCAATGTGGTCGGATTGGGCATTTGCATCGTGATTGT
GAACTTGTTTTGTCTGAGTTGGGTGCTGCTTCTCCTTTGCAATATGGGGAATGGATGAGGGCTATATCAATGAGACAGATTTCTGGTCTATATGGAACTTCTGATGATTT
ATCTAGAAGCAGAGGGATGCGAGGACGTCGTGGCTCTGATGATGATTCACCTCCTGAGCATCTGGATATCCCACATTCTGCACAAAGAGAGGAAAGTATACCACTGCTGC
CTGAGGTGGGTAAATGTTTCAAGGGTAAAACGGTTGTGGAACCACAACAGATGGGTGTGTCTATTACGGATAGTCTGGCGGGTGTGGGTTCTAACTTATCTCCGGTGTCT
ATTGCTGCTAGGAAGAACTGGAAGCGGCTGGCTCGTGGTGCACTTATGGATGTTACGAACTCTGTTTTGGATACAGGGTCTTCTTTTGGTAAAAGATCGGGCTCAGAGAA
TCTGGAGGGTGAGAATGTTACTATGAGTAAGAAGAAGGTTAAGGTTGGGTCTAATGTATATACTGCACAGAATGAGGCGGAGGCTGGCTGCCAGCCCCGCCGGGTGCCAT
GA
mRNA sequence
ATGGACGATTTGGCTGTGCTCTGGGAGCGTATGGTGTTGACTGAGGAAGAAACGATGGTTGTTGTCGTGGAGGAGGGTTTGTCGTTGCTGTCGGAGGCCACAGTCCAGGT
ATGTGCTGTTGGTAAGTTGCTTTCTTCTCGGAAAACGAATGTTGAGGCTTTTCGGAAAGTGCTCATTTCAGTTTGGAATGTCCATAGTTCTACTCGAATTGAGGCGCTGG
GTGATAATTTCTTTGTTATTCGGTTTTGTTCTTTGTCTGAGAAGCGCCAGATTATGAATGGGGGCCCTTGGTCTTTTGATAAATCATTGTTGGCTCTTGTCTCTCCTCGT
GGAATTGATGATCCTTTGCTGTTGGATTTTACTTATAACGCATTTTGGGTCCAGATTTTAAATGTTTCGTTCCACTGCTTGACTTCGACCATGGCTCGTCGGCTGGGCTC
TCTGGTGGGTATGGTTGAAGAGGTTCATGGTGAGGGACATGGAGAATGGCTGGGTTCGGTTATGCATGTTCGTGTTCTAATTGATTTGTCAAAACCTCTCCGTCGTGGGG
TGAGGTTGAAGATTAGTGAGAATAAGTACCAATGGTGCCCTGTTGGGTATGAAAAACTGCCTGATTTTTGTTACCAATGTGGTCGGATTGGGCATTTGCATCGTGATTGT
GAACTTGTTTTGTCTGAGTTGGGTGCTGCTTCTCCTTTGCAATATGGGGAATGGATGAGGGCTATATCAATGAGACAGATTTCTGGTCTATATGGAACTTCTGATGATTT
ATCTAGAAGCAGAGGGATGCGAGGACGTCGTGGCTCTGATGATGATTCACCTCCTGAGCATCTGGATATCCCACATTCTGCACAAAGAGAGGAAAGTATACCACTGCTGC
CTGAGGTGGGTAAATGTTTCAAGGGTAAAACGGTTGTGGAACCACAACAGATGGGTGTGTCTATTACGGATAGTCTGGCGGGTGTGGGTTCTAACTTATCTCCGGTGTCT
ATTGCTGCTAGGAAGAACTGGAAGCGGCTGGCTCGTGGTGCACTTATGGATGTTACGAACTCTGTTTTGGATACAGGGTCTTCTTTTGGTAAAAGATCGGGCTCAGAGAA
TCTGGAGGGTGAGAATGTTACTATGAGTAAGAAGAAGGTTAAGGTTGGGTCTAATGTATATACTGCACAGAATGAGGCGGAGGCTGGCTGCCAGCCCCGCCGGGTGCCAT
GA
Protein sequence
MDDLAVLWERMVLTEEETMVVVVEEGLSLLSEATVQVCAVGKLLSSRKTNVEAFRKVLISVWNVHSSTRIEALGDNFFVIRFCSLSEKRQIMNGGPWSFDKSLLALVSPR
GIDDPLLLDFTYNAFWVQILNVSFHCLTSTMARRLGSLVGMVEEVHGEGHGEWLGSVMHVRVLIDLSKPLRRGVRLKISENKYQWCPVGYEKLPDFCYQCGRIGHLHRDC
ELVLSELGAASPLQYGEWMRAISMRQISGLYGTSDDLSRSRGMRGRRGSDDDSPPEHLDIPHSAQREESIPLLPEVGKCFKGKTVVEPQQMGVSITDSLAGVGSNLSPVS
IAARKNWKRLARGALMDVTNSVLDTGSSFGKRSGSENLEGENVTMSKKKVKVGSNVYTAQNEAEAGCQPRRVP
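
The InterPro annotation IPR025836 (Zinc knuckle CX2CX4HX4C) names a cysteine/histidine spacing pattern that can be searched for directly in the protein sequence above. The sketch below is a literal regular-expression reading of that name, not the profile InterPro actually uses to assign the domain, and it scans only an excerpt of the protein copied from the sequence above.

```python
import re

# Literal reading of "CX2CX4HX4C": Cys, 2 any, Cys, 4 any, His, 4 any, Cys.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

# Excerpt of the Lag0028688 protein sequence (not the full sequence).
fragment = "KYQWCPVGYEKLPDFCYQCGRIGHLHRDCELVLSELGAASPLQYGEWMRA"

match = ZINC_KNUCKLE.search(fragment)
if match:
    print(match.group(), "at offset", match.start(), "of the excerpt")
else:
    print("no CX2CX4HX4C pattern found")
```

The pattern matches the stretch `CYQCGRIGHLHRDC`, which agrees with the CCHC-type zinc finger annotation and the zinc ion binding GO term.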