CuGenDBv2

Lag0010039 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010039
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr9:44185758..44187383
RNA-Seq Expression: Lag0010039
Synteny: Lag0010039
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
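
The genome location field can be split into its parts to get the length of the annotated span. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr9:44185758..44187383")
span = end - start + 1  # inclusive span length in bp
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the span is 1626 bp.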


Homology
GenBank top hits (e-value, % identity, alignment)
KAF5469007.1 hypothetical protein F2P56_013112 [Juglans regia] | e-value: 2.6e-36 | % identity: 34.73
Query:  LTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKA
        L +Q   L++T++E  +V  +D + + + +   EN ++ +++T KH N E+FK+ + K+W   + V ++    +     F+ +++K KVL   PW  DK 
Subjt:  LTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKA

Query:  LIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFE
        L++ +E  G  +   L   +  FWV   DLP +  +    + +G+ +G F  VDL K E+E G+ +RVRV  DI  PL+R   +++G +    WV + +E
Subjt:  LIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFE

Query:  KLPEFCYGCGLIGHLARECQDNEAVNR--DNLEYGAWLK
        +LP+FCY CGL+GH  REC +  +VN   D L YGAWL+
Subjt:  KLPEFCYGCGLIGHLARECQDNEAVNR--DNLEYGAWLK

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] | e-value: 5.8e-36 | % identity: 32.44
Query:  LTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKA
        ++++  KL++  ++ G +  +      +  Q +   ++ K +T K IN E FK  I  IW  + +VT++  G NIF+  F +  ++K++LEG PW+ DK 
Subjt:  LTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKA

Query:  LIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFE
        L+V  E  G+E+ + LQFRY  FW+   +LP  C +R+    LG  VG  + +D  +    VG  +R+RV  D+  PL+RG  + +G + +   V + +E
Subjt:  LIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFE

Query:  KLPEFCYGCGLIGHLARECQDN--EAVNRDNLEYGAWLKKEPSHRYKFGNRKEFGKKDNERG
        +LP FCY CG IGHL R+C  N  E  +  + ++G W++     R K    K+   + +  G
Subjt:  KLPEFCYGCGLIGHLARECQDN--EAVNRDNLEYGAWLKKEPSHRYKFGNRKEFGKKDNERG

XP_028071384.1 uncharacterized protein LOC114273772 [Camellia sinensis] | e-value: 9.3e-34 | % identity: 34.31
Query:  LNITKEERGKV-TGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEE
        L++T EE   V  G D   L     +M   +V K++T +  N E  K  +  +W   + + V+  G+N+F   F  + +K+++L   PW  DK L++  E
Subjt:  LNITKEERGKV-TGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEE

Query:  IKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEV--GDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPE
        +  N + S +Q  +  FWVH  +LP I  ++K  + +GNAVG F  +D+D E+  +  G ++ +RV  D++KPLRRG  + + ++ E  WVD ++E+LP 
Subjt:  IKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEV--GDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPE

Query:  FCYGCGLIGHLAREC----QDNEAVNRDNLEYGAWLKKE
        +CY CG +GH  REC       +    D+L+YGAWL+ +
Subjt:  FCYGCGLIGHLAREC----QDNEAVNRDNLEYGAWLKKE

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] | e-value: 6.4e-35 | % identity: 35.56
Query:  LNITKEERGKV-TGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEE
        L++T EE   V  G +   L     +M   +V K++T +  N E  K  +  +W   + + V+  G+N+F   F  + +K++VL   PW  DK L++  E
Subjt:  LNITKEERGKV-TGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEE

Query:  IKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEV--GDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPE
        +  N + S +Q     FWVH  +LP +  ++K  E +GNAVG F  +D+D E+  +  G ++R+RV  D++KPLRRG  + + ++AE  WVD ++E+LP 
Subjt:  IKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEV--GDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPE

Query:  FCYGCGLIGHLARECQDN----EAVNRDNLEYGAWLKKE
        +CY CG +GH  REC D     +    D+L+YGAWL+ +
Subjt:  FCYGCGLIGHLARECQDN----EAVNRDNLEYGAWLKKE

XP_042988686.1 uncharacterized protein LOC122316216 [Carya illinoinensis] | e-value: 1.9e-34 | % identity: 36.07
Query:  EISLTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIH
        E  L     +L++T++E   V  ++  +L +        +V  + TEKH N E FK  + + W + R V  +    NIF   F+ +R+K+KVL   PW  
Subjt:  EISLTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIH

Query:  DKALIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDM
        DK L++ +E+ GN++  +++ R ASFWV   DLP    + +    +G  +G    VDLD  E   GD LRVRV  DI+KPL RGT   +G   +  WV  
Subjt:  DKALIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDM

Query:  RFEKLPEFCYGCGLIGHLARECQ-DNEAV---NRDNLEYGAWLK
         +E+L  FC+ CG +GH  REC+   +AV   + D+  YG+WL+
Subjt:  RFEKLPEFCYGCGLIGHLARECQ-DNEAV---NRDNLEYGAWLK

TrEMBL top hits (e-value, % identity, alignment)
A0A5C7H9Y2 CCHC-type domain-containing protein | e-value: 2.8e-36 | % identity: 32.44
Query:  LTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKA
        ++++  KL++  ++ G +  +      +  Q +   ++ K +T K IN E FK  I  IW  + +VT++  G NIF+  F +  ++K++LEG PW+ DK 
Subjt:  LTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKA

Query:  LIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFE
        L+V  E  G+E+ + LQFRY  FW+   +LP  C +R+    LG  VG  + +D  +    VG  +R+RV  D+  PL+RG  + +G + +   V + +E
Subjt:  LIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFE

Query:  KLPEFCYGCGLIGHLARECQDN--EAVNRDNLEYGAWLKKEPSHRYKFGNRKEFGKKDNERG
        +LP FCY CG IGHL R+C  N  E  +  + ++G W++     R K    K+   + +  G
Subjt:  KLPEFCYGCGLIGHLARECQDN--EAVNRDNLEYGAWLKKEPSHRYKFGNRKEFGKKDNERG

A0A5C7IW83 CCHC-type domain-containing protein | e-value: 7.6e-34 | % identity: 33.33
Query:  EERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEEIKGNER
        EE G V    + E+    ++++  +V K++T K +N E F+ +I +IW+   +V V+   +NIF   F    ++ +V +  PW   K+LIV E+ KG   
Subjt:  EERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEEIKGNER

Query:  YSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPEFCYGCGLI
        YS+L F  A+FWV   D P IC +R+ A+ +   +G    +  D +E   G  +RV+V  DI KPLRR   +++G   E   V +++E+LPEFCY CG +
Subjt:  YSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPEFCYGCGLI

Query:  GHLARECQD----NEAVNRDNLEYGAWLKKEPSHR-YKFGNRKEFGKKDNERGKMGNKHEQEGRPAKQTGNGS
        GH   EC D     +A+     +YGAWLK     + Y   N + +G   +         E EG  +     GS
Subjt:  GHLARECQD----NEAVNRDNLEYGAWLKKEPSHR-YKFGNRKEFGKKDNERGKMGNKHEQEGRPAKQTGNGS

A0A6J1BSZ1 uncharacterized protein LOC111005481 | e-value: 4.2e-32 | % identity: 31.54
Query:  SLTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKV-TVKKAGENIFECTFDSMREKKKVLEGSPWIHD
        +L ++     +T EE      +D   L    + +E  ++CK+++++ I+  + K  +   W ++ K  +V   G NIF   F+   ++ ++L   PW  D
Subjt:  SLTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKV-TVKKAGENIFECTFDSMREKKKVLEGSPWIHD

Query:  KALIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMR
        +ALI+ +      +   + FR  S WVHF DL   C ++  A  LGNA+G FE V+ +   +  G  LRVRV+FD+ KPL RG  + +       W+ ++
Subjt:  KALIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMR

Query:  FEKLPEFCYGCGLIGHLARECQD--NEAVNRDNLEYGAWLK
        +E+LP+F Y CG + H+ ++C D   ++V++ NL+YG WL+
Subjt:  FEKLPEFCYGCGLIGHLARECQD--NEAVNRDNLEYGAWLK

A0A6J1DU55 uncharacterized protein LOC111023135 | e-value: 1.3e-33 | % identity: 31.13
Query:  KLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEE
        K  +T EE      +D D ++   Q +   +V K++ ++ I+ ++   ++   W +E ++TV+  G+N+F   F    +  +V++  PW  DKALIV ++
Subjt:  KLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEE

Query:  IKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPEFC
           ++  S L+F   +FW+H  DLP    ++  A  LGNA+G+F  VD +++ +  G SLR+RV  DI KPLRRG  I I       W+ +++E+LP+FC
Subjt:  IKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPEFC

Query:  YGCGLIGHLARECQDNEAVNRDN----LEYGAWLKKEPSHRYKFGNRKEFGKKDNERGKMGNKHEQEGRPAKQTGNGSEGRREKERKNSERKEPEKTDTS
        Y CG+IGH + +C       +D+     EYG WL+   S            K   ++G+ G        PA++   GS     KER   E K+     T+
Subjt:  YGCGLIGHLARECQDNEAVNRDN----LEYGAWLKKEPSHRYKFGNRKEFGKKDNERGKMGNKHEQEGRPAKQTGNGSEGRREKERKNSERKEPEKTDTS

Query:  PE
         +
Subjt:  PE

A0A7N2R0C3 Reverse transcriptase domain-containing protein | e-value: 1.6e-31 | % identity: 28.27
Query:  KLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEE
        +L +T+EE   +  + D+ +R  V+  +  +  K+M+ K +  E  ++ +  +W   + + +   GE +F   F+  R+K++V++  PW ++K L++F+E
Subjt:  KLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREKKKVLEGSPWIHDKALIVFEE

Query:  IKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPEFC
         +G+E    +  +++ FWV   +LP    +++  + +G ++G F  VD+++   + G  LRVRV+ D+ + L RG  I +    E  WV  ++E+LP FC
Subjt:  IKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVDMRFEKLPEFC

Query:  YGCGLIGHLARECQDNEAVNR----DNLEYGAWLKKEPSHR----YKFGNRKEFGKKDNERGKMGNKHEQEGRPAKQTGNGSE
        Y CGL+ H  ++C +    ++     +L+YGAWL+ EP  +    + F  +K  G+  N+        E++GR   Q G   E
Subjt:  YGCGLIGHLARECQDNEAVNR----DNLEYGAWLKKEPSHR----YKFGNRKEFGKKDNERGKMGNKHEQEGRPAKQTGNGSE

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAAGGAAAAGAACTGGAGGAAAACAAGAAGTTAGAGGAAACAAACGGAAATCAACCAACGGAAATATCTCTGACTAAGCAAATGTTGAAGCTTAATATCACAAAGGA
AGAAAGAGGTAAGGTTACAGGTATGGACGACGACGAATTACGGCAGAAAGTCCAAGAGATGGAAAACGTCATGGTATGCAAGATTATGACGGAAAAACATATTAATCCAG
AAATCTTTAAGGAAATGATCCCGAAGATATGGAACATGGAAAGGAAAGTTACGGTAAAGAAGGCAGGAGAAAACATCTTTGAATGTACTTTTGATTCTATGCGTGAGAAA
AAGAAAGTGTTAGAAGGAAGCCCCTGGATACATGACAAAGCTCTCATTGTCTTTGAGGAAATCAAAGGAAATGAAAGATATTCAAGACTCCAATTCAGGTATGCCTCGTT
TTGGGTTCATTTTGTTGATTTACCAAGAATCTGCTTCAGTAGGAAATGGGCTGAAGATCTGGGGAATGCAGTTGGAAGTTTTGAAAGAGTAGATTTGGACAAAGAGGAAT
ACGAAGTTGGAGATTCTTTGCGGGTAAGAGTTAAGTTTGATATTAAAAAACCATTACGAAGGGGGACAGTGATTAGAATAGGAACGAACGCAGAAGAAGAATGGGTGGAT
ATGAGATTTGAGAAGCTTCCGGAGTTCTGTTATGGATGCGGCCTGATTGGACACCTAGCTCGAGAATGTCAAGATAATGAAGCTGTAAACAGGGACAATCTGGAATATGG
GGCTTGGCTAAAGAAGGAGCCCAGTCATAGGTACAAATTTGGAAATAGAAAAGAATTTGGGAAAAAAGACAATGAAAGGGGAAAAATGGGGAATAAACACGAGCAAGAAG
GAAGGCCAGCGAAACAAACCGGTAACGGGTCAGAAGGGAGAAGAGAAAAAGAAAGGAAGAACTCGGAAAGAAAAGAACCGGAGAAGACAGATACCTCGCCGGAAAAAATG
ACCATTGAGCTACCGGAGGAACCAGTGCATACGACAAAAAAAACGGCCAGCCCAAAAATCCCAGCCTACAAGGAGATAATCATGGAAGACCAAAATCCAAGGATGTCAGA
AATGGAAAAAGAGGAAATAGGAAAAGAAGCAATGGGAAGCGGTGGGGAATCGACTATTACCCACCAGATAAAAGATATTAAAGGGAAAGGAGTTCAAATATCAGAGGGAT
ATGGCCAAACAATAAGGCCCAGGCGTGAAGGCCTAAATCGTGACCTGGGCCTAAAAATTAGAGAAAGTCATTCAAAAGAAAATGAGCAAAAGGAAATGGAAAAAGCCTTT
AGGCAACACAGTGAAGAAAAGGGATTAAATGGTCGTAAAGGGATGGAAGCCAGCGAGAGACCCATGGAAATCAAGGTGGAGAAACCAAATGAACAAATCCAAGGTAAATC
GTGGAAAAGGAGAGCTAGGGAAGCCTTAAGTAAAAGTTCTCAAACAAACGAATCTCAGACTAAGGGCACCAGCAGGAAGCATGAAAGAGAAGAGGAAATTGAGGAAAGTG
GAAGAAAAAAAATGTGTGTTGAATACTTCGGGAAACCCGTTGGGATATCGGCGGAGGCTGAAATTCAGCCCCGCCGGACGCCATGA
mRNA sequence
ATGGAAGGAAAAGAACTGGAGGAAAACAAGAAGTTAGAGGAAACAAACGGAAATCAACCAACGGAAATATCTCTGACTAAGCAAATGTTGAAGCTTAATATCACAAAGGA
AGAAAGAGGTAAGGTTACAGGTATGGACGACGACGAATTACGGCAGAAAGTCCAAGAGATGGAAAACGTCATGGTATGCAAGATTATGACGGAAAAACATATTAATCCAG
AAATCTTTAAGGAAATGATCCCGAAGATATGGAACATGGAAAGGAAAGTTACGGTAAAGAAGGCAGGAGAAAACATCTTTGAATGTACTTTTGATTCTATGCGTGAGAAA
AAGAAAGTGTTAGAAGGAAGCCCCTGGATACATGACAAAGCTCTCATTGTCTTTGAGGAAATCAAAGGAAATGAAAGATATTCAAGACTCCAATTCAGGTATGCCTCGTT
TTGGGTTCATTTTGTTGATTTACCAAGAATCTGCTTCAGTAGGAAATGGGCTGAAGATCTGGGGAATGCAGTTGGAAGTTTTGAAAGAGTAGATTTGGACAAAGAGGAAT
ACGAAGTTGGAGATTCTTTGCGGGTAAGAGTTAAGTTTGATATTAAAAAACCATTACGAAGGGGGACAGTGATTAGAATAGGAACGAACGCAGAAGAAGAATGGGTGGAT
ATGAGATTTGAGAAGCTTCCGGAGTTCTGTTATGGATGCGGCCTGATTGGACACCTAGCTCGAGAATGTCAAGATAATGAAGCTGTAAACAGGGACAATCTGGAATATGG
GGCTTGGCTAAAGAAGGAGCCCAGTCATAGGTACAAATTTGGAAATAGAAAAGAATTTGGGAAAAAAGACAATGAAAGGGGAAAAATGGGGAATAAACACGAGCAAGAAG
GAAGGCCAGCGAAACAAACCGGTAACGGGTCAGAAGGGAGAAGAGAAAAAGAAAGGAAGAACTCGGAAAGAAAAGAACCGGAGAAGACAGATACCTCGCCGGAAAAAATG
ACCATTGAGCTACCGGAGGAACCAGTGCATACGACAAAAAAAACGGCCAGCCCAAAAATCCCAGCCTACAAGGAGATAATCATGGAAGACCAAAATCCAAGGATGTCAGA
AATGGAAAAAGAGGAAATAGGAAAAGAAGCAATGGGAAGCGGTGGGGAATCGACTATTACCCACCAGATAAAAGATATTAAAGGGAAAGGAGTTCAAATATCAGAGGGAT
ATGGCCAAACAATAAGGCCCAGGCGTGAAGGCCTAAATCGTGACCTGGGCCTAAAAATTAGAGAAAGTCATTCAAAAGAAAATGAGCAAAAGGAAATGGAAAAAGCCTTT
AGGCAACACAGTGAAGAAAAGGGATTAAATGGTCGTAAAGGGATGGAAGCCAGCGAGAGACCCATGGAAATCAAGGTGGAGAAACCAAATGAACAAATCCAAGGTAAATC
GTGGAAAAGGAGAGCTAGGGAAGCCTTAAGTAAAAGTTCTCAAACAAACGAATCTCAGACTAAGGGCACCAGCAGGAAGCATGAAAGAGAAGAGGAAATTGAGGAAAGTG
GAAGAAAAAAAATGTGTGTTGAATACTTCGGGAAACCCGTTGGGATATCGGCGGAGGCTGAAATTCAGCCCCGCCGGACGCCATGA
Protein sequence
MEGKELEENKKLEETNGNQPTEISLTKQMLKLNITKEERGKVTGMDDDELRQKVQEMENVMVCKIMTEKHINPEIFKEMIPKIWNMERKVTVKKAGENIFECTFDSMREK
KKVLEGSPWIHDKALIVFEEIKGNERYSRLQFRYASFWVHFVDLPRICFSRKWAEDLGNAVGSFERVDLDKEEYEVGDSLRVRVKFDIKKPLRRGTVIRIGTNAEEEWVD
MRFEKLPEFCYGCGLIGHLARECQDNEAVNRDNLEYGAWLKKEPSHRYKFGNRKEFGKKDNERGKMGNKHEQEGRPAKQTGNGSEGRREKERKNSERKEPEKTDTSPEKM
TIELPEEPVHTTKKTASPKIPAYKEIIMEDQNPRMSEMEKEEIGKEAMGSGGESTITHQIKDIKGKGVQISEGYGQTIRPRREGLNRDLGLKIRESHSKENEQKEMEKAF
RQHSEEKGLNGRKGMEASERPMEIKVEKPNEQIQGKSWKRRAREALSKSSQTNESQTKGTSRKHEREEEIEESGRKKMCVEYFGKPVGISAEAEIQPRRTP
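
The InterPro entry "Zinc knuckle CX2CX4HX4C" names a residue spacing that can be looked for directly in the protein sequence. The sketch below is illustrative only: it reads the pattern name as a plain regular expression (an assumption, not InterPro's actual profile-based method) and scans a fragment copied from the start of the third line of the protein sequence above.

```python
import re

# Fragment copied from the protein sequence of this record.
fragment = "MRFEKLPEFCYGCGLIGHLARECQDNEAVNRDNLEYGAWLKK"

# CX2CX4HX4C read literally: C, any 2 residues, C, any 4, H, any 4, C.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

# Offsets are 0-based positions within the fragment, not within the full protein.
hits = [(m.start(), m.group()) for m in ZINC_KNUCKLE.finditer(fragment)]
print(hits)
```

The single match, CYGCGLIGHLAREC, lies in the region that aligns to the conserved C..C....H....C residues in the homology alignments above.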