; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g17530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g17530
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr1:11954230..11963378
RNA-Seq ExpressionMoc01g17530
SyntenyMoc01g17530
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0044267 - cellular protein metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8682180.1 Transcription initiation factor IIA subunit 2 [Hibiscus syriacus]1.3e-6156.79Show/hide
Query:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT
        KALKG+P S S  ++    S S+G K  S MS+++WEE+D+RAAS IR+  AKN+LA+V   S  KELWEKLE +YQAK +SNRLY  E+FH L+MEEGT
Subjt:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT

Query:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC
        KISDHLS LN I+ ELE I V+IDDEDKALRLI SL  SYEHM+ +LMYGK+ +NF+E TSKL+SEERRLK+      E  AL     +KK    +KK  
Subjt:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC

Query:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL
        CWGCGQ GH+KKDC N   +S+ G   DA +V +   +DD+F+
Subjt:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL

KAE8717380.1 hypothetical protein F3Y22_tig00110050pilonHSYRG00143 [Hibiscus syriacus]1.5e-6257.2Show/hide
Query:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT
        KALKG+P S S  ++    S S+G K  S MS+++WEE+D+RAAS IR+  AKN+LA+V   S  KELWEKLE +YQAK +SNRLY  E+FH L+MEEGT
Subjt:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT

Query:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC
        KISDHLS LN I+ ELE I V+IDDEDKALRLI SLP SYEHM+ +LMYGK+ +NF+E TSKL+SEERRLK+      E  AL     +KK    +KK  
Subjt:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC

Query:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL
        CWGCGQ GH+KKDC N   +S+ G   DA +V +   +DD+F+
Subjt:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL

KAF5758504.1 putative RNA-directed DNA polymerase [Helianthus annuus]6.3e-6458.9Show/hide
Query:  KALKGRPSESASEKLSGDSSSTGSKNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGTKI
        KAL+G+P+  +S+  SG S           D++WE++DLRAASAIR+  AKN+LA+VH IS  K+LWEKLE LYQ KGISNRLY  EQFHTLRM+  TKI
Subjt:  KALKGRPSESASEKLSGDSSSTGSKNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGTKI

Query:  SDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKACCW
        SDHLS LN+I+ ELE I VK++DEDKALRLILSL  SYEHMKPILMYGK+TL +A+ T KLLSEE+RL S G T  E + L+  N KKK    QK   CW
Subjt:  SDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKACCW

Query:  GCGQSGHMKKDCPNRVDSSKGPGLDADSVSLIRGDD
         CGQSGH+K++CP   DS+      A++V+++ GDD
Subjt:  GCGQSGHMKKDCPNRVDSSKGPGLDADSVSLIRGDD

KAF5765959.1 putative RNA-directed DNA polymerase [Helianthus annuus]2.4e-6358.47Show/hide
Query:  KALKGRPSESASEKLSGDSSSTGSKNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGTKI
        KAL+G+P+  +S+  SG S           D++WE++DLRAASAIR+  AKN+LA+VH IS  K+LWEKLE LYQ KGI NRLY  EQFHTLRM+  TKI
Subjt:  KALKGRPSESASEKLSGDSSSTGSKNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGTKI

Query:  SDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKACCW
        SDHLS LN+I+ ELE I VK++DEDKALRLILSL  SYEHMKPILMYGK+TL +A+ T KLLSEE+RL S G T  E + L+  N KKK    QK   CW
Subjt:  SDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKACCW

Query:  GCGQSGHMKKDCPNRVDSSKGPGLDADSVSLIRGDD
         CGQSGH+K++CP   DS+      A++V+++ GDD
Subjt:  GCGQSGHMKKDCPNRVDSSKGPGLDADSVSLIRGDD

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia]4.0e-9585.65Show/hide
Query:  KALKGRPSESASEKLSGD--------SSSTGSKNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTL
        KALKGRPSE ASEKLS D         SS GSK SSMS +DWEEMDLRAASAIR   AKNILA+VH IS  KELWEKLEALYQAKGISNRLY  EQFHTL
Subjt:  KALKGRPSESASEKLSGD--------SSSTGSKNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTL

Query:  RMEEGTKISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKAS
        +MEEG KISDHLSNLNSIIFELE IEVKIDDEDKALRLILSLP SYEHMKPILMYGKDTLNFAE TSKLLSEERRLKSEGRT HEDSALV SNWKKKK S
Subjt:  RMEEGTKISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKAS

Query:  VQKKACCWGCGQSGHMKKDCPNR
        VQKKACCWGCGQSGHMKKDCPNR
Subjt:  VQKKACCWGCGQSGHMKKDCPNR

TrEMBL top hitse value%identityAlignment
A0A6A2YS90 Transcription initiation factor IIA subunit 26.3e-6256.79Show/hide
Query:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT
        KALKG+P S S  ++    S S+G K  S MS+++WEE+D+RAAS IR+  AKN+LA+V   S  KELWEKLE +YQAK +SNRLY  E+FH L+MEEGT
Subjt:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT

Query:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC
        KISDHLS LN I+ ELE I V+IDDEDKALRLI SL  SYEHM+ +LMYGK+ +NF+E TSKL+SEERRLK+      E  AL     +KK    +KK  
Subjt:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC

Query:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL
        CWGCGQ GH+KKDC N   +S+ G   DA +V +   +DD+F+
Subjt:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL

A0A6A3BK59 CCHC-type domain-containing protein7.5e-6357.2Show/hide
Query:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT
        KALKG+P S S  ++    S S+G K  S MS+++WEE+D+RAAS IR+  AKN+LA+V   S  KELWEKLE +YQAK +SNRLY  E+FH L+MEEGT
Subjt:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT

Query:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC
        KISDHLS LN I+ ELE I V+IDDEDKALRLI SLP SYEHM+ +LMYGK+ +NF+E TSKL+SEERRLK+      E  AL     +KK    +KK  
Subjt:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC

Query:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL
        CWGCGQ GH+KKDC N   +S+ G   DA +V +   +DD+F+
Subjt:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL

A0A6A3CWI3 CCHC-type domain-containing protein6.3e-6256.79Show/hide
Query:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT
        KALKG+P S S  ++    S S+G K  S MS+++WEE+D+RAAS IR+  AKN+LA+V   S  KELWEKLE +YQAK +SNRLY  E+FH L+MEEGT
Subjt:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT

Query:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC
        KISDHLS LN I+ ELE I V IDDEDKALRLI SLP SYEHM+ +LMYGK+ +NF+E TSKL+SEERRLK+      E  AL     +KK    +KK  
Subjt:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC

Query:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL
        CWGCGQ GH+KKDC N   + + G   DA +V +   +DD+F+
Subjt:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQFL

A0A6A3DA47 CCHC-type domain-containing protein1.8e-6156.61Show/hide
Query:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT
        KALKG+P S    ++    S S+G K  S MS+++WEE+D+RAAS IR+  AKN+LA+V   S  KELWEKLE +YQAK +SNRLY  E+FH L+MEEGT
Subjt:  KALKGRP-SESASEKLSGDSSSTGSK-NSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGT

Query:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC
        KISDHLS LN I+ ELE I V+IDDEDKALRLI SLP SYEHM+ +LMYGK+ +NF+E TSKL+SEERRLK+      E  AL     +KK    ++K  
Subjt:  KISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKAC

Query:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQF
        CWGCGQ GH+KKDC N   +S+ G   DA +V +   +DD+F
Subjt:  CWGCGQSGHMKKDCPN-RVDSSKGPGLDADSVSLIRGDDDQF

A0A6J1CG82 uncharacterized protein LOC1110105211.9e-9585.65Show/hide
Query:  KALKGRPSESASEKLSGD--------SSSTGSKNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTL
        KALKGRPSE ASEKLS D         SS GSK SSMS +DWEEMDLRAASAIR   AKNILA+VH IS  KELWEKLEALYQAKGISNRLY  EQFHTL
Subjt:  KALKGRPSESASEKLSGD--------SSSTGSKNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTL

Query:  RMEEGTKISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKAS
        +MEEG KISDHLSNLNSIIFELE IEVKIDDEDKALRLILSLP SYEHMKPILMYGKDTLNFAE TSKLLSEERRLKSEGRT HEDSALV SNWKKKK S
Subjt:  RMEEGTKISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLKSEGRTIHEDSALVASNWKKKKAS

Query:  VQKKACCWGCGQSGHMKKDCPNR
        VQKKACCWGCGQSGHMKKDCPNR
Subjt:  VQKKACCWGCGQSGHMKKDCPNR

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.5e-0922.89Show/hide
Query:  DKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGTKISDHLSNLNSIIFELELIEVKIDDEDKALRL
        D  W++ +  A S I    + + L         +++ E L+A+Y+ K ++++L   ++  +L++     +  H    + +I EL     KI++ DK   L
Subjt:  DKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGTKISDHLSNLNSIIFELELIEVKIDDEDKALRL

Query:  ILSLPPSYEH-MKPILMYGKDTLNFAEATSKLLSEERRLKSE---------GRTIHEDSALVASNWKKKKASVQK---------KACCWGCGQSGHMKKD
        +++LP  Y+  +  I    ++ L  A   ++LL +E ++K++            +H ++    +N  K + +  K         K  C  CG+ GH+KKD
Subjt:  ILSLPPSYEH-MKPILMYGKDTLNFAEATSKLLSEERRLKSE---------GRTIHEDSALVASNWKKKKASVQK---------KACCWGCGQSGHMKKD

Query:  C
        C
Subjt:  C

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-2931.62Show/hide
Query:  KNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGTKISDHLSNLNSIIFELELIEVKIDDE
        K  +M  +DW ++D RAASAIR+  + +++ ++ +    + +W +LE+LY +K ++N+LY  +Q + L M EGT    HL+  N +I +L  + VKI++E
Subjt:  KNSSMSDKDWEEMDLRAASAIRIGFAKNILADVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGTKISDHLSNLNSIIFELELIEVKIDDE

Query:  DKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLK------------SEGRTIHEDS---ALVASNWKKKKASVQKKACCWGCGQSGHMK
        DKA+ L+ SLP SY+++   +++GK T+   + TS LL  E+  K              GR+    S       +  K K  S  +   C+ C Q GH K
Subjt:  DKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSKLLSEERRLK------------SEGRTIHEDS---ALVASNWKKKKASVQKKACCWGCGQSGHMK

Query:  KDCPN---RVDSSKGPGLDADSVSLIRGDDDQFL
        +DCPN       + G   D ++ ++++ +D+  L
Subjt:  KDCPN---RVDSSKGPGLDADSVSLIRGDDDQFL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAACAAAGGGTGGCCGTGGTCCAGCGGCGGAATGTGATGGTGCGGCCATGGCCATAGCCATGGGTCCCAATTCGCCATCTTCAGTATATCTTCTCCTCAAC
CATAAGATATCTTCTCCTTTCGCCTTTCTTTTGAATCAAAGTAGGGCGGGGGAAGCTACCGTTTATACGACAGCTCTGCTTCCTCGTCTTGCTTCTGGCAAAGCT
TCGACTTTGCTTGCTTGTGCAAAGCACGTTGGGATTCTTTTCCCTGGTTTCACTGATTGGCTTTTAGCTTATCGGCCTACTTCTAGGGGAGTAGGAGGTGTTGTG
ATGTGGGGAGTTACCCCACGTCACCCTTCTTCTTCTTCTCCAAGTGTTGGAGAAGAGTGCTACCACAAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGAT
CCGGTGGTGGTGTTCGAGGGGAATTCACTGAAGAAACGTTCTTCAAAGGCGTTGAAGGGAAGACCAAGTGAAAGTGCTTCTGAAAAGCTAAGCGGCGATAGTTCC
AGTACAGGTTCTAAGAACTCGAGCATGAGTGATAAAGATTGGGAGGAAATGGATTTGAGAGCTGCAAGCGCGATACGAATAGGTTTCGCTAAGAATATTCTGGCG
GATGTGCATGAAATTTCGATAGTGAAGGAACTTTGGGAGAAGCTCGAAGCATTGTATCAAGCAAAGGGCATCTCAAATCGGCTGTACCCGAATGAGCAGTTTCAC
ACGCTGCGAATGGAGGAAGGTACGAAAATTTCAGATCATCTGAGTAATCTCAATAGCATCATCTTTGAGCTGGAGTTGATCGAAGTGAAGATAGATGACGAAGAT
AAAGCACTCAGACTCATCTTGTCACTTCCACCTTCTTATGAACACATGAAGCCGATCTTGATGTATGGGAAAGATACTTTGAATTTTGCCGAGGCTACTAGTAAA
CTATTGTCAGAGGAAAGAAGGCTGAAGAGTGAAGGGCGTACTATACATGAAGATTCAGCACTGGTAGCTAGCAATTGGAAGAAGAAGAAAGCCTCCGTACAAAAG
AAAGCTTGTTGCTGGGGATGCGGACAGTCTGGACACATGAAGAAAGATTGTCCCAACAGAGTCGATTCGTCAAAAGGTCCTGGGTTGGATGCTGACAGTGTTTCT
CTTATCAGGGGAGACGATGATCAGTTCCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATAACAAAGGGTGGCCGTGGTCCAGCGGCGGAATGTGATGGTGCGGCCATGGCCATAGCCATGGGTCCCAATTCGCCATCTTCAGTATATCTTCTCCTCAAC
CATAAGATATCTTCTCCTTTCGCCTTTCTTTTGAATCAAAGTAGGGCGGGGGAAGCTACCGTTTATACGACAGCTCTGCTTCCTCGTCTTGCTTCTGGCAAAGCT
TCGACTTTGCTTGCTTGTGCAAAGCACGTTGGGATTCTTTTCCCTGGTTTCACTGATTGGCTTTTAGCTTATCGGCCTACTTCTAGGGGAGTAGGAGGTGTTGTG
ATGTGGGGAGTTACCCCACGTCACCCTTCTTCTTCTTCTCCAAGTGTTGGAGAAGAGTGCTACCACAAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGAT
CCGGTGGTGGTGTTCGAGGGGAATTCACTGAAGAAACGTTCTTCAAAGGCGTTGAAGGGAAGACCAAGTGAAAGTGCTTCTGAAAAGCTAAGCGGCGATAGTTCC
AGTACAGGTTCTAAGAACTCGAGCATGAGTGATAAAGATTGGGAGGAAATGGATTTGAGAGCTGCAAGCGCGATACGAATAGGTTTCGCTAAGAATATTCTGGCG
GATGTGCATGAAATTTCGATAGTGAAGGAACTTTGGGAGAAGCTCGAAGCATTGTATCAAGCAAAGGGCATCTCAAATCGGCTGTACCCGAATGAGCAGTTTCAC
ACGCTGCGAATGGAGGAAGGTACGAAAATTTCAGATCATCTGAGTAATCTCAATAGCATCATCTTTGAGCTGGAGTTGATCGAAGTGAAGATAGATGACGAAGAT
AAAGCACTCAGACTCATCTTGTCACTTCCACCTTCTTATGAACACATGAAGCCGATCTTGATGTATGGGAAAGATACTTTGAATTTTGCCGAGGCTACTAGTAAA
CTATTGTCAGAGGAAAGAAGGCTGAAGAGTGAAGGGCGTACTATACATGAAGATTCAGCACTGGTAGCTAGCAATTGGAAGAAGAAGAAAGCCTCCGTACAAAAG
AAAGCTTGTTGCTGGGGATGCGGACAGTCTGGACACATGAAGAAAGATTGTCCCAACAGAGTCGATTCGTCAAAAGGTCCTGGGTTGGATGCTGACAGTGTTTCT
CTTATCAGGGGAGACGATGATCAGTTCCTTTGA
Protein sequenceShow/hide protein sequence
MITKGGRGPAAECDGAAMAIAMGPNSPSSVYLLLNHKISSPFAFLLNQSRAGEATVYTTALLPRLASGKASTLLACAKHVGILFPGFTDWLLAYRPTSRGVGGVV
MWGVTPRHPSSSSPSVGEECYHKHDPETQEDSEEDPVVVFEGNSLKKRSSKALKGRPSESASEKLSGDSSSTGSKNSSMSDKDWEEMDLRAASAIRIGFAKNILA
DVHEISIVKELWEKLEALYQAKGISNRLYPNEQFHTLRMEEGTKISDHLSNLNSIIFELELIEVKIDDEDKALRLILSLPPSYEHMKPILMYGKDTLNFAEATSK
LLSEERRLKSEGRTIHEDSALVASNWKKKKASVQKKACCWGCGQSGHMKKDCPNRVDSSKGPGLDADSVSLIRGDDDQFL