CuGenDBv2

Moc02g15960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g15960
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CCHC-type domain-containing protein
Genome location: chr2:12007321..12026588
RNA-Seq Expression: Moc02g15960
Synteny: Moc02g15960
Gene Ontology terms:
GO:0006816 - calcium ion transport (biological process)
GO:0009987 - cellular process (biological process)
GO:0005743 - mitochondrial inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
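
The genome location above uses a chromosome:start..end notation. As a minimal sketch, the snippet below splits that string into its parts and computes the span of the locus. It assumes the coordinates are 1-based and inclusive, which this record does not state; the names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'chrom:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'chr2:12007321..12026588'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


loc = parse_location("chr2:12007321..12026588")
print(loc.chrom, loc.start, loc.end, loc.length)  # chr2 12007321 12026588 19268
```

Under that inclusive-coordinate assumption the locus spans 19,268 bp on chr2.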


Homology
GenBank top hits (e value, % identity, alignment)
RVW73244.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 1.4e-44; % identity: 37.5)
Query:  LSMVAQSSPEVADAPPLRQSIRVPENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEV-IMKCNMEEDVKKKKSLALKSTSFQGASESEE
        + M+ + +  V     L + I   E V KILRSLP  W  KVTAIQEAKDL+KLP+EEL+GSL+T+E+ + K   E + KKKKS+ALK+T+ +     EE
Subjt:  LSMVAQSSPEVADAPPLRQSIRVPENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEV-IMKCNMEEDVKKKKSLALKSTSFQGASESEE

Query:  ELNEEEH--AYLSKRFKKH-----FKKRYFPKKTN----NQDAKGEK---STRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE
        + +EE+   A ++++  K+     F+ + F  + N       + G+K     +  +IC++CKK+GH++ + PL +  + RR KKAM ATW ES+E S+ E
Subjt:  ELNEEEH--AYLSKRFKKH-----FKKRYFPKKTN----NQDAKGEK---STRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE

Query:  SGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEIKEEFEKLGMFEINKEQVVKHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIG----------
        + +EVAN+CFMA    DD DE   E+  F +                          +HMTGD +KF  L  + GGYVTFG+N +G+IIG          
Subjt:  SGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEIKEEFEKLGMFEINKEQVVKHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIG----------

Query:  KVSDELEMPFENLNFKDNNTEAVVNETQQVPEET---LPKDWSFSIHHPKELIIGDVSQG
        ++ D+ +      + K   +   +   QQV  E+   LPKDW F I+HP++ IIG+ S G
Subjt:  KVSDELEMPFENLNFKDNNTEAVVNETQQVPEET---LPKDWSFSIHHPKELIIGDVSQG

XP_022143648.1 uncharacterized protein LOC111013509 [Momordica charantia] (e value: 2.6e-91; % identity: 79.24)
Query:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKKRYFPK
        ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSL+THE++MK NMEEDVKKKKSLALKSTSFQ ASESEEELNEEE AYLSK+FKKHFKKR+FPK
Subjt:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKKRYFPK

Query:  KTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEI
        KTN+QDAKGEK+TRD+IICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESD  SDSESGEEVANLCFMAFGDEDDD+EVC  +      +D+    
Subjt:  KTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEI

Query:  KEEFEKLGMFEINKEQVVKHMTGDRTKFTPLEMKDG
                          KHMTGD TKFT LEMKDG
Subjt:  KEEFEKLGMFEINKEQVVKHMTGDRTKFTPLEMKDG

XP_022158792.1 uncharacterized protein LOC111025259 [Momordica charantia] (e value: 4.1e-44; % identity: 33.42)
Query:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFK-KRYFP
        E V+K+L SLPK WEPKVTAIQEAKDL  L ++ELIG                                    E  L+E++ AYLS+++K   K K+ F 
Subjt:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFK-KRYFP

Query:  KK-TNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAFGDEDD--DDEVCFENPSFDELMDA
        K  +N ++ K E S +D +ICYECKK GH+R++CP L KSS +  KKAMKATWD+SDE  +    EEVAN CFMA  D++D  DDE+  +  S+DEL +A
Subjt:  KK-TNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAFGDEDD--DDEVCFENPSFDELMDA

Query:  YNEIKEEFEKLGMFEINKEQVVK--------------------------------------------------------------------------HMT
        +  ++ + EKL   +  K+ + K                                                                          +MT
Subjt:  YNEIKEEFEKLGMFEINKEQVVK--------------------------------------------------------------------------HMT

Query:  GDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVVNETQQVPEETLPKDWSFSIHHPKELIIGDVSQGGIA
        GD++KF     KDGG+VTFG++K+ +  G    +L +  ++     +  E  +NE +     ++PK+W ++  HPK+LI+GD  QG IA
Subjt:  GDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVVNETQQVPEETLPKDWSFSIHHPKELIIGDVSQGGIA

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus] (e value: 1.3e-50; % identity: 58.29)
Query:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFK-KRYFP
        ENVRKILRSLPK+WE KVTAIQEAKDL+KLPLEELIGSL+THE+IMK ++E++ KKKKS+ALK+ S +   E E+ L+E++ AY S+++K   K K+YF 
Subjt:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFK-KRYFP

Query:  KKTNNQ-DAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAFGDEDD--DDEVCFENPSFDELMDA
        K  + Q ++KGEKS +D +ICYECK++GH+R++CPLL KSS +  KKAMKATWD+S E S+SE  EE+ANL  MA  D+DD  DD+V  E  S DEL + 
Subjt:  KKTNNQ-DAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAFGDEDD--DDEVCFENPSFDELMDA

Query:  YNEIKEEFEKL
        +  ++ + EKL
Subjt:  YNEIKEEFEKL

XP_038895919.1 uncharacterized protein LOC120084093 [Benincasa hispida] (e value: 2.4e-60; % identity: 69.5)
Query:  LRQSIRVPENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKH
        L +S    ENVRKILRSLPKSWE KVT IQEAKDLSKLPLEEL+GSL+ HE+IMK N+EEDVKKKK+L LKST  Q  SE+E ELN+EE AYL+K+FKKH
Subjt:  LRQSIRVPENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKH

Query:  FKKRYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAF-GDEDDDDEVCFENPSFD
        F+KR F KK NNQ+ KGEKS RD IICYECKK GHV  + P  RK+ S++++KAMKAT DESDE S+ ES E VANLC MAF  D+DDDDEV  EN +FD
Subjt:  FKKRYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAF-GDEDDDDEVCFENPSFD

TrEMBL top hits (e value, % identity, alignment)
A0A2N9H9V2 CCHC-type domain-containing protein (e value: 1.5e-52; % identity: 44.14)
Query:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNM-EEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKK----
        ENVRKILRSLPK WE K+TAI EA+DL  L LEEL GSL+T+E+ M   + EE+VK KK+ ALKS+     +  EE   EEE A +++ FKK  KK    
Subjt:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNM-EEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKK----

Query:  -RYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFD
         R FPKK  N   KGE S  +   CY+CKK GH ++ECP + K   +  KKA+K TWD+SDE SDS+   S  EVANLC + + +E +  E   E+ SF 
Subjt:  -RYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFD

Query:  ELMDAYNEIKEEFEKLGMFEINKEQVV----------------KHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVV
         L  A+N+ +   E L +     E  +                +HMTGD+ KFT L +KDGG V FG+N +GKIIG V D+++ P         + E  +
Subjt:  ELMDAYNEIKEEFEKLGMFEINKEQVV----------------KHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVV

Query:  NETQQVPEETLPKDWSFSIHHPKELIIGDVSQG
          T     E LPK W+   +HPKELIIG++  G
Subjt:  NETQQVPEETLPKDWSFSIHHPKELIIGDVSQG

A0A2N9HKZ7 CCHC-type domain-containing protein (e value: 1.5e-52; % identity: 44.14)
Query:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNM-EEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKK----
        ENVRKILRSLPK WE K+TAI EA+DL  L LEEL GSL+T+E+ M   + EE+VK KK+ ALKS+     +  EE   EEE A +++ FKK  KK    
Subjt:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNM-EEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKK----

Query:  -RYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFD
         R FPKK  N   KGE S  +   CY+CKK GH ++ECP + K   +  KKA+K TWD+SDE SDS+   S  EVANLC + + +E +  E   E+ SF 
Subjt:  -RYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFD

Query:  ELMDAYNEIKEEFEKLGMFEINKEQVV----------------KHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVV
         L  A+N+ +   E L +     E  +                +HMTGD+ KFT L +KDGG V FG+N +GKIIG V D+++ P         + E  +
Subjt:  ELMDAYNEIKEEFEKLGMFEINKEQVV----------------KHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVV

Query:  NETQQVPEETLPKDWSFSIHHPKELIIGDVSQG
          T     E LPK W+   +HPKELIIG++  G
Subjt:  NETQQVPEETLPKDWSFSIHHPKELIIGDVSQG

A0A2N9IDJ4 CCHC-type domain-containing protein (e value: 1.5e-52; % identity: 44.14)
Query:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNM-EEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKK----
        ENVRKILRSLPK WE K+TAI EA+DL  L LEEL GSL+T+E+ M   + EE+VK KK+ ALKS+     +  EE   EEE A +++ FKK  KK    
Subjt:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNM-EEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKK----

Query:  -RYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFD
         R FPKK  N   KGE S  +   CY+CKK GH ++ECP + K   +  KKA+K TWD+SDE SDS+   S  EVANLC + + +E +  E   E+ SF 
Subjt:  -RYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFD

Query:  ELMDAYNEIKEEFEKLGMFEINKEQVV----------------KHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVV
         L  A+N+ +   E L +     E  +                +HMTGD+ KFT L +KDGG V FG+N +GKIIG V D+++ P         + E  +
Subjt:  ELMDAYNEIKEEFEKLGMFEINKEQVV----------------KHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVV

Query:  NETQQVPEETLPKDWSFSIHHPKELIIGDVSQG
          T     E LPK W+   +HPKELIIG++  G
Subjt:  NETQQVPEETLPKDWSFSIHHPKELIIGDVSQG

A0A2N9IFL9 CCHC-type domain-containing protein (e value: 1.5e-52; % identity: 44.14)
Query:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNM-EEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKK----
        ENVRKILRSLPK WE K+TAI EA+DL  L LEEL GSL+T+E+ M   + EE+VK KK+ ALKS+     +  EE   EEE A +++ FKK  KK    
Subjt:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNM-EEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKK----

Query:  -RYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFD
         R FPKK  N   KGE S  +   CY+CKK GH ++ECP + K   +  KKA+K TWD+SDE SDS+   S  EVANLC + + +E +  E   E+ SF 
Subjt:  -RYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFD

Query:  ELMDAYNEIKEEFEKLGMFEINKEQVV----------------KHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVV
         L  A+N+ +   E L +     E  +                +HMTGD+ KFT L +KDGG V FG+N +GKIIG V D+++ P         + E  +
Subjt:  ELMDAYNEIKEEFEKLGMFEINKEQVV----------------KHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVSDELEMPFENLNFKDNNTEAVV

Query:  NETQQVPEETLPKDWSFSIHHPKELIIGDVSQG
          T     E LPK W+   +HPKELIIG++  G
Subjt:  NETQQVPEETLPKDWSFSIHHPKELIIGDVSQG

A0A6J1CR79 uncharacterized protein LOC111013509 (e value: 1.3e-91; % identity: 79.24)
Query:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKKRYFPK
        ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSL+THE++MK NMEEDVKKKKSLALKSTSFQ ASESEEELNEEE AYLSK+FKKHFKKR+FPK
Subjt:  ENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKKRYFPK

Query:  KTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEI
        KTN+QDAKGEK+TRD+IICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESD  SDSESGEEVANLCFMAFGDEDDD+EVC  +      +D+    
Subjt:  KTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDERSDSESGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEI

Query:  KEEFEKLGMFEINKEQVVKHMTGDRTKFTPLEMKDG
                          KHMTGD TKFT LEMKDG
Subjt:  KEEFEKLGMFEINKEQVVKHMTGDRTKFTPLEMKDG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 4.3e-07; % identity: 24.1)
Query:  ILWGRELLSKGLRYKVWNGSSIQLFHDKWIPRETTFKPISPHPIGCNPLISEFITGS--------KQWDVQKLMRFVNKEDVKAISQIPISWSNEPDIWV
        +L G  LL KG R+ + +G +I++  D  +          P P+       E    +          WD  K+ +FV++ D   I +I ++ S +PD  +
Subjt:  ILWGRELLSKGLRYKVWNGSSIQLFHDKWIPRETTFKPISPHPIGCNPLISEFITGS--------KQWDVQKLMRFVNKEDVKAISQIPISWSNEPDIWV

Query:  WQYDKKGVYSVKSGY---------KVAMASSSIGSSSNYNR-------------------YCLPTMCNLAKKGINASTKCLVCGSHEESIEHVLF
        W Y+  G Y+V+SGY          +   +   GS     R                     L T   L  +G+     C  C    ESI H LF
Subjt:  WQYDKKGVYSVKSGY---------KVAMASSSIGSSSNYNR-------------------YCLPTMCNLAKKGINASTKCLVCGSHEESIEHVLF


Sequences
CDS sequence
ATGGGTGACGCATGGAATGATGGAGATGGAGAGGATGATGGGCAAGTGGAACGTACAGAAGAAGTAGCTAGAGAAGAAGGGGTAGATGTTGTGAGTGGCGATGGTAGGGA
GTTTGAAAGCAAGGGAGAAGGGGATATGTTAGATGGTGAGGAAGAAGAGAAAGAGGTAGCCGTTGGGGAAGGGAAAGAAAGGGGAGATAAGATGGGGGATGAGCAAGTGA
AGGCAAAGTTGAGTTTTGAAATTAAAAGGTCGTGGACTGTGTTGGGAGGGAACTCTGTGATTACGGTAGTTATTATCATGCTGATTCATAAAGATGATTTGTTGGTGGCC
CCGTCTTTGGATTTGTACATGCTTGAGCAATTTAAGAAGATCTTTCACAGCCGCATGCTATGTCTGCCTCCTCTTCTAAAGGTTTGTCATCATTTGATACTTCAGCTCAG
CATAATCGATACCTTGGTTGTCGAGACATCTACATCTCATTCTTCGCTTTCTATGGTCGCTCAATCATCTCCCGAGGTTGCAGATGCTCCTCCGCTTCGTCAGTCCATTC
GTGTTCCTGAGAACGTGAGGAAGATTCTAAGATCTCTACCAAAAAGTTGGGAACCCAAAGTGACAGCAATTCAAGAAGCGAAGGATCTCTCAAAACTTCCATTGGAGGAG
CTCATTGGATCTCTCATCACACATGAGGTAATCATGAAATGTAATATGGAAGAGGACGTCAAAAAGAAGAAAAGCTTGGCATTAAAGTCTACATCCTTCCAAGGGGCTTC
TGAAAGTGAGGAAGAACTAAACGAGGAAGAACATGCCTACTTATCTAAGAGGTTCAAGAAGCATTTCAAGAAGAGGTACTTTCCAAAGAAAACCAACAATCAAGATGCTA
AAGGAGAAAAGAGCACTAGAGATGTCATCATCTGCTACGAGTGCAAAAAGGCTGGACATGTAAGATCTGAATGCCCTCTACTACGTAAATCATCTTCAAGGAGAAACAAG
AAGGCTATGAAAGCTACTTGGGATGAAAGTGATGAAAGAAGTGATTCTGAAAGTGGAGAGGAAGTCGCTAATCTTTGTTTCATGGCTTTTGGAGATGAAGATGATGACGA
CGAGGTATGTTTTGAAAATCCTTCTTTTGATGAGCTTATGGATGCTTACAACGAGATAAAAGAAGAATTTGAAAAATTAGGTATGTTTGAGATCAATAAAGAACAAGTGG
TAAAGCACATGACCGGAGACCGCACAAAATTCACACCCCTAGAAATGAAAGATGGAGGATACGTTACCTTTGGCAATAATAAAAGAGGTAAAATCATTGGCAAAGTTAGT
GATGAGTTAGAAATGCCTTTTGAGAATTTGAATTTTAAAGATAATAACACGGAAGCTGTAGTAAATGAGACTCAACAAGTGCCAGAGGAAACCCTTCCCAAGGATTGGAG
TTTCTCAATTCACCATCCGAAGGAGTTAATTATAGGTGATGTATCCCAAGGGGGGATTGCAAATATTTCTATTATTAATGGTGTGGCAGATTCGCAAGGGATTGCAAATA
CTCCTATTATTAATGGGATGGTGCAAGGGAGTGCAAATAGTCCTATTATTAAGGGATCATTCCCGATGAATGGGGCTAAACAACAAAGTGAGGTTTTGGGATATGAAGGA
CTATTCCCCGCAGCTACCGTAACCAATTTATTGGCAATAAAATCTCCCTTACAGGATTTTCCCCCAATGACCTGTCAGTTGGGTCCCACAGAAACTCAGGCAATGGCTAG
GAAAAATGGTAATTGGAAAAAGAGAGCGCGTGCGGTAGGGAGGAAACCTGGTGATGCATCAGGAGCTCCACCGAAAGCCATGAAAACACTTTGTTGGAATGTCCGGGGTT
TAGGGAACCCTCGAACATTTAGAGCTTGTCGTGATGAATTGAGTGAGAGGGGAAACGATGGAGGTTTACTGGGCTATACGACTAGCTTGAGCAACAGGCGCAGAAATTCA
CATGGGAGCAGCTTAGACGTTTATACAACAATGATGATAGTGCATAGACCGATTGTTATGGAGCTTGTGAAGGACTCTGGTGATATTTGGTGCGGCAAGAGAGCAAGCAG
GTTCATATTTGAGGAGGGTTGGGTGTCTTATCTGGAGTGCAATGAGATAATTGAAAGGAATTGGGGCTCTGCTGATGTGGGTTTGGTGTTTTTCAAGGCAAATGGCAGGG
AATGCGGCTGGGTTAAAAAGGCTCTTATTGACTACGAGATGGCTTCAATTCAGAAGATCAATTTCAGTAAGGTTGTCCGATGTGTTTCTCCTAATGTTAGCTCTAATTGC
TCTCAGTACCTTCAACAGATTCTGGGTGTCCAACTGGGCCAAGTCCCCATACTTTTGAAAGGAATTTTGTGGGGAAGGGAGCTTCTCTCAAAGGGCCTTAGATACAAAGT
CTGGAATGGTAGTTCCATTCAATTGTTTCATGACAAATGGATCCCTCGAGAAACTACTTTCAAACCCATTTCACCTCACCCGATTGGATGCAATCCGTTGATCTCGGAGT
TCATCACTGGTTCTAAGCAATGGGATGTTCAAAAACTCATGAGGTTTGTGAATAAGGAGGATGTGAAGGCAATATCCCAGATCCCTATAAGCTGGTCCAATGAGCCAGAC
ATATGGGTATGGCAATATGACAAGAAGGGCGTTTATTCGGTGAAAAGTGGTTATAAGGTGGCAATGGCGTCCAGCTCTATTGGCTCTTCTTCTAACTACAATCGGTACTG
TCTTCCTACTATGTGTAATCTAGCCAAGAAAGGAATTAATGCATCAACTAAGTGCCTAGTGTGTGGGTCGCATGAAGAGTCTATCGAGCATGTGTTATTCAAATATCTTC
CTCAGAAAGACTTTGAATTAGCTTGTGTGAGAATGTGGGAGATTTGGCTCGATAGAAATGCAATTAGGGTTCATAACCCTATGCCTGATTCAACAAACAGATGTAATTGG
ATCCTTTCATACATGACCGAGATTGAAGACAACATTCCTAACAGAAAGGGAGGTGCGTACTTACTTGGTGTCTCGACAGGCAATGACGAGGGTTCTCTCTGGAGTCCCCT
TCCTTCGGGGCACGAGAATGGTGAAATTCTGGCGGCATTGTCGAAGATGGTGCCTGTGTGTTACAATTCGTTGATAGCCGAGTTGTTAGCAATCCTTGAATGTACCTGGG
TTGCAGACATTAAAAATTTCACTAATCTCTTCTCTCAGGTTCAGTTTCTGCATGTTCGTATGGGAGGCAACAAAGCAGCTTACAAGCTTGCTACCATGGGTGCCTCGAAT
GTAACCAATTTTATGTGGCTGTCTGATTTTCCCTAG
mRNA sequence
ATGGGTGACGCATGGAATGATGGAGATGGAGAGGATGATGGGCAAGTGGAACGTACAGAAGAAGTAGCTAGAGAAGAAGGGGTAGATGTTGTGAGTGGCGATGGTAGGGA
GTTTGAAAGCAAGGGAGAAGGGGATATGTTAGATGGTGAGGAAGAAGAGAAAGAGGTAGCCGTTGGGGAAGGGAAAGAAAGGGGAGATAAGATGGGGGATGAGCAAGTGA
AGGCAAAGTTGAGTTTTGAAATTAAAAGGTCGTGGACTGTGTTGGGAGGGAACTCTGTGATTACGGTAGTTATTATCATGCTGATTCATAAAGATGATTTGTTGGTGGCC
CCGTCTTTGGATTTGTACATGCTTGAGCAATTTAAGAAGATCTTTCACAGCCGCATGCTATGTCTGCCTCCTCTTCTAAAGGTTTGTCATCATTTGATACTTCAGCTCAG
CATAATCGATACCTTGGTTGTCGAGACATCTACATCTCATTCTTCGCTTTCTATGGTCGCTCAATCATCTCCCGAGGTTGCAGATGCTCCTCCGCTTCGTCAGTCCATTC
GTGTTCCTGAGAACGTGAGGAAGATTCTAAGATCTCTACCAAAAAGTTGGGAACCCAAAGTGACAGCAATTCAAGAAGCGAAGGATCTCTCAAAACTTCCATTGGAGGAG
CTCATTGGATCTCTCATCACACATGAGGTAATCATGAAATGTAATATGGAAGAGGACGTCAAAAAGAAGAAAAGCTTGGCATTAAAGTCTACATCCTTCCAAGGGGCTTC
TGAAAGTGAGGAAGAACTAAACGAGGAAGAACATGCCTACTTATCTAAGAGGTTCAAGAAGCATTTCAAGAAGAGGTACTTTCCAAAGAAAACCAACAATCAAGATGCTA
AAGGAGAAAAGAGCACTAGAGATGTCATCATCTGCTACGAGTGCAAAAAGGCTGGACATGTAAGATCTGAATGCCCTCTACTACGTAAATCATCTTCAAGGAGAAACAAG
AAGGCTATGAAAGCTACTTGGGATGAAAGTGATGAAAGAAGTGATTCTGAAAGTGGAGAGGAAGTCGCTAATCTTTGTTTCATGGCTTTTGGAGATGAAGATGATGACGA
CGAGGTATGTTTTGAAAATCCTTCTTTTGATGAGCTTATGGATGCTTACAACGAGATAAAAGAAGAATTTGAAAAATTAGGTATGTTTGAGATCAATAAAGAACAAGTGG
TAAAGCACATGACCGGAGACCGCACAAAATTCACACCCCTAGAAATGAAAGATGGAGGATACGTTACCTTTGGCAATAATAAAAGAGGTAAAATCATTGGCAAAGTTAGT
GATGAGTTAGAAATGCCTTTTGAGAATTTGAATTTTAAAGATAATAACACGGAAGCTGTAGTAAATGAGACTCAACAAGTGCCAGAGGAAACCCTTCCCAAGGATTGGAG
TTTCTCAATTCACCATCCGAAGGAGTTAATTATAGGTGATGTATCCCAAGGGGGGATTGCAAATATTTCTATTATTAATGGTGTGGCAGATTCGCAAGGGATTGCAAATA
CTCCTATTATTAATGGGATGGTGCAAGGGAGTGCAAATAGTCCTATTATTAAGGGATCATTCCCGATGAATGGGGCTAAACAACAAAGTGAGGTTTTGGGATATGAAGGA
CTATTCCCCGCAGCTACCGTAACCAATTTATTGGCAATAAAATCTCCCTTACAGGATTTTCCCCCAATGACCTGTCAGTTGGGTCCCACAGAAACTCAGGCAATGGCTAG
GAAAAATGGTAATTGGAAAAAGAGAGCGCGTGCGGTAGGGAGGAAACCTGGTGATGCATCAGGAGCTCCACCGAAAGCCATGAAAACACTTTGTTGGAATGTCCGGGGTT
TAGGGAACCCTCGAACATTTAGAGCTTGTCGTGATGAATTGAGTGAGAGGGGAAACGATGGAGGTTTACTGGGCTATACGACTAGCTTGAGCAACAGGCGCAGAAATTCA
CATGGGAGCAGCTTAGACGTTTATACAACAATGATGATAGTGCATAGACCGATTGTTATGGAGCTTGTGAAGGACTCTGGTGATATTTGGTGCGGCAAGAGAGCAAGCAG
GTTCATATTTGAGGAGGGTTGGGTGTCTTATCTGGAGTGCAATGAGATAATTGAAAGGAATTGGGGCTCTGCTGATGTGGGTTTGGTGTTTTTCAAGGCAAATGGCAGGG
AATGCGGCTGGGTTAAAAAGGCTCTTATTGACTACGAGATGGCTTCAATTCAGAAGATCAATTTCAGTAAGGTTGTCCGATGTGTTTCTCCTAATGTTAGCTCTAATTGC
TCTCAGTACCTTCAACAGATTCTGGGTGTCCAACTGGGCCAAGTCCCCATACTTTTGAAAGGAATTTTGTGGGGAAGGGAGCTTCTCTCAAAGGGCCTTAGATACAAAGT
CTGGAATGGTAGTTCCATTCAATTGTTTCATGACAAATGGATCCCTCGAGAAACTACTTTCAAACCCATTTCACCTCACCCGATTGGATGCAATCCGTTGATCTCGGAGT
TCATCACTGGTTCTAAGCAATGGGATGTTCAAAAACTCATGAGGTTTGTGAATAAGGAGGATGTGAAGGCAATATCCCAGATCCCTATAAGCTGGTCCAATGAGCCAGAC
ATATGGGTATGGCAATATGACAAGAAGGGCGTTTATTCGGTGAAAAGTGGTTATAAGGTGGCAATGGCGTCCAGCTCTATTGGCTCTTCTTCTAACTACAATCGGTACTG
TCTTCCTACTATGTGTAATCTAGCCAAGAAAGGAATTAATGCATCAACTAAGTGCCTAGTGTGTGGGTCGCATGAAGAGTCTATCGAGCATGTGTTATTCAAATATCTTC
CTCAGAAAGACTTTGAATTAGCTTGTGTGAGAATGTGGGAGATTTGGCTCGATAGAAATGCAATTAGGGTTCATAACCCTATGCCTGATTCAACAAACAGATGTAATTGG
ATCCTTTCATACATGACCGAGATTGAAGACAACATTCCTAACAGAAAGGGAGGTGCGTACTTACTTGGTGTCTCGACAGGCAATGACGAGGGTTCTCTCTGGAGTCCCCT
TCCTTCGGGGCACGAGAATGGTGAAATTCTGGCGGCATTGTCGAAGATGGTGCCTGTGTGTTACAATTCGTTGATAGCCGAGTTGTTAGCAATCCTTGAATGTACCTGGG
TTGCAGACATTAAAAATTTCACTAATCTCTTCTCTCAGGTTCAGTTTCTGCATGTTCGTATGGGAGGCAACAAAGCAGCTTACAAGCTTGCTACCATGGGTGCCTCGAAT
GTAACCAATTTTATGTGGCTGTCTGATTTTCCCTAG
Protein sequence
MGDAWNDGDGEDDGQVERTEEVAREEGVDVVSGDGREFESKGEGDMLDGEEEEKEVAVGEGKERGDKMGDEQVKAKLSFEIKRSWTVLGGNSVITVVIIMLIHKDDLLVA
PSLDLYMLEQFKKIFHSRMLCLPPLLKVCHHLILQLSIIDTLVVETSTSHSSLSMVAQSSPEVADAPPLRQSIRVPENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEE
LIGSLITHEVIMKCNMEEDVKKKKSLALKSTSFQGASESEEELNEEEHAYLSKRFKKHFKKRYFPKKTNNQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNK
KAMKATWDESDERSDSESGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEIKEEFEKLGMFEINKEQVVKHMTGDRTKFTPLEMKDGGYVTFGNNKRGKIIGKVS
DELEMPFENLNFKDNNTEAVVNETQQVPEETLPKDWSFSIHHPKELIIGDVSQGGIANISIINGVADSQGIANTPIINGMVQGSANSPIIKGSFPMNGAKQQSEVLGYEG
LFPAATVTNLLAIKSPLQDFPPMTCQLGPTETQAMARKNGNWKKRARAVGRKPGDASGAPPKAMKTLCWNVRGLGNPRTFRACRDELSERGNDGGLLGYTTSLSNRRRNS
HGSSLDVYTTMMIVHRPIVMELVKDSGDIWCGKRASRFIFEEGWVSYLECNEIIERNWGSADVGLVFFKANGRECGWVKKALIDYEMASIQKINFSKVVRCVSPNVSSNC
SQYLQQILGVQLGQVPILLKGILWGRELLSKGLRYKVWNGSSIQLFHDKWIPRETTFKPISPHPIGCNPLISEFITGSKQWDVQKLMRFVNKEDVKAISQIPISWSNEPD
IWVWQYDKKGVYSVKSGYKVAMASSSIGSSSNYNRYCLPTMCNLAKKGINASTKCLVCGSHEESIEHVLFKYLPQKDFELACVRMWEIWLDRNAIRVHNPMPDSTNRCNW
ILSYMTEIEDNIPNRKGGAYLLGVSTGNDEGSLWSPLPSGHENGEILAALSKMVPVCYNSLIAELLAILECTWVADIKNFTNLFSQVQFLHVRMGGNKAAYKLATMGASN
VTNFMWLSDFP
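
The protein sequence above should follow from the CDS by codon-wise translation. The sketch below illustrates that relationship on two short excerpts copied from the CDS (its first 30 and last 36 nucleotides). It assumes the standard nuclear genetic code, which this record does not state explicitly; `CODON_TABLE` and `translate` are hypothetical names introduced for illustration, not functions of any particular library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 36 nt of the CDS listed in this record.
print(translate("ATGGGTGACGCATGGAATGATGGAGATGGA"))        # MGDAWNDGDG
print(translate("GTAACCAATTTTATGTGGCTGTCTGATTTTCCCTAG"))  # VTNFMWLSDFP
```

Both excerpts reproduce the corresponding ends of the listed protein (it begins MGDAWNDGDG and ends VTNFMWLSDFP), which is consistent with the CDS being translated in frame from its first ATG to the terminal TAG.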