; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g14650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g14650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:11182438..11188366
RNA-Seq ExpressionMoc04g14650
SyntenyMoc04g14650
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047078.1 reverse transcriptase [Cucumis melo var. makuwa]2.6e-4834.99Show/hide
Query:  KEVESTTVLSPRSTTNRLVMVEGAVENLQQNVVEICQLLNVIANKIDA-------------------------INLHQIGRID------------NQIP-
        +E E   VLSP++T+ RL+ +E +VE ++  +  + Q L  +    +                          IN HQ  R D            N  P 
Subjt:  KEVESTTVLSPRSTTNRLVMVEGAVENLQQNVVEICQLLNVIANKIDA-------------------------INLHQIGRID------------NQIP-

Query:  ---------------------PRFDQNREGRQSEINGQQHNNLHEDGDYKMKIDLLTFDEK------MHIEG----------------------------
                             P F+QNR  R      Q      E  + KMKIDL ++D K      +  EG                            
Subjt:  ---------------------PRFDQNREGRQSEINGQQHNNLHEDGDYKMKIDLLTFDEK------MHIEG----------------------------

Query:  --LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSE
          L A  NL E+EQ+Q+ARFI GLR DIKEK++LQP  +LSEAI+   TVEE  A + K             STS           T G     + ++  
Subjt:  --LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSE

Query:  DNKVPDPVPK-KPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHSLFG
        D+K  + V K K  + Y+RP+LGKCFRCGQ GH SN CPQ+K++ L D   D     + + +E+   IE DDG +VSC+++R+LL PK ++ PQ HSLF 
Subjt:  DNKVPDPVPK-KPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHSLFG

Query:  TRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK
        TRCT+NGKVC++IID+ S+EN V+ KL++ALNLK  PH +PYK
Subjt:  TRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK

KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]1.9e-4644.94Show/hide
Query:  EKMHIEGLRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTK
        E+ H  G R  TNL E E++ ++ F+ GLR D+KEK++LQP  +LSEAI    TVEE   N+ K   +R      + S SKK  +   KL    +    +
Subjt:  EKMHIEGLRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTK

Query:  GRNSEDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRH
           S   K      KK  + Y RP  G C+RCGQ+GH SN+CPQ+K++ +  D  D       + DE+   IE D+G+ +SCIL+R+L++PK ++  QRH
Subjt:  GRNSEDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRH

Query:  SLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK
        SLF TRCT+ GKVCN+IIDS S+EN VS KL++ALNLK  PH  PYK
Subjt:  SLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]2.4e-4947.97Show/hide
Query:  LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSEDN
        L A TNL E+EQ+QVARF+ GLR DIKEK++LQP  +LSEAI+   TVEE  A + K    RRS  +  ++ SK    T D+     TS   KG+  ++ 
Subjt:  LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSEDN

Query:  KVPDPVPK----KP--PDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHS
        +V     K    KP   +SYSRP+LGKCFRCGQ GHLS+ CPQ+K++ + ++   + +  + +++E+   IE DDGE+VSC+++R+L+TPK +   QRH 
Subjt:  KVPDPVPK----KP--PDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHS

Query:  LFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK
        LF TRCT+NG+VC++IIDS S+EN V+ KL++ LNLK   H +PYK
Subjt:  LFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]2.2e-4747.74Show/hide
Query:  LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSEDN
        L A TNL E+EQ+QVARF+ GLR DIKEK++LQP  +LSEAI+   TVEE  A + K    RRS  +  ++ SK    T D+     TS   KG+  ++ 
Subjt:  LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSEDN

Query:  KVPDPVPK----KP--PDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHS
        +V     K    KP   ++YSRP+LGKCFRCGQ GHLSN CPQ+K++ + ++     +  + +++E+   IE DDGE+VSC+++R+L+TPK +   QRH 
Subjt:  KVPDPVPK----KP--PDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHS

Query:  LFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSS
        LF TRCT+NG+VC++IIDS S+EN V+ KL++ LNLK   H S
Subjt:  LFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSS

XP_031745468.1 uncharacterized protein LOC116405837 [Cucumis sativus]1.7e-4747.74Show/hide
Query:  LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSEDN
        L A TNL E+EQ+QVARF+ GLR DIKEK++LQP  +LSEAI+   TVEE  A + K    RRS  +  ++ SK    T D+     TS   KG+  ++ 
Subjt:  LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSEDN

Query:  KVPDPVPK----KP--PDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHS
        +V     K    KP   ++YSRP+LGKCFRCGQ GHLSN CPQ+K++ + +D     +  + +++E+   IE DDGE+VSC+++R+L+TPK +   QRH 
Subjt:  KVPDPVPK----KP--PDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHS

Query:  LFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSS
        LF TRCT+NG+VC++IID+ S+EN V+ KL++ LNLK   H S
Subjt:  LFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSS

TrEMBL top hitse value%identityAlignment
A0A5A7SLA2 Zf-CCHC domain-containing protein6.5e-3733.06Show/hide
Query:  MAGKK--GSSGELSLAGKEVESTTVLSPRSTTNRLVMVEGAVENLQQNVVEICQLLNVIANKIDAINLHQIGRIDNQ-IPPRFDQNREGRQSEINGQQ-H
        MAG++    + E     +E E    L P+++T RL++V+ ++ +L +    + +   ++  ++D +      RI+ + +  R   +R GR++  N +  H
Subjt:  MAGKK--GSSGELSLAGKEVESTTVLSPRSTTNRLVMVEGAVENLQQNVVEICQLLNVIANKIDAINLHQIGRIDNQ-IPPRFDQNREGRQSEINGQQ-H

Query:  NNLH---EDGDYKMKIDLLTFDEKMHIEGLRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQAS
        N  H   E  DY    D   F E+   EG      + + E        S  +  ++E + ++      +  +T  T   +  N F       +    Q S
Subjt:  NNLH---EDGDYKMKIDLLTFDEKMHIEGLRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQAS

Query:  TSKKGMSTLDKLATTGTSLTTKGRNSEDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGE
        TS +G          G  + T+  NSE  K       K  ++Y+ P+LGKCFRCGQ GHLSN CPQQK++ L D+  D +   + D++E++  IE D+G+
Subjt:  TSKKGMSTLDKLATTGTSLTTKGRNSEDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGE

Query:  QVSCILERILLTPKTDSFPQRHSLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK
        +VSCIL+R+LL P+ +  PQRHS F TRCT+NGKVC++IIDS S+EN ++ KL++ LN+K  PH +PYK
Subjt:  QVSCILERILLTPKTDSFPQRHSLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK

A0A5A7UXS4 CCHC-type domain-containing protein2.9e-3733.86Show/hide
Query:  IPPRFDQNREGRQSEINGQQHNNLHEDGDYKMKIDLLTFDEKMHIEGLRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQA
        +PP ++Q            Q+ N H+         +  + E+ H   L A TNLGE+EQ+Q+ARFI     +  E++ + P+   +      V   ++Q+
Subjt:  IPPRFDQNREGRQSEINGQQHNNLHEDGDYKMKIDLLTFDEKMHIEGLRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQA

Query:  NKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSEDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQS
           +   Q  ++V G++    K + T D  A      T KG              K  ++Y+RP+L KCFRCGQ GHLSN CPQ+++++L D   +    
Subjt:  NKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSEDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQS

Query:  FTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHSLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYKFD--NDEDVVKVV
           + +E+  +IE DDG+++S +++R+L+ PK ++ PQRHSLF TRCT+N +VC++IIDS S+EN V+ KL++ LNLK +P+ +PYK           + 
Subjt:  FTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHSLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYKFD--NDEDVVKVV

Query:  IVYFIELAIMGMGIRD
         +Y + L+I G G +D
Subjt:  IVYFIELAIMGMGIRD

A0A5B7BER3 Uncharacterized protein9.1e-3939.6Show/hide
Query:  LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANK-FKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGR--NS
        L +  NL E E  QVAR++ GLRA I+++L L+ I  L+EA +  + VE QQ+ +  + Q   RS  D   +   +    ++ +      +T + +  +S
Subjt:  LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANK-FKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGR--NS

Query:  EDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLV----DDCPDLEQSFTPDSDEDIA---YIEPDDGEQVSCILERILLTPKTDSFP
        ++   P    +K  + Y+RP  GKCFRC Q GH SNECP ++ VN+V    D+ PD E     +  ++       E D+GE VSC+++R+LL PK +  P
Subjt:  EDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLV----DDCPDLEQSFTPDSDEDIA---YIEPDDGEQVSCILERILLTPKTDSFP

Query:  QRHSLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK
        QRH++F TRCT+N KVC++IIDS S+EN+VS  L+ AL LK   H +PYK
Subjt:  QRHSLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK

A0A5D3C3X9 Reverse transcriptase1.3e-4834.99Show/hide
Query:  KEVESTTVLSPRSTTNRLVMVEGAVENLQQNVVEICQLLNVIANKIDA-------------------------INLHQIGRID------------NQIP-
        +E E   VLSP++T+ RL+ +E +VE ++  +  + Q L  +    +                          IN HQ  R D            N  P 
Subjt:  KEVESTTVLSPRSTTNRLVMVEGAVENLQQNVVEICQLLNVIANKIDA-------------------------INLHQIGRID------------NQIP-

Query:  ---------------------PRFDQNREGRQSEINGQQHNNLHEDGDYKMKIDLLTFDEK------MHIEG----------------------------
                             P F+QNR  R      Q      E  + KMKIDL ++D K      +  EG                            
Subjt:  ---------------------PRFDQNREGRQSEINGQQHNNLHEDGDYKMKIDLLTFDEK------MHIEG----------------------------

Query:  --LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSE
          L A  NL E+EQ+Q+ARFI GLR DIKEK++LQP  +LSEAI+   TVEE  A + K             STS           T G     + ++  
Subjt:  --LRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSE

Query:  DNKVPDPVPK-KPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHSLFG
        D+K  + V K K  + Y+RP+LGKCFRCGQ GH SN CPQ+K++ L D   D     + + +E+   IE DDG +VSC+++R+LL PK ++ PQ HSLF 
Subjt:  DNKVPDPVPK-KPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHSLFG

Query:  TRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK
        TRCT+NGKVC++IID+ S+EN V+ KL++ALNLK  PH +PYK
Subjt:  TRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK

A0A5D3DGR0 Reverse transcriptase9.1e-4744.94Show/hide
Query:  EKMHIEGLRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTK
        E+ H  G R  TNL E E++ ++ F+ GLR D+KEK++LQP  +LSEAI    TVEE   N+ K   +R      + S SKK  +   KL    +    +
Subjt:  EKMHIEGLRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTK

Query:  GRNSEDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRH
           S   K      KK  + Y RP  G C+RCGQ+GH SN+CPQ+K++ +  D  D       + DE+   IE D+G+ +SCIL+R+L++PK ++  QRH
Subjt:  GRNSEDNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRH

Query:  SLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK
        SLF TRCT+ GKVCN+IIDS S+EN VS KL++ALNLK  PH  PYK
Subjt:  SLFGTRCTVNGKVCNIIIDSSSTENVVSSKLISALNLKVSPHSSPYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGGCAAGAAGGGTTCTTCAGGAGAACTTTCCCTCGCTGGAAAAGAGGTGGAATCAACTACCGTCCTCTCTCCACGATCCACAACCAATCGTTTAGTAATGGTTGA
GGGGGCTGTTGAAAACCTCCAACAGAATGTGGTAGAGATTTGCCAACTATTGAATGTAATTGCTAACAAGATAGATGCGATCAACTTACATCAGATAGGGAGGATCGACA
ACCAGATTCCTCCAAGATTTGATCAAAATAGGGAGGGTCGACAATCAGAAATCAATGGACAACAACACAACAACTTGCATGAAGATGGCGATTATAAGATGAAAATCGAC
TTACTTACCTTTGATGAGAAGATGCACATTGAAGGGTTGAGGGCAATAACTAATTTGGGGGAACACGAGCAGTATCAAGTGGCTCGGTTTATTAGTGGATTGAGAGCTGA
TATTAAAGAAAAATTGCAGCTTCAACCGATTGGATACTTAAGTGAAGCCATTGCCACTACAGTAACGGTAGAAGAGCAACAAGCTAACAAATTCAAATACCAATATCAGA
GGCGTTCCACGGTTGATGGCCAAGCCTCTACCTCCAAAAAGGGTATGTCCACTCTTGACAAATTGGCAACTACAGGTACATCTCTTACAACAAAAGGCAGGAACAGTGAA
GATAATAAAGTGCCAGATCCGGTTCCCAAAAAACCTCCTGATTCGTATAGCAGACCAACTCTTGGGAAATGTTTCAGGTGTGGACAGGTGGGTCATTTATCAAATGAATG
TCCTCAACAGAAATCGGTCAATCTTGTGGATGATTGTCCGGACCTTGAGCAGTCTTTCACTCCAGACTCTGATGAAGACATTGCTTATATTGAACCAGATGATGGAGAAC
AAGTGTCGTGCATTTTGGAACGAATCTTGCTGACTCCAAAAACCGATTCCTTTCCACAGCGTCACTCTCTGTTTGGAACTCGCTGCACTGTTAATGGTAAGGTATGCAAC
ATCATAATTGATAGCAGCAGTACAGAAAACGTGGTATCCAGTAAACTCATTTCCGCCCTTAATCTGAAAGTTTCTCCACACTCGAGTCCTTACAAATTTGACAACGATGA
GGATGTTGTCAAGGTTGTCATAGTTTACTTCATAGAGCTTGCCATAATGGGTATGGGCATACGAGACCATCTAGACGTTGAGTACACAGTAGCAATAAGGATAAACAGCG
ATGTCATTCCTCAACTTCTCAGATGGTCATGCACTTATTCTCACCTGACGATAGGTTTTATGCTGTCCAAGGTTAAAGTACATTTGGAGTCTACGAATGTTGAACAACAA
CACATGGATTGTATAATGCATCCACCGGAAGCCTGTATCATATCCCCTCAAAGACTTGACATTGAGGCAACACTTGATGTTCAAATGCATCCCGTTGTTGAAAGGAGAAA
CTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGGCAAGAAGGGTTCTTCAGGAGAACTTTCCCTCGCTGGAAAAGAGGTGGAATCAACTACCGTCCTCTCTCCACGATCCACAACCAATCGTTTAGTAATGGTTGA
GGGGGCTGTTGAAAACCTCCAACAGAATGTGGTAGAGATTTGCCAACTATTGAATGTAATTGCTAACAAGATAGATGCGATCAACTTACATCAGATAGGGAGGATCGACA
ACCAGATTCCTCCAAGATTTGATCAAAATAGGGAGGGTCGACAATCAGAAATCAATGGACAACAACACAACAACTTGCATGAAGATGGCGATTATAAGATGAAAATCGAC
TTACTTACCTTTGATGAGAAGATGCACATTGAAGGGTTGAGGGCAATAACTAATTTGGGGGAACACGAGCAGTATCAAGTGGCTCGGTTTATTAGTGGATTGAGAGCTGA
TATTAAAGAAAAATTGCAGCTTCAACCGATTGGATACTTAAGTGAAGCCATTGCCACTACAGTAACGGTAGAAGAGCAACAAGCTAACAAATTCAAATACCAATATCAGA
GGCGTTCCACGGTTGATGGCCAAGCCTCTACCTCCAAAAAGGGTATGTCCACTCTTGACAAATTGGCAACTACAGGTACATCTCTTACAACAAAAGGCAGGAACAGTGAA
GATAATAAAGTGCCAGATCCGGTTCCCAAAAAACCTCCTGATTCGTATAGCAGACCAACTCTTGGGAAATGTTTCAGGTGTGGACAGGTGGGTCATTTATCAAATGAATG
TCCTCAACAGAAATCGGTCAATCTTGTGGATGATTGTCCGGACCTTGAGCAGTCTTTCACTCCAGACTCTGATGAAGACATTGCTTATATTGAACCAGATGATGGAGAAC
AAGTGTCGTGCATTTTGGAACGAATCTTGCTGACTCCAAAAACCGATTCCTTTCCACAGCGTCACTCTCTGTTTGGAACTCGCTGCACTGTTAATGGTAAGGTATGCAAC
ATCATAATTGATAGCAGCAGTACAGAAAACGTGGTATCCAGTAAACTCATTTCCGCCCTTAATCTGAAAGTTTCTCCACACTCGAGTCCTTACAAATTTGACAACGATGA
GGATGTTGTCAAGGTTGTCATAGTTTACTTCATAGAGCTTGCCATAATGGGTATGGGCATACGAGACCATCTAGACGTTGAGTACACAGTAGCAATAAGGATAAACAGCG
ATGTCATTCCTCAACTTCTCAGATGGTCATGCACTTATTCTCACCTGACGATAGGTTTTATGCTGTCCAAGGTTAAAGTACATTTGGAGTCTACGAATGTTGAACAACAA
CACATGGATTGTATAATGCATCCACCGGAAGCCTGTATCATATCCCCTCAAAGACTTGACATTGAGGCAACACTTGATGTTCAAATGCATCCCGTTGTTGAAAGGAGAAA
CTAG
Protein sequenceShow/hide protein sequence
MAGKKGSSGELSLAGKEVESTTVLSPRSTTNRLVMVEGAVENLQQNVVEICQLLNVIANKIDAINLHQIGRIDNQIPPRFDQNREGRQSEINGQQHNNLHEDGDYKMKID
LLTFDEKMHIEGLRAITNLGEHEQYQVARFISGLRADIKEKLQLQPIGYLSEAIATTVTVEEQQANKFKYQYQRRSTVDGQASTSKKGMSTLDKLATTGTSLTTKGRNSE
DNKVPDPVPKKPPDSYSRPTLGKCFRCGQVGHLSNECPQQKSVNLVDDCPDLEQSFTPDSDEDIAYIEPDDGEQVSCILERILLTPKTDSFPQRHSLFGTRCTVNGKVCN
IIIDSSSTENVVSSKLISALNLKVSPHSSPYKFDNDEDVVKVVIVYFIELAIMGMGIRDHLDVEYTVAIRINSDVIPQLLRWSCTYSHLTIGFMLSKVKVHLESTNVEQQ
HMDCIMHPPEACIISPQRLDIEATLDVQMHPVVERRN