CuGenDBv2

Moc05g17480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g17480
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CCHC-type domain-containing protein
Genome location: chr5:12972269..12974040
RNA-Seq Expression: Moc05g17480
Synteny: Moc05g17480
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR005162 - Retrotransposon gag domain
  IPR036875 - Zinc finger, CCHC-type superfamily
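
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, the page does not state the convention), the string can be parsed and the span computed as follows. The names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr5:12972269..12974040'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr5:12972269..12974040")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out to 1772 bp.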


Homology
GenBank top hits | e value | % identity | Alignment
TYK18910.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa] | 4.0e-64 | 40.88
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR R+ PKHY+ DLKTKLQ+LRQG KSV EYY+EM  L+ +A I+ +EE+TMSRFL GL+ EI H +DRNPP  +EDMYH+AIKI  QL++E E SK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF
                   W        NE MQ +GKF A K+V+AESSN K  E  KEV+EKT+S+ CWKCKGFG+MSK+C+NK VMV+RNGV+DS D C + D + 
Subjt:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF

Query:  EEDSKAHV-DEFIEQD-----------RKDSDF--------------DKSEFKE-KREREELRERNSKENSKIQSICQYKSNLIALLHKDPYYQAHIFYP
        EE+ +  V DE+IE+D               DF              + +E KE +R+ EEL ++       I      K +       +   +A   +P
Subjt:  EEDSKAHV-DEFIEQD-----------RKDSDF--------------DKSEFKE-KREREELRERNSKENSKIQSICQYKSNLIALLHKDPYYQAHIFYP

Query:  SLSTFLRFLQEVPTFQFK---HDRIQETKE-AIKEPRRVILSSSHDLEIVLQGNHGHKKPKVKGIVENYLSKNLYKEFHGRESRMLCWRMQMQRAIIT-R
        + +       E   F++K    +   E KE  IK P   + +     EI    +         G  +  L  N  KE  G +   +   + +    IT R
Subjt:  SLSTFLRFLQEVPTFQFK---HDRIQETKE-AIKEPRRVILSSSHDLEIVLQGNHGHKKPKVKGIVENYLSKNLYKEFHGRESRMLCWRMQMQRAIIT-R

Query:  KARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDDTEDEKSVQIREQVPLQIAPL
        KA+KIQEAFTLH+QKL NAQ   K+FE +F++NV+S SQEE+  KMARE   + +   E +K  +I E+V +Q+A L
Subjt:  KARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDDTEDEKSVQIREQVPLQIAPL

XP_022153198.1 uncharacterized protein LOC111020753 [Momordica charantia] | 6.1e-65 | 63.81
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR RY PKHY+ DLKTKLQALRQGTKSV EYYQEM  LM K  IR EEE+TMSRFL GLN EI H +DRNPPPYLEDMYH+ +KIEDQL++E+E+SK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF
                   W        NE +Q K KF A KR++ ESSN K NE PKEVREKT+S+ CWKCKGFGHMSK+CVNK VMVIRNGVVDS+D CE++D +F
Subjt:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF

Query:  EEDSKAHVDE
        EE  + +++E
Subjt:  EEDSKAHVDE

XP_022932136.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111438459, partial [Cucurbita moschata] | 6.6e-59 | 61.17
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR RY PKHY+ +LKTKLQ LRQGTKSV EYYQEM  +ME+A++R EEE+TMSRFL GLN EI H +DRNPPPYLEDM H+A+KIEDQL++E+EHSK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF
                   W        NE M  K KF A KRV+AESSN K NE  K V+EK++S+ CWKCKGFGHMSKECVNK+VMVIRNG++DS+D  E YD ++
Subjt:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF

Query:  EEDSKA
         E+  +
Subjt:  EEDSKA

XP_023544048.1 uncharacterized protein LOC111803745 [Cucurbita pepo subsp. pepo] | 4.8e-62 | 59.81
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR RY PKHY+ +LKTKLQ LRQGTKSV EYYQEM  +ME+A++R EEE+TMSRFL GLN EI H +DRNPPPYLEDMYH+A+KIEDQL++E+EHSK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF
                   W        NE M  K KF A KRV+AESS  K NE  K V+EK++S+ CWKCKGFGHM KECVNK++MVIRNG++D ND CE +D+  
Subjt:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF

Query:  EEDSKAHVDEFIEQ
         E+  A+ DE++E+
Subjt:  EEDSKAHVDEFIEQ

XP_038902315.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120088948 [Benincasa hispida] | 3.6e-57 | 58.77
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR RY PKHY+ DL+ KLQ LRQGTKSV EYYQEM  L+E+A+IR EEE+T SRFL GLN EIVH +DR P  YLEDMYH+AIKIE+QLR+E+EHSK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------WNE-----FMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKFEED
                   WN+       + KGKF   KR++A S+  K NE  KE REK+NS  CWKCKGFGHMSK+CVNK+VMVIRNG++D  D C++     +E+
Subjt:  -----------WNE-----FMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKFEED

Query:  SKAHVDEFIEQ
        SKA  DE+IE+
Subjt:  SKAHVDEFIEQ

TrEMBL top hits | e value | % identity | Alignment
A0A5D3C8C6 CCHC-type domain-containing protein | 4.3e-56 | 53.56
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR R+ PKHY+ DLKTKLQ+LRQGTKSV EYY+EM  L+ +A I+ +EE+TMSRFL GLN EI H +DRNPP  +EDMYH+AIKIE QL++E+E SK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF
                   W        NE MQ +GKF A ++V+ ESSN K  E  KEVREKT+S+ CWKCKGFGHMSK+C+NK VMV+RNGV+DS D CE+ DA+ 
Subjt:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF

Query:  EEDSKAHV-DEFIEQDRKDSDFDKSEFKEKREREELRER
        EE+ +  V DE+IE+    S   +     K + E++ ++
Subjt:  EEDSKAHV-DEFIEQDRKDSDFDKSEFKEKREREELRER

A0A5D3D5V9 Retrovirus-related Pol polyprotein from transposon 17.6 | 1.9e-64 | 40.88
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR R+ PKHY+ DLKTKLQ+LRQG KSV EYY+EM  L+ +A I+ +EE+TMSRFL GL+ EI H +DRNPP  +EDMYH+AIKI  QL++E E SK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF
                   W        NE MQ +GKF A K+V+AESSN K  E  KEV+EKT+S+ CWKCKGFG+MSK+C+NK VMV+RNGV+DS D C + D + 
Subjt:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF

Query:  EEDSKAHV-DEFIEQD-----------RKDSDF--------------DKSEFKE-KREREELRERNSKENSKIQSICQYKSNLIALLHKDPYYQAHIFYP
        EE+ +  V DE+IE+D               DF              + +E KE +R+ EEL ++       I      K +       +   +A   +P
Subjt:  EEDSKAHV-DEFIEQD-----------RKDSDF--------------DKSEFKE-KREREELRERNSKENSKIQSICQYKSNLIALLHKDPYYQAHIFYP

Query:  SLSTFLRFLQEVPTFQFK---HDRIQETKE-AIKEPRRVILSSSHDLEIVLQGNHGHKKPKVKGIVENYLSKNLYKEFHGRESRMLCWRMQMQRAIIT-R
        + +       E   F++K    +   E KE  IK P   + +     EI    +         G  +  L  N  KE  G +   +   + +    IT R
Subjt:  SLSTFLRFLQEVPTFQFK---HDRIQETKE-AIKEPRRVILSSSHDLEIVLQGNHGHKKPKVKGIVENYLSKNLYKEFHGRESRMLCWRMQMQRAIIT-R

Query:  KARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDDTEDEKSVQIREQVPLQIAPL
        KA+KIQEAFTLH+QKL NAQ   K+FE +F++NV+S SQEE+  KMARE   + +   E +K  +I E+V +Q+A L
Subjt:  KARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDDTEDEKSVQIREQVPLQIAPL

A0A5D3DRJ1 F15O4.13 | 4.3e-56 | 53.56
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR R+ PKHY+ DLKTKLQ+LRQGTKSV EYY+EM  L+ +A I+ +EE+TMSRFL GLN EI H +DRNPP  +EDMYH+AIKIE QL++E+E SK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF
                   W        NE MQ +GKF A ++V+ ESSN K  E  KEVREKT+S+ CWKCKGFGHMSK+C+NK VMV+RNGV+DS D CE+ DA+ 
Subjt:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF

Query:  EEDSKAHV-DEFIEQDRKDSDFDKSEFKEKREREELRER
        EE+ +  V DE+IE+    S   +     K + E++ ++
Subjt:  EEDSKAHV-DEFIEQDRKDSDFDKSEFKEKREREELRER

A0A5D3DRJ1 F15O4.13 | 5.9e-13 | 66.67
Query:  IITRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDDTEDEKSVQ
        I  RKA+KIQEAFTLH+QKL NAQ  TK+FE +F++NV+S SQEE+  KMAREKL  L D T +EKSVQ
Subjt:  IITRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDDTEDEKSVQ

A0A6J1DGU9 uncharacterized protein LOC111020753 | 3.0e-65 | 63.81
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR RY PKHY+ DLKTKLQALRQGTKSV EYYQEM  LM K  IR EEE+TMSRFL GLN EI H +DRNPPPYLEDMYH+ +KIEDQL++E+E+SK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF
                   W        NE +Q K KF A KR++ ESSN K NE PKEVREKT+S+ CWKCKGFGHMSK+CVNK VMVIRNGVVDS+D CE++D +F
Subjt:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF

Query:  EEDSKAHVDE
        EE  + +++E
Subjt:  EEDSKAHVDE

A0A6J1EVI6 LOW QUALITY PROTEIN: uncharacterized protein LOC111438459 | 3.2e-59 | 61.17
Query:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---
        MR RY PKHY+ +LKTKLQ LRQGTKSV EYYQEM  +ME+A++R EEE+TMSRFL GLN EI H +DRNPPPYLEDM H+A+KIEDQL++E+EHSK   
Subjt:  MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSK---

Query:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF
                   W        NE M  K KF A KRV+AESSN K NE  K V+EK++S+ CWKCKGFGHMSKECVNK+VMVIRNG++DS+D  E YD ++
Subjt:  -----------W--------NEFMQQKGKFGAVKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKF

Query:  EEDSKA
         E+  +
Subjt:  EEDSKA

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGAATGAGGTATGCTCCCAAACATTATAAAAATGATTTGAAAACTAAACTGCAAGCTTTGAGGCAGGGAACAAAAAGTGTTGATGAGTATTATCAAGAGATGGTGAT
TTTAATGGAAAAAGCACACATTCGAGGAGAAGAGGAAGAAACCATGTCTAGATTCCTTCGAGGTTTGAATCTGGAAATTGTTCATCATATTGATAGAAATCCACCACCTT
ACTTGGAAGATATGTATCACCATGCTATCAAAATTGAAGATCAATTGAGGAAAGAAGAGGAGCATTCCAAATGGAATGAATTCATGCAACAAAAAGGGAAGTTTGGTGCT
GTCAAAAGAGTGAAGGCCGAGAGCTCCAATGCCAAAAACAATGAAGTTCCAAAGGAGGTAAGGGAGAAGACTAATTCTCTTACATGTTGGAAGTGCAAAGGGTTTGGACA
CATGAGCAAAGAATGTGTCAATAAAGAAGTCATGGTGATAAGAAATGGTGTTGTTGATTCAAATGATGCTTGTGAGAAATATGATGCAAAATTTGAGGAAGATTCTAAAG
CACATGTTGATGAATTTATTGAACAAGATCGCAAAGATTCTGATTTTGATAAAAGTGAGTTCAAAGAAAAGAGAGAAAGAGAAGAGTTAAGAGAAAGGAATTCAAAGGAG
AATAGCAAGATCCAATCCATTTGTCAATATAAGTCAAATCTTATAGCATTGTTGCATAAGGACCCTTACTATCAAGCTCACATATTTTATCCTAGTCTTTCTACTTTTCT
AAGATTTTTGCAGGAAGTTCCAACTTTTCAATTCAAGCATGATCGAATACAAGAAACAAAGGAAGCTATCAAGGAGCCAAGAAGAGTTATCCTATCATCTTCTCATGATC
TTGAGATTGTCTTACAAGGTAACCATGGACATAAGAAACCTAAGGTGAAAGGCATTGTGGAAAACTACTTATCCAAGAATTTGTATAAAGAATTCCATGGTCGAGAAAGC
AGAATGTTATGTTGGAGAATGCAAATGCAAAGAGCAATTATTACAAGAAAGGCAAGGAAGATTCAAGAGGCTTTCACACTGCACCTTCAGAAGCTTGTTAATGCACAAGA
ACCAACAAAGAGTTTTGAGCCCGAATTTATTCATAATGTTACTTCAATGAGTCAAGAAGAGAATGGAGCAAAGATGGCACGAGAAAAGTTGTCTATTTTGAGAGATGACA
CGGAGGACGAAAAAAGTGTGCAGATTCGTGAACAGGTGCCTTTGCAGATTGCACCTCTTTGGAGTTTGTGA
mRNA sequence
ATGAGAATGAGGTATGCTCCCAAACATTATAAAAATGATTTGAAAACTAAACTGCAAGCTTTGAGGCAGGGAACAAAAAGTGTTGATGAGTATTATCAAGAGATGGTGAT
TTTAATGGAAAAAGCACACATTCGAGGAGAAGAGGAAGAAACCATGTCTAGATTCCTTCGAGGTTTGAATCTGGAAATTGTTCATCATATTGATAGAAATCCACCACCTT
ACTTGGAAGATATGTATCACCATGCTATCAAAATTGAAGATCAATTGAGGAAAGAAGAGGAGCATTCCAAATGGAATGAATTCATGCAACAAAAAGGGAAGTTTGGTGCT
GTCAAAAGAGTGAAGGCCGAGAGCTCCAATGCCAAAAACAATGAAGTTCCAAAGGAGGTAAGGGAGAAGACTAATTCTCTTACATGTTGGAAGTGCAAAGGGTTTGGACA
CATGAGCAAAGAATGTGTCAATAAAGAAGTCATGGTGATAAGAAATGGTGTTGTTGATTCAAATGATGCTTGTGAGAAATATGATGCAAAATTTGAGGAAGATTCTAAAG
CACATGTTGATGAATTTATTGAACAAGATCGCAAAGATTCTGATTTTGATAAAAGTGAGTTCAAAGAAAAGAGAGAAAGAGAAGAGTTAAGAGAAAGGAATTCAAAGGAG
AATAGCAAGATCCAATCCATTTGTCAATATAAGTCAAATCTTATAGCATTGTTGCATAAGGACCCTTACTATCAAGCTCACATATTTTATCCTAGTCTTTCTACTTTTCT
AAGATTTTTGCAGGAAGTTCCAACTTTTCAATTCAAGCATGATCGAATACAAGAAACAAAGGAAGCTATCAAGGAGCCAAGAAGAGTTATCCTATCATCTTCTCATGATC
TTGAGATTGTCTTACAAGGTAACCATGGACATAAGAAACCTAAGGTGAAAGGCATTGTGGAAAACTACTTATCCAAGAATTTGTATAAAGAATTCCATGGTCGAGAAAGC
AGAATGTTATGTTGGAGAATGCAAATGCAAAGAGCAATTATTACAAGAAAGGCAAGGAAGATTCAAGAGGCTTTCACACTGCACCTTCAGAAGCTTGTTAATGCACAAGA
ACCAACAAAGAGTTTTGAGCCCGAATTTATTCATAATGTTACTTCAATGAGTCAAGAAGAGAATGGAGCAAAGATGGCACGAGAAAAGTTGTCTATTTTGAGAGATGACA
CGGAGGACGAAAAAAGTGTGCAGATTCGTGAACAGGTGCCTTTGCAGATTGCACCTCTTTGGAGTTTGTGA
Protein sequence
MRMRYAPKHYKNDLKTKLQALRQGTKSVDEYYQEMVILMEKAHIRGEEEETMSRFLRGLNLEIVHHIDRNPPPYLEDMYHHAIKIEDQLRKEEEHSKWNEFMQQKGKFGA
VKRVKAESSNAKNNEVPKEVREKTNSLTCWKCKGFGHMSKECVNKEVMVIRNGVVDSNDACEKYDAKFEEDSKAHVDEFIEQDRKDSDFDKSEFKEKREREELRERNSKE
NSKIQSICQYKSNLIALLHKDPYYQAHIFYPSLSTFLRFLQEVPTFQFKHDRIQETKEAIKEPRRVILSSSHDLEIVLQGNHGHKKPKVKGIVENYLSKNLYKEFHGRES
RMLCWRMQMQRAIITRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDDTEDEKSVQIREQVPLQIAPLWSL
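
The protein sequence above appears to be the translation of the CDS sequence under the standard genetic code (the first codons ATG AGA ATG AGG TAT read as M R M R Y, and the final TGG AGT TTG TGA as W S L followed by a stop). A minimal sketch of that translation is given below. The function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration, not a description of how the database derived its protein record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGAGAATGAGGTATGCTCCCAAACATTAT"))
# Last four codons of the CDS listed above, including the stop.
print(translate("TGGAGTTTGTGA"))
```

Running `translate` on the full CDS with the line breaks removed gives a string that can be compared directly against the listed protein sequence.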