; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g21500 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g21500
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr3:14680485..14682858
RNA-Seq ExpressionMoc03g21500
SyntenyMoc03g21500
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0019538 - protein metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:0140096 - catalytic activity, acting on a protein (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8532301.1 hypothetical protein F0562_032334 [Nyssa sinensis]1.4e-6848.16Show/hide
Query:  DLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSVSECFLKIKTLC
        DL+SG+D+VIP +TPQNV+++RK KIKCGK LFAL+TSI ++YIEHVRD KSPKQVW+TLE+LFTQ+N   LQ+L+NELAG TQ NLS+SE F KIKT  
Subjt:  DLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSVSECFLKIKTLC

Query:  YEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE--------------------------
         E+ ELD EEP+SDARLR YLIR+L K+FM FI SIQGW NQ SIIELENLL N+EAL+KQ+   NK+S  +VE                          
Subjt:  YEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE--------------------------

Query:  ---------------------------VQ--------------GEDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKN--C
                                   VQ               +DW  D  CSHH A GN SL+++VC +HG+R  V   NSLHP++ EG++NVK    
Subjt:  ---------------------------VQ--------------GEDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKN--C

Query:  VLDNVLRKGGCYGPNLKENFVLVYQV
        ++  +  K   + P LK+N   V Q+
Subjt:  VLDNVLRKGGCYGPNLKENFVLVYQV

KAA8540328.1 hypothetical protein F0562_024753 [Nyssa sinensis]3.5e-7244.78Show/hide
Query:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV
        MEAYLQGQDL DLISG+D VIP++TPQN ++RRK KIKCGK LFAL+TSI ++YI+HVRD KSPKQVW+TLE+LFTQ+NT  LQ+L+N+LAG TQ NLS+
Subjt:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV

Query:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVL--------------------------
        SE FLKIKTLC E+SELD EEP+SDARLRRYLIR LRK+FM FI SIQGW NQ SIIELENLLSNQEAL+KQ+                           
Subjt:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVL--------------------------

Query:  ----GDNKRSSLEVEVQG----------------------------------------------------------------------------------
            GDNK+S  E + +G                                                                                  
Subjt:  ----GDNKRSSLEVEVQG----------------------------------------------------------------------------------

Query:  -----------EDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCVLD--NVLRKGGCYGPNLKENFVLVYQV
                   +DW  D GCSHH A GN SL+ +V  H+G+R  V   NSLHP + EG+ NVK  + +   V  K   + P LK+N   V Q+
Subjt:  -----------EDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCVLD--NVLRKGGCYGPNLKENFVLVYQV

KAA8549541.1 hypothetical protein F0562_001441 [Nyssa sinensis]3.7e-6973.51Show/hide
Query:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV
        MEAYLQGQDL DLISG+D VIP++TP+N ++RRK KIKCGK LFAL+TSI ++YI+HVRD KSPKQVW+TLE+LFTQ+N   LQ+L+N+LAG TQ NLS+
Subjt:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV

Query:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE
        SE FLKIKTLC E+SELD EEP+SDARLRRYLIR LRK+FM FI SIQGW NQ SIIELENLLSNQEAL+KQ+   NK+S  +VE
Subjt:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE

KAA8549858.1 hypothetical protein F0562_001542 [Nyssa sinensis]6.2e-6943.91Show/hide
Query:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV
        MEAYLQGQ+L DLISG+D VI ++TPQNV++RRK KIK GK LFAL+TSI ++YI+HVRD KSPKQVW TLE+LFTQ+NT  LQ+L NELAG TQ NLS+
Subjt:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV

Query:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE---------------
         E FLKIKTLC E+SELD EEP+SDARL RYLI  LRK+FM FI SIQGW NQ  IIELENLLSNQEAL+KQ+  +NK+S  +VE               
Subjt:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE---------------

Query:  ----------------------------------------------------------------------------------------------VQ----
                                                                                                      VQ    
Subjt:  ----------------------------------------------------------------------------------------------VQ----

Query:  ----------GEDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCV--LDNVLRKGGCYGPNLKENFVLVYQVL
                   +DW  DFGCSHH AIGN  L+ +VC H+G+R  V   NSLHP + EG  NVK  +  +  V  K   + P+LK+N   V Q++
Subjt:  ----------GEDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCV--LDNVLRKGGCYGPNLKENFVLVYQVL

KAG6384138.1 hypothetical protein SASPL_156061 [Salvia splendens]9.3e-6548.61Show/hide
Query:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV
        MEAYLQG DL DLIS ED V+P++TPQN + RRK KIKCGK LFAL+TSI K YIEHVR+ +SPK+VW+TL++L TQ+N   LQ+L+NELAGT Q  L++
Subjt:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV

Query:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE---------------
        SE FLK+K+LC E+SELD E  + +ARLRRYLIR LR +FM F  SIQGWVNQ SIIELENLLSN EALVKQ++G N +S   VE               
Subjt:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE---------------

Query:  ---------VQGE-------------------------------DWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCVLD
                  +GE                               +W  D G SHH A GN SL+  V  H+ ++V V   NSLHP + EG    K CV  
Subjt:  ---------VQGE-------------------------------DWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCVLD

Query:  NVLRKGGCYGPNLKENFVLVYQV
        ++      + P LK+N   V Q+
Subjt:  NVLRKGGCYGPNLKENFVLVYQV

TrEMBL top hitse value%identityAlignment
A0A443N8T5 Integrase, catalytic core5.0e-6470.27Show/hide
Query:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV
        MEAYLQGQDL DLISG+++VIP++T QN  + RK KIKCGK LFAL+TSI + YI  VRD  SPKQVW+ LE+LFTQ+NT  LQYL+NELAG TQG LS+
Subjt:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV

Query:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE
         E FLK+KTLC E+SELD EEP+SDARL RYLIR LRK+FM FI SIQGW  Q SIIELENLLSNQEALVKQ+  ++K+S   VE
Subjt:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE

A0A5J5ATF7 gag_pre-integrs domain-containing protein6.7e-6948.16Show/hide
Query:  DLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSVSECFLKIKTLC
        DL+SG+D+VIP +TPQNV+++RK KIKCGK LFAL+TSI ++YIEHVRD KSPKQVW+TLE+LFTQ+N   LQ+L+NELAG TQ NLS+SE F KIKT  
Subjt:  DLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSVSECFLKIKTLC

Query:  YEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE--------------------------
         E+ ELD EEP+SDARLR YLIR+L K+FM FI SIQGW NQ SIIELENLL N+EAL+KQ+   NK+S  +VE                          
Subjt:  YEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE--------------------------

Query:  ---------------------------VQ--------------GEDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKN--C
                                   VQ               +DW  D  CSHH A GN SL+++VC +HG+R  V   NSLHP++ EG++NVK    
Subjt:  ---------------------------VQ--------------GEDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKN--C

Query:  VLDNVLRKGGCYGPNLKENFVLVYQV
        ++  +  K   + P LK+N   V Q+
Subjt:  VLDNVLRKGGCYGPNLKENFVLVYQV

A0A5J5BCB3 Uncharacterized protein1.7e-7244.78Show/hide
Query:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV
        MEAYLQGQDL DLISG+D VIP++TPQN ++RRK KIKCGK LFAL+TSI ++YI+HVRD KSPKQVW+TLE+LFTQ+NT  LQ+L+N+LAG TQ NLS+
Subjt:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV

Query:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVL--------------------------
        SE FLKIKTLC E+SELD EEP+SDARLRRYLIR LRK+FM FI SIQGW NQ SIIELENLLSNQEAL+KQ+                           
Subjt:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVL--------------------------

Query:  ----GDNKRSSLEVEVQG----------------------------------------------------------------------------------
            GDNK+S  E + +G                                                                                  
Subjt:  ----GDNKRSSLEVEVQG----------------------------------------------------------------------------------

Query:  -----------EDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCVLD--NVLRKGGCYGPNLKENFVLVYQV
                   +DW  D GCSHH A GN SL+ +V  H+G+R  V   NSLHP + EG+ NVK  + +   V  K   + P LK+N   V Q+
Subjt:  -----------EDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCVLD--NVLRKGGCYGPNLKENFVLVYQV

A0A5J5C3K7 Uncharacterized protein3.0e-6943.91Show/hide
Query:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV
        MEAYLQGQ+L DLISG+D VI ++TPQNV++RRK KIK GK LFAL+TSI ++YI+HVRD KSPKQVW TLE+LFTQ+NT  LQ+L NELAG TQ NLS+
Subjt:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV

Query:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE---------------
         E FLKIKTLC E+SELD EEP+SDARL RYLI  LRK+FM FI SIQGW NQ  IIELENLLSNQEAL+KQ+  +NK+S  +VE               
Subjt:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE---------------

Query:  ----------------------------------------------------------------------------------------------VQ----
                                                                                                      VQ    
Subjt:  ----------------------------------------------------------------------------------------------VQ----

Query:  ----------GEDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCV--LDNVLRKGGCYGPNLKENFVLVYQVL
                   +DW  DFGCSHH AIGN  L+ +VC H+G+R  V   NSLHP + EG  NVK  +  +  V  K   + P+LK+N   V Q++
Subjt:  ----------GEDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRVNVVVGNSLHPDM-EGHVNVKNCV--LDNVLRKGGCYGPNLKENFVLVYQVL

A0A5J5C7C2 DUF4219 domain-containing protein1.8e-6973.51Show/hide
Query:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV
        MEAYLQGQDL DLISG+D VIP++TP+N ++RRK KIKCGK LFAL+TSI ++YI+HVRD KSPKQVW+TLE+LFTQ+N   LQ+L+N+LAG TQ NLS+
Subjt:  MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSV

Query:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE
        SE FLKIKTLC E+SELD EEP+SDARLRRYLIR LRK+FM FI SIQGW NQ SIIELENLLSNQEAL+KQ+   NK+S  +VE
Subjt:  SECFLKIKTLCYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G10290.1 leucine-rich repeat transmembrane protein kinase family protein2.0e-0457.89Show/hide
Query:  LVYQVLWLLNMLLDEDFGAIVGDFGLAKLVGVSMTYVT
        ++++ +   N+LLDEDF A+VGDFGLAKLV V  T VT
Subjt:  LVYQVLWLLNMLLDEDFGAIVGDFGLAKLVGVSMTYVT

AT5G65240.1 Leucine-rich repeat protein kinase family protein2.0e-0457.89Show/hide
Query:  LVYQVLWLLNMLLDEDFGAIVGDFGLAKLVGVSMTYVT
        ++++ +   N+LLDEDF A+VGDFGLAKLV V  T VT
Subjt:  LVYQVLWLLNMLLDEDFGAIVGDFGLAKLVGVSMTYVT

AT5G65240.2 Leucine-rich repeat protein kinase family protein2.0e-0457.89Show/hide
Query:  LVYQVLWLLNMLLDEDFGAIVGDFGLAKLVGVSMTYVT
        ++++ +   N+LLDEDF A+VGDFGLAKLV V  T VT
Subjt:  LVYQVLWLLNMLLDEDFGAIVGDFGLAKLVGVSMTYVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTTATCTACAAGGGCAAGATTTGTTGGATCTTATATCTGGTGAAGATAGTGTAATTCCAGATAATACTCCACAAAATGTTAAGGTACGAAGGAAGTTGAAGAT
CAAATGTGGCAAAGTTTTATTTGCTTTGCAAACTTCTATTGGCAAGAAGTATATCGAGCATGTTCGTGACGATAAGTCTCCAAAACAAGTGTGGGATACACTTGAAAAGT
TGTTCACTCAAGAGAACACGACGATGTTGCAGTATTTGGATAACGAACTTGCTGGAACAACTCAAGGTAATTTGTCAGTTTCAGAGTGCTTTCTGAAAATTAAAACTTTG
TGTTATGAAGTTTCAGAACTGGACGAGGAAGAGCCCATTAGTGATGCTCGTTTGCGACGTTATCTTATTCGTCAACTGCGAAAGAAATTTATGTCATTTATTCCCTCGAT
ACAAGGTTGGGTAAATCAATATTCTATCATTGAGTTGGAAAACTTACTCTCAAATCAGGAAGCCTTGGTGAAACAAGTGCTTGGCGACAACAAACGGTCTTCTCTCGAGG
TAGAAGTCCAAGGTGAGGATTGGTCTTTTGATTTTGGTTGTTCTCATCACAGTGCTATTGGAAATGTTTCTCTTATATATGACGTTTGTATTCATCATGGAAGAAGAGTC
AATGTGGTGGTTGGTAATTCCTTACATCCTGATATGGAAGGACATGTCAATGTTAAAAATTGTGTATTGGACAATGTTCTTCGTAAAGGTGGTTGCTACGGTCCAAATTT
GAAGGAGAATTTTGTTTTAGTCTATCAGGTGTTGTGGTTGCTAAATATGTTGCTTGATGAAGATTTTGGAGCCATTGTTGGCGATTTTGGCCTAGCAAAGTTGGTTGGTG
TTAGCATGACTTATGTCACTCAAGTTTTGAATAGAAGTTCTTCCTCGACGAATCTTGTTTTCGTCGTCGAAGGTGGACCTGGTGACGAAAACGAGTTTCGTTGCCAATTC
GTCGTCGTAGAAGCTTTCGGTGATGAAAGCAAAATTTCATCGCCGAAACCTTTTGGTGACGGCCCAACAGCGAGGACTTTTCGTCGCTCAAGGCTTTTGACGTCATCGCC
AAAACCTGAAATTCTTGTAGTGTACGTATGGTTTCTCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTTATCTACAAGGGCAAGATTTGTTGGATCTTATATCTGGTGAAGATAGTGTAATTCCAGATAATACTCCACAAAATGTTAAGGTACGAAGGAAGTTGAAGAT
CAAATGTGGCAAAGTTTTATTTGCTTTGCAAACTTCTATTGGCAAGAAGTATATCGAGCATGTTCGTGACGATAAGTCTCCAAAACAAGTGTGGGATACACTTGAAAAGT
TGTTCACTCAAGAGAACACGACGATGTTGCAGTATTTGGATAACGAACTTGCTGGAACAACTCAAGGTAATTTGTCAGTTTCAGAGTGCTTTCTGAAAATTAAAACTTTG
TGTTATGAAGTTTCAGAACTGGACGAGGAAGAGCCCATTAGTGATGCTCGTTTGCGACGTTATCTTATTCGTCAACTGCGAAAGAAATTTATGTCATTTATTCCCTCGAT
ACAAGGTTGGGTAAATCAATATTCTATCATTGAGTTGGAAAACTTACTCTCAAATCAGGAAGCCTTGGTGAAACAAGTGCTTGGCGACAACAAACGGTCTTCTCTCGAGG
TAGAAGTCCAAGGTGAGGATTGGTCTTTTGATTTTGGTTGTTCTCATCACAGTGCTATTGGAAATGTTTCTCTTATATATGACGTTTGTATTCATCATGGAAGAAGAGTC
AATGTGGTGGTTGGTAATTCCTTACATCCTGATATGGAAGGACATGTCAATGTTAAAAATTGTGTATTGGACAATGTTCTTCGTAAAGGTGGTTGCTACGGTCCAAATTT
GAAGGAGAATTTTGTTTTAGTCTATCAGGTGTTGTGGTTGCTAAATATGTTGCTTGATGAAGATTTTGGAGCCATTGTTGGCGATTTTGGCCTAGCAAAGTTGGTTGGTG
TTAGCATGACTTATGTCACTCAAGTTTTGAATAGAAGTTCTTCCTCGACGAATCTTGTTTTCGTCGTCGAAGGTGGACCTGGTGACGAAAACGAGTTTCGTTGCCAATTC
GTCGTCGTAGAAGCTTTCGGTGATGAAAGCAAAATTTCATCGCCGAAACCTTTTGGTGACGGCCCAACAGCGAGGACTTTTCGTCGCTCAAGGCTTTTGACGTCATCGCC
AAAACCTGAAATTCTTGTAGTGTACGTATGGTTTCTCTCTTGA
Protein sequenceShow/hide protein sequence
MEAYLQGQDLLDLISGEDSVIPDNTPQNVKVRRKLKIKCGKVLFALQTSIGKKYIEHVRDDKSPKQVWDTLEKLFTQENTTMLQYLDNELAGTTQGNLSVSECFLKIKTL
CYEVSELDEEEPISDARLRRYLIRQLRKKFMSFIPSIQGWVNQYSIIELENLLSNQEALVKQVLGDNKRSSLEVEVQGEDWSFDFGCSHHSAIGNVSLIYDVCIHHGRRV
NVVVGNSLHPDMEGHVNVKNCVLDNVLRKGGCYGPNLKENFVLVYQVLWLLNMLLDEDFGAIVGDFGLAKLVGVSMTYVTQVLNRSSSSTNLVFVVEGGPGDENEFRCQF
VVVEAFGDESKISSPKPFGDGPTARTFRRSRLLTSSPKPEILVVYVWFLS