; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g29080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g29080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr6:21857894..21867375
RNA-Seq ExpressionMoc06g29080
SyntenyMoc06g29080
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8501049.1 hypothetical protein CXB51_003148 [Gossypium anomalum]2.8e-3530.03Show/hide
Query:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGD
        KC  HGI  CIQ+ET+YNGLN  T++++D S NG +LSK Y EA+ I+ERI+ N +QW  +RA S     G+  VDA   L S++S +  ++    T+G 
Subjt:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGD

Query:  AGASKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYLAA-------------------------QKPSEMSLKEMFKADMTKSDA---NVQSQA-----
           +    N  ++  C YC   H FE CP NP S++Y+ A                         Q  S  SL+ + KA M K+DA   N+++Q      
Subjt:  AGASKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYLAA-------------------------QKPSEMSLKEMFKADMTKSDA---NVQSQA-----

Query:  -------ALLQSQVASLHN---------------------------------------------------------------------------------
                 L S   +L N                                                                                 
Subjt:  -------ALLQSQVASLHN---------------------------------------------------------------------------------

Query:  -------MEVQIVEVLEQMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTG
               + + +VE LEQMP YVKF+KD+L+KKR+LGEFET+ L  EC+T L  K P K+KDP  FT+  + G
Subjt:  -------MEVQIVEVLEQMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTG

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]2.2e-3227.64Show/hide
Query:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKIS----RLVDIVMMNA
        KC  HGI  CIQ+ET+YNGLN  T++++D S NG +LSK Y +A+ ILE I+   +QW  SRA++     G+  VD+I  + ++++     L ++ M N 
Subjt:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKIS----RLVDIVMMNA

Query:  TSGDAGASKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYL-----------------------------------------------AAQKPSEMSLK
         S +   S +++N  ++  C +C   H +++CP NP SVFY+                                               + Q P   SL+
Subjt:  TSGDAGASKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYL-----------------------------------------------AAQKPSEMSLK

Query:  EMFKADMTKSDANVQSQAALLQSQVASLHNMEVQI-----------------------------------------------------------------
         M K  + K++A+     AL+QSQ ASL N+E Q+                                                                 
Subjt:  EMFKADMTKSDANVQSQAALLQSQVASLHNMEVQI-----------------------------------------------------------------

Query:  ------------------------------------------------------VEVLEQMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPE
                                                              VE LEQMP YVKF+KD+LTKKRRLGEFET+ L  ECS+ L  K P 
Subjt:  ------------------------------------------------------VEVLEQMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPE

Query:  KMKDPESFTVLVSTGE
        KMKDP SFT+  + G+
Subjt:  KMKDPESFTVLVSTGE

XP_022158768.1 uncharacterized protein LOC111025234 [Momordica charantia]5.2e-4533.04Show/hide
Query:  NAPILPELKVERLEGAEAGQAVPVPPLNVKCLQHGITRCI--------------------------------QIETYYNGL------------------N
        N PI P+LKVER++G  AG     PPLN   L   I + I                                Q+   ++GL                  +
Subjt:  NAPILPELKVERLEGAEAGQAVPVPPLNVKCLQHGITRCI--------------------------------QIETYYNGL------------------N

Query:  EATQLLIDVSVN-GVLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGDAGASKAKVNVVQSTLCPYCEG
        +  + +ID S N  +L K Y EAF ILE IS NKHQ  KSRA SSTT KGL   D +A+LNSKIS+L DI M + +  DAGASKAKVN +Q+  CPYC+ 
Subjt:  EATQLLIDVSVN-GVLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGDAGASKAKVNVVQSTLCPYCEG

Query:  EHQFENCPGNPASVFYL-------------AAQKPSEMSLKEMFKADMTKSDANVQSQAALLQSQVASLHNME---------------------------
        EH FE+CPGNP SVFYL             A + P  M+L++MFKA M K++  +      +QS  ASL N++                           
Subjt:  EHQFENCPGNPASVFYL-------------AAQKPSEMSLKEMFKADMTKSDANVQSQAALLQSQVASLHNME---------------------------

Query:  -------------------------------------------------------------------------------------VQIVEVLEQMPTYVK
                                                                                             + +VE L+QMPTYV+
Subjt:  -------------------------------------------------------------------------------------VQIVEVLEQMPTYVK

Query:  FLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTGEK
        FLKD+LTKK RL EF+ +CL  ECSTIL+SK  EKMK P SFT+ ++ G++
Subjt:  FLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTGEK

XP_030505222.1 uncharacterized protein LOC115720205 [Cannabis sativa]4.2e-3131.85Show/hide
Query:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGD
        KC  HGI  CIQ+ET+YNGLN A+++++D S NG +LSK Y EAF ILERI+ N +QW  +RA +S    G++ VDA+  L ++++ + +I + N   G 
Subjt:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGD

Query:  AGASKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYLAA------------------------QKPSE------MSLKEMFKADMTKSDANVQSQAAL-
        +    A +   + + C YC   H FENCP NP  +    A                        Q+P +       SL+ + +  M K+DA +QSQA + 
Subjt:  AGASKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYLAA------------------------QKPSE------MSLKEMFKADMTKSDANVQSQAAL-

Query:  --LQSQVASLHN-------------------------------------------------------------------------------MEVQIVEVL
          L+ Q+  L N                                                                               + + +VE L
Subjt:  --LQSQVASLHN-------------------------------------------------------------------------------MEVQIVEVL

Query:  EQMPTYVKFLKDVLTKKRRLGEFETICLANECSTIL
        EQMPTYVKFLKD+LTKKRRLGEFETI L   CS +L
Subjt:  EQMPTYVKFLKDVLTKKRRLGEFETICLANECSTIL

XP_030508947.1 uncharacterized protein LOC115723603 [Cannabis sativa]4.7e-3030.48Show/hide
Query:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIV----MMNA
        KC  HGI  CIQ+ET+YNGLN A+++++D S +G +LSK Y EAF ILE I+ N +QW  +RA +S    G++ VDA+  L ++++ + +I+    M  +
Subjt:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIV----MMNA

Query:  TSGDAGASKAKVNVVQSTLCPYCEGEHQFENCPGNPAS---VFYLAAQKPSE--------------------------------MSLKEMFKADMTKSDA
            A   +AK++      C YC   H FEN P NPAS    F    Q+ S                                  SL+ + +  M K+DA
Subjt:  TSGDAGASKAKVNVVQSTLCPYCEGEHQFENCPGNPAS---VFYLAAQKPSE--------------------------------MSLKEMFKADMTKSDA

Query:  N--------------------------VQSQAALLQSQ---------------------------VASLH------------------------------
        N                          ++S A    S+                           V++ H                              
Subjt:  N--------------------------VQSQAALLQSQ---------------------------VASLH------------------------------

Query:  -------NMEVQIVEVLEQMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTG
               N+ + + E LEQMPTYVKFLKD+LT+KRRLGEFET+ L    S +L SK P K+KDP SFT+ +S G
Subjt:  -------NMEVQIVEVLEQMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTG

TrEMBL top hitse value%identityAlignment
A0A5B6VNY6 Gag-asp_proteas domain-containing protein8.6e-3037.35Show/hide
Query:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNGV-LSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNS---KISRLVDIVMMNAT
        +C  +GI  CIQ+ET+YNGLN  T+L++D S NGV LSK Y EA+GI++RI+    QWL +R  S      +  VDA+A L +    IS ++     NA 
Subjt:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNGV-LSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNS---KISRLVDIVMMNAT

Query:  SGDAGASKAKVNVVQSTLCPYCEGEHQ--------FENCPGNPASVFYLAAQKPSEMSLKEMFKADMTKSDANVQSQAALLQSQVASLHNMEVQIVEVLE
        +  A     + +VV        +   Q         EN  G  A+        PS+M     F  +  K  A ++S   L    V    ++ + +VE LE
Subjt:  SGDAGASKAKVNVVQSTLCPYCEGEHQ--------FENCPGNPASVFYLAAQKPSEMSLKEMFKADMTKSDANVQSQAALLQSQVASLHNMEVQIVEVLE

Query:  QMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTGE
        QMP YVKF+KD+++KK+RL EFE + L  E +  L +K P KMKDP SFT+  + GE
Subjt:  QMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTGE

A0A6J1DRG1 uncharacterized protein LOC1110236691.2e-2851.77Show/hide
Query:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNAT-SG
        KC  HGI RCIQIE YY GL++AT+L+ID S NG +L KPY EAF ILERIS N H W   RA      KGL   ++   LNSK+  L ++VM + T   
Subjt:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNAT-SG

Query:  DAGASKAKVNV--VQSTLCPYCEGEHQFENCPGNPASVFYL
          GAS  K NV  +Q   C +CEGEH + N P NP SV+YL
Subjt:  DAGASKAKVNV--VQSTLCPYCEGEHQFENCPGNPASVFYL

A0A6J1DX14 uncharacterized protein LOC1110252342.5e-4533.04Show/hide
Query:  NAPILPELKVERLEGAEAGQAVPVPPLNVKCLQHGITRCI--------------------------------QIETYYNGL------------------N
        N PI P+LKVER++G  AG     PPLN   L   I + I                                Q+   ++GL                  +
Subjt:  NAPILPELKVERLEGAEAGQAVPVPPLNVKCLQHGITRCI--------------------------------QIETYYNGL------------------N

Query:  EATQLLIDVSVN-GVLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGDAGASKAKVNVVQSTLCPYCEG
        +  + +ID S N  +L K Y EAF ILE IS NKHQ  KSRA SSTT KGL   D +A+LNSKIS+L DI M + +  DAGASKAKVN +Q+  CPYC+ 
Subjt:  EATQLLIDVSVN-GVLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGDAGASKAKVNVVQSTLCPYCEG

Query:  EHQFENCPGNPASVFYL-------------AAQKPSEMSLKEMFKADMTKSDANVQSQAALLQSQVASLHNME---------------------------
        EH FE+CPGNP SVFYL             A + P  M+L++MFKA M K++  +      +QS  ASL N++                           
Subjt:  EHQFENCPGNPASVFYL-------------AAQKPSEMSLKEMFKADMTKSDANVQSQAALLQSQVASLHNME---------------------------

Query:  -------------------------------------------------------------------------------------VQIVEVLEQMPTYVK
                                                                                             + +VE L+QMPTYV+
Subjt:  -------------------------------------------------------------------------------------VQIVEVLEQMPTYVK

Query:  FLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTGEK
        FLKD+LTKK RL EF+ +CL  ECSTIL+SK  EKMK P SFT+ ++ G++
Subjt:  FLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTGEK

A0A6J1DXK5 uncharacterized protein LOC1110255003.6e-2837.78Show/hide
Query:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNAT---
        K L  GI RCIQI+TYYNGL++AT+L+ID S NG +L+KPY EAF ILERIS N   W   RA      KG    ++   LN KI  L D+VM + T   
Subjt:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNAT---

Query:  SGDAGASKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYLAAQKPSEM------------SLKEMFKADMTKSD--ANVQSQAALLQSQVASLHNMEVQ
        +  A A KA V+ +Q   C +C GE+++ NCPGNP SV YL   + +E             ++  + + ++ K      +++    +QSQ  SL N+E+Q
Subjt:  SGDAGASKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYLAAQKPSEM------------SLKEMFKADMTKSD--ANVQSQAALLQSQVASLHNMEVQ

Query:  IVEVLEQMPTYVKFL--KDVLTKKR
        + ++   + +  K +   D+   KR
Subjt:  IVEVLEQMPTYVKFL--KDVLTKKR

A0A6J1EQ90 uncharacterized protein LOC1114364113.0e-3027.42Show/hide
Query:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGD
        KC  HG+  CIQ+ET+YNGLN  T+ ++D S NG +LSK Y EA+ ILERI+ N  QW   R+      +G++ VDA++ +N++++ + +I+   A   D
Subjt:  KCLQHGITRCIQIETYYNGLNEATQLLIDVSVNG-VLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIADLNSKISRLVDIVMMNATSGD

Query:  AGA-----SKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYLA--------------------------------------------------------
        +       + A +N   +  C YC  EH F+ CP NPAS+FY+                                                         
Subjt:  AGA-----SKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYLA--------------------------------------------------------

Query:  ---------------AQKPSEMSLKEMFKADMTKSDANVQSQAALLQS---QVASLHNME----------------------------------------
                       AQ  SE S++ + K  M K+DA +QSQ A L++   Q+    N E                                        
Subjt:  ---------------AQKPSEMSLKEMFKADMTKSDANVQSQAALLQS---QVASLHNME----------------------------------------

Query:  -------------------------------------------VQIVEVLEQMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESF
                                                   + +VE L+QMP YVKFLKDVL  +R+  EF+ + L  ECS IL +K P K KDP SF
Subjt:  -------------------------------------------VQIVEVLEQMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESF

Query:  TVLVSTGEKGEAVVGSQSGISIN
        T+ VS G K         G +IN
Subjt:  TVLVSTGEKGEAVVGSQSGISIN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTGAATGCACCAATTCTACCTGAACTGAAAGTGGAGAGGTTAGAAGGGGCTGAAGCTGGCCAAGCTGTGCCAGTACCTCCCCTGAATGTTAAGTGCTTGCAACA
TGGCATAACACGCTGCATCCAAATTGAAACTTATTACAACGGATTGAATGAGGCTACACAGTTGTTGATAGATGTCTCGGTGAATGGAGTGTTGTCAAAACCATATGAAG
AGGCATTTGGAATTTTGGAGAGAATTTCGTGCAACAAGCATCAATGGTTGAAATCAAGGGCTAAGTCATCGACAACATTTAAAGGCTTAGTGGCAGTAGATGCAATAGCA
GACCTGAACTCAAAGATCTCTAGGCTAGTTGATATAGTAATGATGAATGCAACTTCAGGTGATGCTGGAGCCTCCAAAGCTAAAGTGAATGTTGTGCAGAGTACACTATG
CCCATATTGTGAGGGCGAACATCAGTTTGAGAATTGTCCAGGAAATCCCGCCTCAGTATTTTACTTGGCCGCTCAAAAGCCTTCGGAGATGAGCTTGAAAGAGATGTTTA
AGGCTGATATGACAAAGAGTGACGCCAATGTCCAGAGTCAAGCTGCGTTACTTCAGAGTCAGGTAGCATCCTTACATAATATGGAAGTGCAAATTGTGGAAGTCTTAGAG
CAGATGCCTACATATGTGAAGTTTTTGAAAGATGTTCTTACTAAGAAGCGTAGGCTGGGAGAATTTGAAACAATTTGCTTAGCCAATGAGTGTAGTACAATTTTGACCAG
TAAGAAACCTGAGAAAATGAAGGACCCTGAAAGTTTCACAGTCCTAGTGTCTACTGGAGAGAAAGGTGAGGCGGTGGTGGGTTCTCAATCTGGAATCTCAATCAATGACC
CTCATGACGACTCATCTAATCCACCAAAGTCTAGTACTTCTCTTCATCGGCCAGGAAAAAAGCCCAAACAACATGGTTCTTTATACGATAAATTGGTTGGCCGAAAGATG
GCGGCATATGTTCAGAATGATAAGTTGCTTATACATTGCTTCCAAGATAGTTTAACAGGAAAATCTTTTGATGCCTATACACATTTTGGAAATGAGGCTAGGGAAGAACA
GACGCCCAAACTTTTTGGTGGCGCACTAGGTGATTTCGCTGTGGATAAGCTGGCTTACATCGATGCTCTTGCTCTCCATCAAACAAGCTTGGAGGAGGACTCTTTCCTTC
ATCACGGACGAATCATGTGTGGACGCTCTGGGGTGGGCAACAATTGTGAGCGCATGAGCAAATTGCTCCTTAGTGGGTTGTTCCAGCATTCGGTTGAACTCGGCAGTAGG
GTTGGAAGGCAGATGGTGGGCTTTGTTGATTTTTTCGAGGCTCCAATTGACTTTGACCTCCCTCACAATTGTTTTCCCAATGTTGTCGGTCTTGAGTTGCTCCGAGTTAG
AAGTAGGGTCGGAAGCGACGTAGAAACAGTAGAGGGCTTGACAGGGGAGGCTACCAAAGTCTTACGGACTTCAGCCGCTACTGATTGTTCATTACCATCGGCGAAGTCCT
CCGACGAACTCTCGGACAGCGGGAGCTCTATAGCTTTCTTTCGATCTGCTTTCGTCTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCTGAATGCACCAATTCTACCTGAACTGAAAGTGGAGAGGTTAGAAGGGGCTGAAGCTGGCCAAGCTGTGCCAGTACCTCCCCTGAATGTTAAGTGCTTGCAACA
TGGCATAACACGCTGCATCCAAATTGAAACTTATTACAACGGATTGAATGAGGCTACACAGTTGTTGATAGATGTCTCGGTGAATGGAGTGTTGTCAAAACCATATGAAG
AGGCATTTGGAATTTTGGAGAGAATTTCGTGCAACAAGCATCAATGGTTGAAATCAAGGGCTAAGTCATCGACAACATTTAAAGGCTTAGTGGCAGTAGATGCAATAGCA
GACCTGAACTCAAAGATCTCTAGGCTAGTTGATATAGTAATGATGAATGCAACTTCAGGTGATGCTGGAGCCTCCAAAGCTAAAGTGAATGTTGTGCAGAGTACACTATG
CCCATATTGTGAGGGCGAACATCAGTTTGAGAATTGTCCAGGAAATCCCGCCTCAGTATTTTACTTGGCCGCTCAAAAGCCTTCGGAGATGAGCTTGAAAGAGATGTTTA
AGGCTGATATGACAAAGAGTGACGCCAATGTCCAGAGTCAAGCTGCGTTACTTCAGAGTCAGGTAGCATCCTTACATAATATGGAAGTGCAAATTGTGGAAGTCTTAGAG
CAGATGCCTACATATGTGAAGTTTTTGAAAGATGTTCTTACTAAGAAGCGTAGGCTGGGAGAATTTGAAACAATTTGCTTAGCCAATGAGTGTAGTACAATTTTGACCAG
TAAGAAACCTGAGAAAATGAAGGACCCTGAAAGTTTCACAGTCCTAGTGTCTACTGGAGAGAAAGGTGAGGCGGTGGTGGGTTCTCAATCTGGAATCTCAATCAATGACC
CTCATGACGACTCATCTAATCCACCAAAGTCTAGTACTTCTCTTCATCGGCCAGGAAAAAAGCCCAAACAACATGGTTCTTTATACGATAAATTGGTTGGCCGAAAGATG
GCGGCATATGTTCAGAATGATAAGTTGCTTATACATTGCTTCCAAGATAGTTTAACAGGAAAATCTTTTGATGCCTATACACATTTTGGAAATGAGGCTAGGGAAGAACA
GACGCCCAAACTTTTTGGTGGCGCACTAGGTGATTTCGCTGTGGATAAGCTGGCTTACATCGATGCTCTTGCTCTCCATCAAACAAGCTTGGAGGAGGACTCTTTCCTTC
ATCACGGACGAATCATGTGTGGACGCTCTGGGGTGGGCAACAATTGTGAGCGCATGAGCAAATTGCTCCTTAGTGGGTTGTTCCAGCATTCGGTTGAACTCGGCAGTAGG
GTTGGAAGGCAGATGGTGGGCTTTGTTGATTTTTTCGAGGCTCCAATTGACTTTGACCTCCCTCACAATTGTTTTCCCAATGTTGTCGGTCTTGAGTTGCTCCGAGTTAG
AAGTAGGGTCGGAAGCGACGTAGAAACAGTAGAGGGCTTGACAGGGGAGGCTACCAAAGTCTTACGGACTTCAGCCGCTACTGATTGTTCATTACCATCGGCGAAGTCCT
CCGACGAACTCTCGGACAGCGGGAGCTCTATAGCTTTCTTTCGATCTGCTTTCGTCTTGTAA
Protein sequenceShow/hide protein sequence
MNLNAPILPELKVERLEGAEAGQAVPVPPLNVKCLQHGITRCIQIETYYNGLNEATQLLIDVSVNGVLSKPYEEAFGILERISCNKHQWLKSRAKSSTTFKGLVAVDAIA
DLNSKISRLVDIVMMNATSGDAGASKAKVNVVQSTLCPYCEGEHQFENCPGNPASVFYLAAQKPSEMSLKEMFKADMTKSDANVQSQAALLQSQVASLHNMEVQIVEVLE
QMPTYVKFLKDVLTKKRRLGEFETICLANECSTILTSKKPEKMKDPESFTVLVSTGEKGEAVVGSQSGISINDPHDDSSNPPKSSTSLHRPGKKPKQHGSLYDKLVGRKM
AAYVQNDKLLIHCFQDSLTGKSFDAYTHFGNEAREEQTPKLFGGALGDFAVDKLAYIDALALHQTSLEEDSFLHHGRIMCGRSGVGNNCERMSKLLLSGLFQHSVELGSR
VGRQMVGFVDFFEAPIDFDLPHNCFPNVVGLELLRVRSRVGSDVETVEGLTGEATKVLRTSAATDCSLPSAKSSDELSDSGSSIAFFRSAFVL