CuGenDBv2

Moc06g29510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g29510
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CCHC-type domain-containing protein
Genome location: chr6:22200612..22203632
RNA-Seq Expression: Moc06g29510
Synteny: Moc06g29510
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR036875 - Zinc finger, CCHC-type superfamily
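
The genome location in the record above follows a `chromosome:start..end` pattern. A minimal Python sketch for splitting such a string and computing the locus span is given here; the helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr6:22200612..22203632")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 3021 bp, which is longer than the 1620-nt CDS listed further down.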


Homology
GenBank top hits (e-value, % identity, alignment)
BBG97541.1 disease resistance family protein / LRR family protein [Prunus dulcis] | e-value: 7.1e-34 | % identity: 37.76
Query:  VKDLLTCKKIHKNLGE-RPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINEL
        ++D L  KK+++ L E +P GM D+DW  +D QA+  IR+ LS NV   +AKE TT  L+  L   YEKPSA+ K+ L  + FN+ M EG SV  H+NEL
Subjt:  VKDLLTCKKIHKNLGE-RPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINEL

Query:  TDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQ
          +  +L  +G++ DE+V+A+ LL+SL  SW    TAVS+S G N L F  + D  + EE RR+     +++S    E       +   + + +Y G+ +
Subjt:  TDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQ

Query:  QRYSR----GSGSSSGEVECFYCHKKGYIKRFCRKFKEDVE
         R S+     +  SS  VEC+ C K G+ K  C+   +D E
Subjt:  QRYSR----GSGSSSGEVECFYCHKKGYIKRFCRKFKEDVE

BBH05460.1 hypothetical protein Prudu_016848 [Prunus dulcis] | e-value: 7.1e-34 | % identity: 37.76
Query:  VKDLLTCKKIHKNLGE-RPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINEL
        ++D L  KK+++ L E +P GM D+DW  +D QA+  IR+ LS NV   +AKE TT  L+  L   YEKPSA+ K+ L  + FN+ M EG SV  H+NEL
Subjt:  VKDLLTCKKIHKNLGE-RPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINEL

Query:  TDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQ
          +  +L  +G++ DE+V+A+ LL+SL  SW    TAVS+S G N L F  + D  + EE RR+     +++S    E       +   + + +Y G+ +
Subjt:  TDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQ

Query:  QRYSR----GSGSSSGEVECFYCHKKGYIKRFCRKFKEDVE
         R S+     +  SS  VEC+ C K G+ K  C+   +D E
Subjt:  QRYSR----GSGSSSGEVECFYCHKKGYIKRFCRKFKEDVE

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] | e-value: 2.0e-36 | % identity: 41.13
Query:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT
        ++D L  KK+H+ L  +P  M  ++W+ +D Q +  IR+ LS NV   VAKE TT+ L+KVL D YEKPSAN K+ L  K F++ MEEG  V +H+NE  
Subjt:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT

Query:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ
         I+N+L  + ++ D++V+A+ L+ SL NSWE M+ AVSNS+G   LKF  + D  + EE RR +     STS A N VE+    QN+        G+ + 
Subjt:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ

Query:  RYSRGSGSSSGEVECFYCHKKGYIKRFC-RKFKEDVEKGNTIVNVVTE
        R  +G   S   VEC+ C K G+ K  C    K++  KG    N VT+
Subjt:  RYSRGSGSSSGEVECFYCHKKGYIKRFC-RKFKEDVEKGNTIVNVVTE

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] | e-value: 3.4e-36 | % identity: 39.44
Query:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT
        ++D L  KK+H+ L  +P  M  ++W+ +D Q +  IR+ LS NV   VAKE TT+ L+KVL D YEKPSAN K+ L  K F++ MEEG  V +H+NE  
Subjt:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT

Query:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYN---GK
         I+N+L  + ++ D++V+A+ LL SL NSWE M+ AVSNS+G   LKF  + D  + EE RR            E  + SA   +N+ + +   N   G+
Subjt:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYN---GK

Query:  QQQRYSRGSGSSSGEVECFYCHKKGYIKRFC-RKFKEDVEKGNTIVNVVTE
         + R  +G   S   VEC+ C K G+ K  C    K++  KG    N VT+
Subjt:  QQQRYSRGSGSSSGEVECFYCHKKGYIKRFC-RKFKEDVEKGNTIVNVVTE

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa] | e-value: 5.8e-36 | % identity: 42.67
Query:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT
        ++D L  KK+H+ L  +P  M  ++W+ +D Q +  IR+ LS NV   VAKE TT+ L+KVL D YEKPSAN K+ L  K F++ MEEG  V +H+NE  
Subjt:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT

Query:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ
         I+N+L  + ++ D++V+A+ LL SL NSWE M+ AVSNS+G   LKF  + D  + EE RR +     STS A N VE+    QN+        G+ + 
Subjt:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ

Query:  RYSRGSGSSSGEVECFYCHKKGYIK
        R  +G   S   VEC+ C K G+ K
Subjt:  RYSRGSGSSSGEVECFYCHKKGYIK

TrEMBL top hits (e-value, % identity, alignment)
A0A0D3AEM1 CCHC-type domain-containing protein | e-value: 1.3e-38 | % identity: 42.74
Query:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT
        ++D L  KK+H+ L ++P  M+  +W  +D Q +  IR+ LS NV   V KE TT+ L+KVL D YEKPSAN+K+ L  K F++ MEEG  V +HINE  
Subjt:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT

Query:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ
         I+N+L  + ++ +++V+A+ LL SL NSWE+M+ AVSNS+G   LKF  + D  + EE RR +    ASTS A N VE+    +N ++   S NG+ + 
Subjt:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ

Query:  RYSRGSGSSSGEVECFYCHKKGYIKRFCR--KFKEDVEKGNTIVNVVT
        R  RG        EC+ C K G+IK+ CR    KED  +G    N VT
Subjt:  RYSRGSGSSSGEVECFYCHKKGYIKRFCR--KFKEDVEKGNTIVNVVT

A0A0D3CS45 Uncharacterized protein | e-value: 6.7e-38 | % identity: 42.34
Query:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT
        ++D L  KK+H+ L ++P  M+  +W  +D Q +  IR+ LS NV   VAKE  T+ L+KVL D YEKPSAN K+ L  K F++ MEEG  V +H+NE  
Subjt:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT

Query:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ
         I+N+L  + ++ +++V+A+ LL SL NSWE M+ AVSNS+G   LKF  + D  + EE RR +    ASTS A N VE+    +N ++   S NG+ + 
Subjt:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ

Query:  RYSRGSGSSSGEVECFYCHKKGYIKRFCR--KFKEDVEKGNTIVNVVT
        R  RG        EC+ C K G+IK+ CR    KED  +G    N VT
Subjt:  RYSRGSGSSSGEVECFYCHKKGYIKRFCR--KFKEDVEKGNTIVNVVT

A0A0D3DMW7 CCHC-type domain-containing protein | e-value: 2.5e-37 | % identity: 40.24
Query:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT
        ++D L  KK+H+ L ++P  M+  +W  +D Q +  IR+ LS NV   +AKE TT+ L+KVL D YEKPS N K+ L  K F++ MEEG  V +H+NE  
Subjt:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELT

Query:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ
         I+N+L  + ++ D++V+A+ LL SL NSWE M+ AV+NS+G   LKF  + D  + EE RR +     STS A N VE+    +N ++   S NG+ + 
Subjt:  DILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQ

Query:  RYSRGSGSSSGEVECFYCHKKGYIKRFCR--KFKEDVEKG--NTIVNVVTE
        R  RG        EC+ C   G+IK+ CR    KED  +G  N + + +T+
Subjt:  RYSRGSGSSSGEVECFYCHKKGYIKRFCR--KFKEDVEKG--NTIVNVVTE

A0A2N9GHK9 Uncharacterized protein | e-value: 2.1e-36 | % identity: 39.52
Query:  VKDLLTCKKIH-KNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINEL
        ++D L  KK+H   LGE+P  MED +W  +D Q +  IR+ LS  V   V KE TT +L+  L   YEKPSAN K+ L  K FN+ M EGT+V  H+NE 
Subjt:  VKDLLTCKKIH-KNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINEL

Query:  TDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQ
          I N+L  + ++ D++++A+ +L SL NSWE M+ AVSNS G+  LK+  I D  + EE RR+     +S+  A N     L A+ + K +    G+ +
Subjt:  TDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQ

Query:  QRYSRGSGSSSGEVECFYCHKKGYIKRFCRKFKEDVEKGNTIVNVVTE
         R  R       ++EC+ C K G+I++ C + K+  E  N   NVVTE
Subjt:  QRYSRGSGSSSGEVECFYCHKKGYIKRFCRKFKEDVEKGNTIVNVVTE

A0A2N9IKI1 Uncharacterized protein | e-value: 2.1e-36 | % identity: 39.52
Query:  VKDLLTCKKIH-KNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINEL
        ++D L  KK+H   LGE+P  MED +W  +D Q +  IR+ LS  V   V KE TT +L+  L   YEKPSAN K+ L  K FN+ M EGT+V  H+NE 
Subjt:  VKDLLTCKKIH-KNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINEL

Query:  TDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQ
          I N+L  + ++ D++++A+ +L SL NSWE M+ AVSNS G+  LK+  I D  + EE RR+     +S+  A N     L A+ + K +    G+ +
Subjt:  TDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQ

Query:  QRYSRGSGSSSGEVECFYCHKKGYIKRFCRKFKEDVEKGNTIVNVVTE
         R  R       ++EC+ C K G+I++ C + K+  E  N   NVVTE
Subjt:  QRYSRGSGSSSGEVECFYCHKKGYIKRFCRKFKEDVEKGNTIVNVVTE

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein | e-value: 5.3e-08 | % identity: 22.48
Query:  DKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELTDILNKLEGMGVKIDEKVKAMRL
        D  W + +  A ++I   LS +  +    +IT + +L+ L   YE+ S  +++ L  +  ++ +    S+ SH +   +++++L   G KI+E  K   L
Subjt:  DKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELTDILNKLEGMGVKIDEKVKAMRL

Query:  LTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMS-YNGKQQQRYSRGSGSSSGEVECFYCHKK
        L +L + ++ + TA+  +L E +L    + +  + +E      K+    +    +V +A+V  N    K + +  +  +      G+S  +V+C +C ++
Subjt:  LTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMS-YNGKQQQRYSRGSGSSSGEVECFYCHKK

Query:  GYIKRFCRKFKEDVEKGN
        G+IK+ C  +K  +   N
Subjt:  GYIKRFCRKFKEDVEKGN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 1.5e-18 | % identity: 28.09
Query:  RTVKDLLTCKKIHKNL---GERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSH
        R ++DLL  + +HK L    ++P  M+ +DW ++DE+A ++IR+ LS +V + +  E T + +   L+  Y   +   K+ L  + + +HM EGT+  SH
Subjt:  RTVKDLLTCKKIHKNL---GERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSH

Query:  INELTDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYN
        +N    ++ +L  +GVKI+E+ KA+ LL SL +S++ + T + +  G+ +++   +  A +  E  RK           EN+ ++ +        + S N
Subjt:  INELTDILNKLEGMGVKIDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYN

Query:  --GKQQQRYSRGSGSSSGEVECFYCHKKGYIKRFC
          G+   R    + S S    C+ C++ G+ KR C
Subjt:  --GKQQQRYSRGSGSSSGEVECFYCHKKGYIKRFC

Arabidopsis top hits (e-value, % identity, alignment)
AT3G29785.1 unknown protein | e-value: 3.8e-09 | % identity: 39.47
Query:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKIL
        ++D L  KK+H+ LG++   M   DWN +  Q +  IR+ +S N+   VAKE +   L+KVL D Y+KPS N  ++
Subjt:  VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKIL
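
The alignment blocks on this page appear to follow the usual BLAST pairwise layout, in which the middle row shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch. Assuming that convention, the % identity column can be recomputed from the middle row. The sketch below does so for the AT3G29785.1 alignment; the function name `percent_identity` is hypothetical.

```python
def percent_identity(query_row, midline):
    """Percent identity = identical columns / aligned columns.

    Assumes a BLAST-style midline where only identical residues are letters.
    The query row includes any gap characters, so its length is taken as the
    number of aligned columns.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(query_row)

# Query row and middle row copied from the AT3G29785.1 alignment.
query_row = "VKDLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKIL"
midline = "++D L  KK+H+ LG++   M   DWN +  Q +  IR+ +S N+   VAKE +   L+KVL D Y+KPS N  ++"

print(round(percent_identity(query_row, midline), 2))
```

For this hit the middle row carries 30 letters over 76 aligned columns, which agrees with the 39.47 listed for AT3G29785.1.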


Sequences
CDS sequence
ATGAAGATAATTGTCTTCCCTTATATACCCACACCGTTCACTTCCCTAACCAATGTGGAACTTGTTGGGATTGGTGGAGTCAAAACGACGCCGGAACGAGCTAGGGAAGA
GTTGGGATCAAGCCAGGATGCGTCAGGATCGAAGCAGGATCAAAACAGGGGAGAAGCGGAAGCTGCAGGCCACTTGGGCCGAGGAGGCAGCGGCGCGGCTCTAGGGGACA
GCGCCACGGCGCCGTCCAGATGGGGGCGCAACACGGACTTTCCATCTGTAGTGCATGACGCTGTTTCAATGTTAATCAATTTTCCAACAAGTGGTATCAGAACTGTAAAG
GATCTTCTTACATGCAAGAAGATACACAAGAATTTAGGGGAGAGACCAGTAGGGATGGAGGACAAGGATTGGAATGAGATGGATGAGCAGGCCGTTGCGAGCATCAGAAT
GGTGTTATCAATGAATGTTTGTAGTTTGGTGGCGAAAGAGATTACAACGAAAGATTTGTTGAAGGTCTTGCAAGACAGGTATGAAAAACCTTCTGCCAATACAAAAATAC
TTCTATGGACAAAGTATTTTAATATCCACATGGAGGAGGGAACCTCGGTGAATTCCCACATTAATGAACTCACCGATATCTTGAACAAACTAGAAGGGATGGGTGTCAAG
ATTGACGAGAAGGTGAAAGCTATGAGGCTGTTGACGTCTTTGCTTAACAGTTGGGAGACGATGAAGACCGCGGTGTCGAATTCGCTAGGAGAAAATAGCTTGAAATTTAC
AACTATTTGTGATGCCGCCATATATGAGGAAGCCCGGAGAAAATTAGGGAAAATGTATGCATCTACTTCAGGGGCAGAAAACGAGGTTGAATCAGCTTTGGTAGCTCAAA
ACAAAGAGAAGGCAAAGATGAGTTACAATGGGAAGCAGCAGCAGAGATATAGCAGGGGTAGTGGGAGTTCTAGTGGAGAAGTGGAGTGTTTTTACTGCCACAAGAAGGGA
TACATTAAACGCTTTTGCAGGAAGTTTAAAGAAGATGTTGAGAAGGGGAACACTATTGTAAATGTTGTAACAGAAGGAGAACGGATTGAAGAACTTGGGTGGGAGCGCCA
AGTCATCAGGGGAATCTTCCTTAGGAGGTCGTTGGGTTCGATCGAGAAAGGAAGCGACAGGGACCACTTAGATCAAATGGGAGCACGTGGCTATGTCTCTAAGTTTGGGG
GACAAGACCACGTAGAGCTGCTCAGTCTCGGGGCAGAAGGGTGTCAAGAAGTTGATGGGGCACTGTTGATGCAGTGGGAGGCAGAGAATTGTCGCTTTGTCTCCAAGTGG
GAGATTGTTGGGATTGGTGGAGTCAAAACAACGCCGGAACGAGCTAGGGAAGAGTCTGGATCAAGCCGGGACGCGTCGGGATTGAAGCAGGATGAAAACAGGGGAGAAGC
GAAAGCTGCAGGCCACTTGGGCCGAGGAGGCAGCGCCGCGGCGCTGCCAGATGGGGGCACAACACGGACTTTCCCCGATTTTGAGCCATTTTTCAGAGGGTTTCGTGGCA
ATTTCAAGGCGAAGGCTCGGGCATTGGCAGAGGCATTGAGGCTCATCAAGATCAACACATTCGGGGTAATTAGATCTTGA
mRNA sequence
ATGAAGATAATTGTCTTCCCTTATATACCCACACCGTTCACTTCCCTAACCAATGTGGAACTTGTTGGGATTGGTGGAGTCAAAACGACGCCGGAACGAGCTAGGGAAGA
GTTGGGATCAAGCCAGGATGCGTCAGGATCGAAGCAGGATCAAAACAGGGGAGAAGCGGAAGCTGCAGGCCACTTGGGCCGAGGAGGCAGCGGCGCGGCTCTAGGGGACA
GCGCCACGGCGCCGTCCAGATGGGGGCGCAACACGGACTTTCCATCTGTAGTGCATGACGCTGTTTCAATGTTAATCAATTTTCCAACAAGTGGTATCAGAACTGTAAAG
GATCTTCTTACATGCAAGAAGATACACAAGAATTTAGGGGAGAGACCAGTAGGGATGGAGGACAAGGATTGGAATGAGATGGATGAGCAGGCCGTTGCGAGCATCAGAAT
GGTGTTATCAATGAATGTTTGTAGTTTGGTGGCGAAAGAGATTACAACGAAAGATTTGTTGAAGGTCTTGCAAGACAGGTATGAAAAACCTTCTGCCAATACAAAAATAC
TTCTATGGACAAAGTATTTTAATATCCACATGGAGGAGGGAACCTCGGTGAATTCCCACATTAATGAACTCACCGATATCTTGAACAAACTAGAAGGGATGGGTGTCAAG
ATTGACGAGAAGGTGAAAGCTATGAGGCTGTTGACGTCTTTGCTTAACAGTTGGGAGACGATGAAGACCGCGGTGTCGAATTCGCTAGGAGAAAATAGCTTGAAATTTAC
AACTATTTGTGATGCCGCCATATATGAGGAAGCCCGGAGAAAATTAGGGAAAATGTATGCATCTACTTCAGGGGCAGAAAACGAGGTTGAATCAGCTTTGGTAGCTCAAA
ACAAAGAGAAGGCAAAGATGAGTTACAATGGGAAGCAGCAGCAGAGATATAGCAGGGGTAGTGGGAGTTCTAGTGGAGAAGTGGAGTGTTTTTACTGCCACAAGAAGGGA
TACATTAAACGCTTTTGCAGGAAGTTTAAAGAAGATGTTGAGAAGGGGAACACTATTGTAAATGTTGTAACAGAAGGAGAACGGATTGAAGAACTTGGGTGGGAGCGCCA
AGTCATCAGGGGAATCTTCCTTAGGAGGTCGTTGGGTTCGATCGAGAAAGGAAGCGACAGGGACCACTTAGATCAAATGGGAGCACGTGGCTATGTCTCTAAGTTTGGGG
GACAAGACCACGTAGAGCTGCTCAGTCTCGGGGCAGAAGGGTGTCAAGAAGTTGATGGGGCACTGTTGATGCAGTGGGAGGCAGAGAATTGTCGCTTTGTCTCCAAGTGG
GAGATTGTTGGGATTGGTGGAGTCAAAACAACGCCGGAACGAGCTAGGGAAGAGTCTGGATCAAGCCGGGACGCGTCGGGATTGAAGCAGGATGAAAACAGGGGAGAAGC
GAAAGCTGCAGGCCACTTGGGCCGAGGAGGCAGCGCCGCGGCGCTGCCAGATGGGGGCACAACACGGACTTTCCCCGATTTTGAGCCATTTTTCAGAGGGTTTCGTGGCA
ATTTCAAGGCGAAGGCTCGGGCATTGGCAGAGGCATTGAGGCTCATCAAGATCAACACATTCGGGGTAATTAGATCTTGA
Protein sequence
MKIIVFPYIPTPFTSLTNVELVGIGGVKTTPERAREELGSSQDASGSKQDQNRGEAEAAGHLGRGGSGAALGDSATAPSRWGRNTDFPSVVHDAVSMLINFPTSGIRTVK
DLLTCKKIHKNLGERPVGMEDKDWNEMDEQAVASIRMVLSMNVCSLVAKEITTKDLLKVLQDRYEKPSANTKILLWTKYFNIHMEEGTSVNSHINELTDILNKLEGMGVK
IDEKVKAMRLLTSLLNSWETMKTAVSNSLGENSLKFTTICDAAIYEEARRKLGKMYASTSGAENEVESALVAQNKEKAKMSYNGKQQQRYSRGSGSSSGEVECFYCHKKG
YIKRFCRKFKEDVEKGNTIVNVVTEGERIEELGWERQVIRGIFLRRSLGSIEKGSDRDHLDQMGARGYVSKFGGQDHVELLSLGAEGCQEVDGALLMQWEAENCRFVSKW
EIVGIGGVKTTPERAREESGSSRDASGLKQDENRGEAKAAGHLGRGGSAAALPDGGTTRTFPDFEPFFRGFRGNFKAKARALAEALRLIKINTFGVIRS
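
The protein sequence should correspond to the CDS translated codon by codon. The sketch below illustrates that relationship in Python, assuming the standard genetic code (the page does not say which translation table was used) and checking only the first and last codons of the CDS listed above. `translate` and the table construction are illustrative, not taken from CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# this is the table that applies to this nuclear gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]  # raises KeyError on non-ACGT
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 12 nucleotides of the CDS shown above.
print(translate("ATGAAGATAATTGTCTTCCCTTATATACCC"))
print(translate("ATTAGATCTTGA"))
```

The two fragments translate to `MKIIVFPYIP` and `IRS`, matching the start and end of the protein sequence above. The full 1620-nt CDS corresponds to 539 residues plus the stop codon.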