; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g35840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g35840
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:26874754..26876363
RNA-Seq ExpressionMoc04g35840
SyntenyMoc04g35840
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG97541.1 disease resistance family protein / LRR family protein [Prunus dulcis]2.0e-4440.3Show/hide
Query:  KFSGENFSFWKMQVKDLLTCKKIHNTLGE-RPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNIHM
        KF G +F FWKMQ++D L  KK++  L E +P  M D+ W  +D QA+  IR+TLS  V   +AKE T   L+ AL   YEKPSA+ K+ L  + FN+ M
Subjt:  KFSGENFSFWKMQVKDLLTCKKIHNTLGE-RPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNIHM

Query:  EEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVAQF
         EG SV  H+NEL  +  +L  +G++ +EEV+A+ LL+S P SW    TAVS+S G N L F  + D  LSEE RR+      T+S    E       + 
Subjt:  EEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVAQF

Query:  KGKGKMKYNGKQQHR----NNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVS
         G+G+  Y G+ + R     N  + +SS+ VEC+ C K GH+K  C+   +D E +   N  S
Subjt:  KGKGKMKYNGKQQHR----NNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVS

BBH05460.1 hypothetical protein Prudu_016848 [Prunus dulcis]2.0e-4440.3Show/hide
Query:  KFSGENFSFWKMQVKDLLTCKKIHNTLGE-RPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNIHM
        KF G +F FWKMQ++D L  KK++  L E +P  M D+ W  +D QA+  IR+TLS  V   +AKE T   L+ AL   YEKPSA+ K+ L  + FN+ M
Subjt:  KFSGENFSFWKMQVKDLLTCKKIHNTLGE-RPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNIHM

Query:  EEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVAQF
         EG SV  H+NEL  +  +L  +G++ +EEV+A+ LL+S P SW    TAVS+S G N L F  + D  LSEE RR+      T+S    E       + 
Subjt:  EEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVAQF

Query:  KGKGKMKYNGKQQHR----NNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVS
         G+G+  Y G+ + R     N  + +SS+ VEC+ C K GH+K  C+   +D E +   N  S
Subjt:  KGKGKMKYNGKQQHR----NNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVS

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]3.0e-4540.08Show/hide
Query:  GVLKFSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNI
        G+ KF G +F+FW+MQ++D L  KK+H  L  +P  M  + W+ +D Q +  IR+TLS  V   VAKE T + L+K L D YEKPSAN K+FL  K F++
Subjt:  GVLKFSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNI

Query:  HMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVA
         MEEG  V +H+NE   I+N+L  + ++ ++EV+A+ L+ S P SWE ++ AVSNS+G   LKF  + D  L EE RR            E    SA   
Subjt:  HMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVA

Query:  QFKGKGKMKYN---GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENE
        + +G+ + + N   G+ + RN +G   S + VEC+ C K GHFK +C    + + N+
Subjt:  QFKGKGKMKYN---GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENE

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]1.4e-4540.47Show/hide
Query:  GVLKFSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNI
        G+ KF G +F+FW+MQ++D L  KK+H  L  +P  M  + W+ +D Q +  IR+TLS  V   VAKE T + L+K L D YEKPSAN K+FL  K F++
Subjt:  GVLKFSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNI

Query:  HMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVA
         MEEG  V +H+NE   I+N+L  + ++ ++EV+A+ LL S P SWE ++ AVSNS+G   LKF  + D  L EE RR            E  + SA   
Subjt:  HMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVA

Query:  QFKGKGKMKYN---GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENE
        + +G+ + + N   G+ + RN +G   S + VEC+ C K GHFK +C    + + N+
Subjt:  QFKGKGKMKYN---GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENE

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]1.5e-4441.8Show/hide
Query:  GVLKFSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNI
        G+ KF G +F+FW+MQ++D L  KK+H  L  +P  M  + W+ +D Q +  IR+TLS  V   VAKE T + L+K L D YEKPSAN K+FL  K F++
Subjt:  GVLKFSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNI

Query:  HMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVA
         MEEG  V +H+NE   I+N+L  + ++ ++EV+A+ LL S P SWE ++ AVSNS+G   LKF  + D  L EE RR            E    SA   
Subjt:  HMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVA

Query:  QFKGKGKMKYN---GKQQHRNNRGSGNSSEEVECFYCHKKGHFK
        + +G+ + + N   G+ + RN +G   S + VEC+ C K GHFK
Subjt:  QFKGKGKMKYN---GKQQHRNNRGSGNSSEEVECFYCHKKGHFK

TrEMBL top hitse value%identityAlignment
A0A2N9G6Q3 Uncharacterized protein3.5e-4741.95Show/hide
Query:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN
        G+ KF G +F +W+MQ++D L  KK+H   LGE+P DM D  W  +D Q +  IR+TLS  V   V KE T  EL+ AL   YEKPSAN K+ L  K FN
Subjt:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN

Query:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV
        + M EGT+V  H+NE   I N+L  + ++ ++E++A+ +L S P SWE ++ AVSNS G+  LK++ I D  L EE RR+        +G  +   SAL 
Subjt:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV

Query:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV
         + +G+GK   YN G+ + R  R       ++EC+ C K GH +K+C +LK+  EN D+ N V+ EV
Subjt:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV

A0A2N9GHK9 Uncharacterized protein3.5e-4741.95Show/hide
Query:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN
        G+ KF G +F +W+MQ++D L  KK+H   LGE+P DM D  W  +D Q +  IR+TLS  V   V KE T  EL+ AL   YEKPSAN K+ L  K FN
Subjt:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN

Query:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV
        + M EGT+V  H+NE   I N+L  + ++ ++E++A+ +L S P SWE ++ AVSNS G+  LK++ I D  L EE RR+        +G  +   SAL 
Subjt:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV

Query:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV
         + +G+GK   YN G+ + R  R       ++EC+ C K GH +K+C +LK+  EN D+ N V+ EV
Subjt:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV

A0A2N9IKI1 Uncharacterized protein3.5e-4741.95Show/hide
Query:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN
        G+ KF G +F +W+MQ++D L  KK+H   LGE+P DM D  W  +D Q +  IR+TLS  V   V KE T  EL+ AL   YEKPSAN K+ L  K FN
Subjt:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN

Query:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV
        + M EGT+V  H+NE   I N+L  + ++ ++E++A+ +L S P SWE ++ AVSNS G+  LK++ I D  L EE RR+        +G  +   SAL 
Subjt:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV

Query:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV
         + +G+GK   YN G+ + R  R       ++EC+ C K GH +K+C +LK+  EN D+ N V+ EV
Subjt:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV

A0A2N9J3Y8 Uncharacterized protein3.5e-4741.95Show/hide
Query:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN
        G+ KF G +F +W+MQ++D L  KK+H   LGE+P DM D  W  +D Q +  IR+TLS  V   V KE T  EL+ AL   YEKPSAN K+ L  K FN
Subjt:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN

Query:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV
        + M EGT+V  H+NE   I N+L  + ++ ++E++A+ +L S P SWE ++ AVSNS G+  LK++ I D  L EE RR+        +G  +   SAL 
Subjt:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV

Query:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV
         + +G+GK   YN G+ + R  R       ++EC+ C K GH +K+C +LK+  EN D+ N V+ EV
Subjt:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV

A0A2N9JBD5 Uncharacterized protein3.5e-4741.95Show/hide
Query:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN
        G+ KF G +F +W+MQ++D L  KK+H   LGE+P DM D  W  +D Q +  IR+TLS  V   V KE T  EL+ AL   YEKPSAN K+ L  K FN
Subjt:  GVLKFSGENFSFWKMQVKDLLTCKKIH-NTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFN

Query:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV
        + M EGT+V  H+NE   I N+L  + ++ ++E++A+ +L S P SWE ++ AVSNS G+  LK++ I D  L EE RR+        +G  +   SAL 
Subjt:  IHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALV

Query:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV
         + +G+GK   YN G+ + R  R       ++EC+ C K GH +K+C +LK+  EN D+ N V+ EV
Subjt:  AQFKGKGK-MKYN-GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEV

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-1423.08Show/hide
Query:  FSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNIHMEE
        F GE ++ WK +++ LL  + +   +     +  D +W + +  A + I   LS    +    + TA+++L+ L   YE+ S  +++ L  +  ++ +  
Subjt:  FSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYFNIHMEE

Query:  GTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVAQFKG
          S+ SH +   +++++L   G KIEE  K   LL + P  ++ I TA+  +L E +L  + + +  L +E + K        +    +V +A+V     
Subjt:  GTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVAQFKG

Query:  KGKMK-YNGKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKE--DQENEDTLNYVSAEVLACIEGLERQVMHRAA-DNSGGDLNEPAALTVMTDQ
          K   +  +         GNS  +V+C +C ++GH KK C   K   + +N++    V       I  + ++V + +  DN G  L+  A+  ++ D+
Subjt:  KGKMK-YNGKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKE--DQENEDTLNYVSAEVLACIEGLERQVMHRAA-DNSGGDLNEPAALTVMTDQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-2530.71Show/hide
Query:  VLKFSGEN-FSFWKMQVKDLLTCKKIHNTL---GERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKY
        V KF+G+N FS W+ +++DLL  + +H  L    ++P  M  + W ++DE+A + IR+ LS  V + +  E TA+ +   L+  Y   +   K++L  + 
Subjt:  VLKFSGEN-FSFWKMQVKDLLTCKKIHNTL---GERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKY

Query:  FNIHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESA
        + +HM EGT+  SH+N    ++ +L  +GVKIEEE KA+ LL S P S++ + T + +  G+ +++   +  A L  E  RK           EN+ + A
Subjt:  FNIHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESA

Query:  LVAQFKGKGKMKYN---GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHC---RKLKED---QENED
        L+ + +G+   + +   G+   R    + + S    C+ C++ GHFK+ C   RK K +   Q+N+D
Subjt:  LVAQFKGKGKMKYN---GKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHC---RKLKED---QENED

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein1.8e-1137.5Show/hide
Query:  KFSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKI
        K  G ++SF +M+++D L  KK+H  LG++   M+   WN +  Q +  IR+T+S  +   VAKE +   L+K L D Y+KPS N  +
Subjt:  KFSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACAAATATCAAGTACATCATTACTATGAAGGGGTTCTGAAGTTCAGTGGAGAGAATTTCAGTTTTTGGAAGATGCAAGTAAAGGATCTTCTTACGTGCAAGAA
GATACACAATACTTTGGGGGAGAGACCAGCGGATATGACAGACAAAGCTTGGAATGAGATGGATGAGCAGGCCGTTGCAAATATCAGAATGACATTATCAATGGGGGTAT
GCAGTCTCGTGGCGAAAGAGACGACTGCAAAAGAGTTGTTGAAGGCCTTGCAAGACAGGTATGAAAAACCTTCTGCCAATACAAAAATATTTCTATGGACCAAGTATTTT
AACATCCACATGGAGGAGGGAACCTCAGTGAATTCCCACATTAATGAGCTCACCGATATCTTGAACAAATTAGAAGGGATGGGTGTCAAGATTGAGGAGGAGGTGAAAGC
TATGAGGCTGTTGACGTCTTTCCCTTACAGTTGGGAGACGATTAAGACCGCGGTGTCGAATTCGCTAGGAGAAAATAGCTTGAAATTTTCAGCTATTTGTGATGCCGCCT
TATCTGAGGAAGCCCGGAGAAAATTAGGAAAAATGTATGTAACTACTTCAGGGGCAGAAAATGAGGTTGAATCAGCTTTGGTAGCTCAGTTTAAGGGGAAGGGCAAGATG
AAGTACAACGGGAAGCAACAACATAGGAATAATAGGGGTAGTGGGAATTCCAGTGAAGAAGTTGAATGTTTTTACTGTCATAAGAAGGGTCACTTCAAGAAACATTGCAG
GAAGCTTAAAGAGGATCAGGAAAATGAGGATACTCTAAATTACGTGTCAGCGGAGGTGTTAGCTTGTATTGAAGGATTAGAGAGACAGGTTATGCATAGAGCTGCAGATA
ATTCAGGGGGAGACTTGAATGAACCAGCAGCATTGACAGTCATGACAGATCAGGAGAATCTGCCATTAGTTCAAGTACAACAGCTGGGAAGTAGAGGAAAGGGAAAGAGG
AAAAACTCAGTGAGGTGTTCGACAGACTGTCAGTTTCGAGCCCCAGTTGTCAGACGGACTAACGAGATGATGAAGTCGCATAGGCGAAATGGTGCATTGAGAAAGACTAC
AGTTGGTGCTGAGGTCGAGGGTGAAGTTTCTAGGGTGGCAACGGACTTGGGTGAGAGTGCCAAGTCATTAGGGAAATCTTTCTTCAAGAGTCGTTGGGTGCGAGTGAAGA
AGGAAGCGTCGGAGACCACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACAAATATCAAGTACATCATTACTATGAAGGGGTTCTGAAGTTCAGTGGAGAGAATTTCAGTTTTTGGAAGATGCAAGTAAAGGATCTTCTTACGTGCAAGAA
GATACACAATACTTTGGGGGAGAGACCAGCGGATATGACAGACAAAGCTTGGAATGAGATGGATGAGCAGGCCGTTGCAAATATCAGAATGACATTATCAATGGGGGTAT
GCAGTCTCGTGGCGAAAGAGACGACTGCAAAAGAGTTGTTGAAGGCCTTGCAAGACAGGTATGAAAAACCTTCTGCCAATACAAAAATATTTCTATGGACCAAGTATTTT
AACATCCACATGGAGGAGGGAACCTCAGTGAATTCCCACATTAATGAGCTCACCGATATCTTGAACAAATTAGAAGGGATGGGTGTCAAGATTGAGGAGGAGGTGAAAGC
TATGAGGCTGTTGACGTCTTTCCCTTACAGTTGGGAGACGATTAAGACCGCGGTGTCGAATTCGCTAGGAGAAAATAGCTTGAAATTTTCAGCTATTTGTGATGCCGCCT
TATCTGAGGAAGCCCGGAGAAAATTAGGAAAAATGTATGTAACTACTTCAGGGGCAGAAAATGAGGTTGAATCAGCTTTGGTAGCTCAGTTTAAGGGGAAGGGCAAGATG
AAGTACAACGGGAAGCAACAACATAGGAATAATAGGGGTAGTGGGAATTCCAGTGAAGAAGTTGAATGTTTTTACTGTCATAAGAAGGGTCACTTCAAGAAACATTGCAG
GAAGCTTAAAGAGGATCAGGAAAATGAGGATACTCTAAATTACGTGTCAGCGGAGGTGTTAGCTTGTATTGAAGGATTAGAGAGACAGGTTATGCATAGAGCTGCAGATA
ATTCAGGGGGAGACTTGAATGAACCAGCAGCATTGACAGTCATGACAGATCAGGAGAATCTGCCATTAGTTCAAGTACAACAGCTGGGAAGTAGAGGAAAGGGAAAGAGG
AAAAACTCAGTGAGGTGTTCGACAGACTGTCAGTTTCGAGCCCCAGTTGTCAGACGGACTAACGAGATGATGAAGTCGCATAGGCGAAATGGTGCATTGAGAAAGACTAC
AGTTGGTGCTGAGGTCGAGGGTGAAGTTTCTAGGGTGGCAACGGACTTGGGTGAGAGTGCCAAGTCATTAGGGAAATCTTTCTTCAAGAGTCGTTGGGTGCGAGTGAAGA
AGGAAGCGTCGGAGACCACTTAG
Protein sequenceShow/hide protein sequence
MGDKYQVHHYYEGVLKFSGENFSFWKMQVKDLLTCKKIHNTLGERPADMTDKAWNEMDEQAVANIRMTLSMGVCSLVAKETTAKELLKALQDRYEKPSANTKIFLWTKYF
NIHMEEGTSVNSHINELTDILNKLEGMGVKIEEEVKAMRLLTSFPYSWETIKTAVSNSLGENSLKFSAICDAALSEEARRKLGKMYVTTSGAENEVESALVAQFKGKGKM
KYNGKQQHRNNRGSGNSSEEVECFYCHKKGHFKKHCRKLKEDQENEDTLNYVSAEVLACIEGLERQVMHRAADNSGGDLNEPAALTVMTDQENLPLVQVQQLGSRGKGKR
KNSVRCSTDCQFRAPVVRRTNEMMKSHRRNGALRKTTVGAEVEGEVSRVATDLGESAKSLGKSFFKSRWVRVKKEASETT