; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g09570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g09570
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRNase H domain-containing protein
Genome locationchr5:7438372..7439708
RNA-Seq ExpressionMoc05g09570
SyntenyMoc05g09570
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]5.5e-4143.89Show/hide
Query:  DRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSE
        D+A +EE R+ MIIAWQIWE RN+ IFKGV PE   IQ AI+++  +IN+ G  +++  K K   K  +L   ++ N+  +W PP  N WKL  +A+W  
Subjt:  DRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSE

Query:  DQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVLIQ
        D   GGIGWI+RD +GE + A  R I  ++ I  LE MAICEGLR I    E    I++ESD L+ I+LL+ +  D TEI W++E+  Q+     +V ++
Subjt:  DQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVLIQ

Query:  HASRDLNLVAHEIAKRARDGE
        H SR+ N VAH +A+RA + +
Subjt:  HASRDLNLVAHEIAKRARDGE

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]4.7e-4040.85Show/hide
Query:  SSCSNNQRARWKKFWKSVAETSTHLLWDCEVCAPIWYNFLSPDTLFYDHDRSGLNLKDHWDWIFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQ
        ++C+ N+    KK      ET+ H+LW+C+V   IW N    +  F+  DR+    K++W+W+ D+A +EE R+ MIIA QIWE RN+ IFKGV  E   
Subjt:  SSCSNNQRARWKKFWKSVAETSTHLLWDCEVCAPIWYNFLSPDTLFYDHDRSGLNLKDHWDWIFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQ

Query:  IQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILE
        IQ AI+++  +IN+ G  +++    K+K K  +    +  N+  +W PP  N WKL  DA+W  D    GIGWI+RD +GE +  G R I  ++ I  LE
Subjt:  IQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILE

Query:  AMAICEGLRVITMEGEGNLSIYVESDCLQVINLLN
         MAICEGLR I    E    I++ESD L+ I+LL+
Subjt:  AMAICEGLRVITMEGEGNLSIYVESDCLQVINLLN

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]4.7e-3243.24Show/hide
Query:  RSGLNLKDHWDWIFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPP
        R+    K++W+W+ D+A +EE R+ MIIA QIWE RN+ IFKGV  E   IQ AI+++  +IN+ G  +++    K+K K  +    +  N+  +W PP 
Subjt:  RSGLNLKDHWDWIFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPP

Query:  INCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLN
         N WKL  DA+W  D    GIGWI+RD +GE +  G R I  ++ I  LE MAICEGLR I    E    I++ESD L+ I+LL+
Subjt:  INCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLN

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]8.0e-3234.58Show/hide
Query:  IFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASW
        + D+A DE+   ++I +W IW HRN +IF+G       +   + K +        T S  Q        H        N+  KW PPP++ W L  DASW
Subjt:  IFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASW

Query:  SEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVL
        S+   RGGIGWI+R W G+ +LAG+R +     +K+LEA AI EGLR +T  G     +++E+D  +V +LLN K  DLT+  WV+E+ + L  +  ++ 
Subjt:  SEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVL

Query:  IQHASRDLNLVAHEIAKRARDGEDFALWKRGEEENMPSWL
             R+ N  AH +A+RA    +  +W     +  P+WL
Subjt:  IQHASRDLNLVAHEIAKRARDGEDFALWKRGEEENMPSWL

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]4.7e-4042.73Show/hide
Query:  DRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSE
        D+A +EE R+ MIIAWQIWE RN+ IFKGV  E   IQ  I+++  +IN+ G  +++  K K   K  +L   +  N+  +W PP  N WKL  DA+W  
Subjt:  DRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSE

Query:  DQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITME------GEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTA
        D   GGIGWI+RD +GE + A  R I  ++ I  LE MAICEGLR I  E       E    I++ESD L+ I+LL+ +  D TEI W++E+  Q+    
Subjt:  DQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITME------GEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTA

Query:  NVVLIQHASRDLNLVAHEIAKRARDGE
         +V ++H SR+ N VAH++A+RA + +
Subjt:  NVVLIQHASRDLNLVAHEIAKRARDGE

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134122.7e-4143.89Show/hide
Query:  DRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSE
        D+A +EE R+ MIIAWQIWE RN+ IFKGV PE   IQ AI+++  +IN+ G  +++  K K   K  +L   ++ N+  +W PP  N WKL  +A+W  
Subjt:  DRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSE

Query:  DQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVLIQ
        D   GGIGWI+RD +GE + A  R I  ++ I  LE MAICEGLR I    E    I++ESD L+ I+LL+ +  D TEI W++E+  Q+     +V ++
Subjt:  DQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVLIQ

Query:  HASRDLNLVAHEIAKRARDGE
        H SR+ N VAH +A+RA + +
Subjt:  HASRDLNLVAHEIAKRARDGE

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X12.3e-4040.85Show/hide
Query:  SSCSNNQRARWKKFWKSVAETSTHLLWDCEVCAPIWYNFLSPDTLFYDHDRSGLNLKDHWDWIFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQ
        ++C+ N+    KK      ET+ H+LW+C+V   IW N    +  F+  DR+    K++W+W+ D+A +EE R+ MIIA QIWE RN+ IFKGV  E   
Subjt:  SSCSNNQRARWKKFWKSVAETSTHLLWDCEVCAPIWYNFLSPDTLFYDHDRSGLNLKDHWDWIFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQ

Query:  IQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILE
        IQ AI+++  +IN+ G  +++    K+K K  +    +  N+  +W PP  N WKL  DA+W  D    GIGWI+RD +GE +  G R I  ++ I  LE
Subjt:  IQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILE

Query:  AMAICEGLRVITMEGEGNLSIYVESDCLQVINLLN
         MAICEGLR I    E    I++ESD L+ I+LL+
Subjt:  AMAICEGLRVITMEGEGNLSIYVESDCLQVINLLN

A0A6J1DNV9 uncharacterized protein LOC1110224033.9e-3234.58Show/hide
Query:  IFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASW
        + D+A DE+   ++I +W IW HRN +IF+G       +   + K +        T S  Q        H        N+  KW PPP++ W L  DASW
Subjt:  IFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASW

Query:  SEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVL
        S+   RGGIGWI+R W G+ +LAG+R +     +K+LEA AI EGLR +T  G     +++E+D  +V +LLN K  DLT+  WV+E+ + L  +  ++ 
Subjt:  SEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVL

Query:  IQHASRDLNLVAHEIAKRARDGEDFALWKRGEEENMPSWL
             R+ N  AH +A+RA    +  +W     +  P+WL
Subjt:  IQHASRDLNLVAHEIAKRARDGEDFALWKRGEEENMPSWL

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X22.3e-3243.24Show/hide
Query:  RSGLNLKDHWDWIFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPP
        R+    K++W+W+ D+A +EE R+ MIIA QIWE RN+ IFKGV  E   IQ AI+++  +IN+ G  +++    K+K K  +    +  N+  +W PP 
Subjt:  RSGLNLKDHWDWIFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPP

Query:  INCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLN
         N WKL  DA+W  D    GIGWI+RD +GE +  G R I  ++ I  LE MAICEGLR I    E    I++ESD L+ I+LL+
Subjt:  INCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLN

A0A6J1DSV1 uncharacterized protein LOC1110236082.3e-4042.73Show/hide
Query:  DRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSE
        D+A +EE R+ MIIAWQIWE RN+ IFKGV  E   IQ  I+++  +IN+ G  +++  K K   K  +L   +  N+  +W PP  N WKL  DA+W  
Subjt:  DRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSE

Query:  DQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITME------GEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTA
        D   GGIGWI+RD +GE + A  R I  ++ I  LE MAICEGLR I  E       E    I++ESD L+ I+LL+ +  D TEI W++E+  Q+    
Subjt:  DQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITME------GEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTA

Query:  NVVLIQHASRDLNLVAHEIAKRARDGE
         +V ++H SR+ N VAH++A+RA + +
Subjt:  NVVLIQHASRDLNLVAHEIAKRARDGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.1e-1425.94Show/hide
Query:  ETSTHLLWDCEVCAPIWYNFLSPDTLFYDHDRSGLNLKDHWDWIFD-RAEDEETRKI----MIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINN
        ET  HLL+ C     +W   +SP   + + + +  +L  +  W+ +   E  +  KI      + W++W+ RN ++FKG E +A ++     +  +    
Subjt:  ETSTHLLWDCEVCAPIWYNFLSPDTLFYDHDRSGLNLKDHWDWIFD-RAEDEETRKI----MIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINN

Query:  HGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITME
               + + + +GK       ++ N + +W  PP    K   DA+W  +  R GIGWI+R+  G  +  G+R   L +   +LEA        V+TM 
Subjt:  HGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITME

Query:  GEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVLIQHASRDLNLVAHEIAKRARDGEDFALWKRGEEENMPSWLFNLLL
              I  ESD   ++NLLN  +   T ++  +ED  QL      V  +   R  N VA  I   AR+   F+ +       +P WL + L+
Subjt:  GEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVLIQHASRDLNLVAHEIAKRARDGEDFALWKRGEEENMPSWLFNLLL

AT4G29090.1 Ribonuclease H-like superfamily protein7.2e-1525.09Show/hide
Query:  SVAETSTHLLWDCEVCAPIWYNFLSPDTLFYD-HDRSGLNLKDHWDWIFDRAE-----DEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLD
        S  ET  HLL+ C      W     P  L  +  D   +NL     W+F+        ++ ++ +  + W++W++RN ++F+G E  A ++       L+
Subjt:  SVAETSTHLLWDCEVCAPIWYNFLSPDTLFYD-HDRSGLNLKDHWDWIFDRAE-----DEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLD

Query:  LINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRV
                  +  +++  G K      +  +S  +W PPP    K   DA+W+ D  R GIGW++R+ +GE    G+R   L +   +LEA        V
Subjt:  LINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPINCWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRV

Query:  ITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVLIQHASRDLNLVAHEIAKRARDGEDFALWKRGEEENMPSW
        +++       +  ESD   +I +LN  E     +K  I+D  +L S    V      R+ N +A  +   AR+   F  +       +PSW
Subjt:  ITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVLIQHASRDLNLVAHEIAKRARDGEDFALWKRGEEENMPSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTGCGGGAAGGGAGCAGTCTCCCATTTTAGTTAACGACCCTGTCAAATTTCGCAATGTCGCATATTTATTGAATGGCCCAGGCGTTTGGAATGAGGAGCGTGT
CAGGAGCAACTTTTGCGAAGAGGATGCAGAAACGATTCTCAACATCCCTATACCTAGCCAAAGCGTGAAGTTGGACAATGCTCACCTAGCATCGTCTTCATGCTCTAACA
ATCAACGGGCGAGATGGAAGAAGTTTTGGAAGTCTGTAGCAGAAACCTCAACGCATTTGTTGTGGGATTGTGAGGTATGTGCACCTATTTGGTACAATTTCCTTTCCCCT
GATACTCTCTTTTATGATCATGATAGGAGTGGCTTGAATTTAAAAGACCATTGGGACTGGATATTTGATAGAGCCGAGGACGAGGAGACAAGGAAAATTATGATAATTGC
GTGGCAGATTTGGGAGCATAGGAATCGGATAATTTTCAAAGGTGTTGAACCAGAAGCAGTTCAAATCCAGGCAGCAATCAACAAACATCTAGATTTAATTAATAATCACG
GGGCTACTAGTTCAGTCACACAAAAGAGCAAGAAGAAAGGCAAGAAGCATTACCTGACAACTGATATGCAAAGTAACTCAAATCAGAAGTGGAATCCCCCTCCTATTAAC
TGCTGGAAACTATACTGTGACGCATCTTGGAGTGAAGATCAAAGACGCGGTGGTATTGGATGGATAGTTCGTGATTGGCGTGGTGAGGCAATGTTGGCAGGAAGTCGTCC
GATTGTTCTGCAACAAGAGATTAAAATCCTTGAAGCTATGGCTATTTGCGAAGGATTAAGAGTGATAACAATGGAAGGAGAAGGAAATTTATCGATTTATGTGGAATCAG
ATTGCCTTCAAGTCATAAATCTTTTGAATGGTAAAGAATCTGACCTTACAGAGATAAAATGGGTCATTGAAGATGGCGTTCAGTTGGCATCGACTGCAAATGTTGTGTTG
ATCCAACATGCATCGAGAGATTTGAATCTGGTCGCCCATGAGATTGCAAAGAGAGCCAGAGATGGAGAAGATTTTGCCCTTTGGAAGCGTGGGGAGGAGGAAAATATGCC
TTCGTGGTTGTTCAACCTTCTTCTTTTGGAACTCCCGAATGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTGCGGGAAGGGAGCAGTCTCCCATTTTAGTTAACGACCCTGTCAAATTTCGCAATGTCGCATATTTATTGAATGGCCCAGGCGTTTGGAATGAGGAGCGTGT
CAGGAGCAACTTTTGCGAAGAGGATGCAGAAACGATTCTCAACATCCCTATACCTAGCCAAAGCGTGAAGTTGGACAATGCTCACCTAGCATCGTCTTCATGCTCTAACA
ATCAACGGGCGAGATGGAAGAAGTTTTGGAAGTCTGTAGCAGAAACCTCAACGCATTTGTTGTGGGATTGTGAGGTATGTGCACCTATTTGGTACAATTTCCTTTCCCCT
GATACTCTCTTTTATGATCATGATAGGAGTGGCTTGAATTTAAAAGACCATTGGGACTGGATATTTGATAGAGCCGAGGACGAGGAGACAAGGAAAATTATGATAATTGC
GTGGCAGATTTGGGAGCATAGGAATCGGATAATTTTCAAAGGTGTTGAACCAGAAGCAGTTCAAATCCAGGCAGCAATCAACAAACATCTAGATTTAATTAATAATCACG
GGGCTACTAGTTCAGTCACACAAAAGAGCAAGAAGAAAGGCAAGAAGCATTACCTGACAACTGATATGCAAAGTAACTCAAATCAGAAGTGGAATCCCCCTCCTATTAAC
TGCTGGAAACTATACTGTGACGCATCTTGGAGTGAAGATCAAAGACGCGGTGGTATTGGATGGATAGTTCGTGATTGGCGTGGTGAGGCAATGTTGGCAGGAAGTCGTCC
GATTGTTCTGCAACAAGAGATTAAAATCCTTGAAGCTATGGCTATTTGCGAAGGATTAAGAGTGATAACAATGGAAGGAGAAGGAAATTTATCGATTTATGTGGAATCAG
ATTGCCTTCAAGTCATAAATCTTTTGAATGGTAAAGAATCTGACCTTACAGAGATAAAATGGGTCATTGAAGATGGCGTTCAGTTGGCATCGACTGCAAATGTTGTGTTG
ATCCAACATGCATCGAGAGATTTGAATCTGGTCGCCCATGAGATTGCAAAGAGAGCCAGAGATGGAGAAGATTTTGCCCTTTGGAAGCGTGGGGAGGAGGAAAATATGCC
TTCGTGGTTGTTCAACCTTCTTCTTTTGGAACTCCCGAATGTTTAA
Protein sequenceShow/hide protein sequence
MDSAGREQSPILVNDPVKFRNVAYLLNGPGVWNEERVRSNFCEEDAETILNIPIPSQSVKLDNAHLASSSCSNNQRARWKKFWKSVAETSTHLLWDCEVCAPIWYNFLSP
DTLFYDHDRSGLNLKDHWDWIFDRAEDEETRKIMIIAWQIWEHRNRIIFKGVEPEAVQIQAAINKHLDLINNHGATSSVTQKSKKKGKKHYLTTDMQSNSNQKWNPPPIN
CWKLYCDASWSEDQRRGGIGWIVRDWRGEAMLAGSRPIVLQQEIKILEAMAICEGLRVITMEGEGNLSIYVESDCLQVINLLNGKESDLTEIKWVIEDGVQLASTANVVL
IQHASRDLNLVAHEIAKRARDGEDFALWKRGEEENMPSWLFNLLLLELPNV