; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g19170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g19170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr2:14214091..14215424
RNA-Seq ExpressionMoc02g19170
SyntenyMoc02g19170
Gene Ontology termsGO:0016746 - transferase activity, transferring acyl groups (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]3.5e-4547.58Show/hide
Query:  LDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLM
        +D  AE   A+  +A+MVKAEL+GR+  T KERE  S  LE A  L+G+L +A+ E    ++  DAK    + E  +H   LR A+A+ K LEKEKF L+
Subjt:  LDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLM

Query:  KQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHN
        K+ DDL  +       L+ +DA +  L  +L+  K RL++G LLEE+FRQHP+FDGFAKDFSDAGF+FLMKG+    P  ++DL+ +K RY E WASG N
Subjt:  KQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHN

Query:  GTPGPQYRVDQYLKDLDSDAELEEEED----EDPSSQDADETLPSAAG
        GTPGPQ  VD+Y+++LDSD    EEED    E        E  PS  G
Subjt:  GTPGPQYRVDQYLKDLDSDAELEEEED----EDPSSQDADETLPSAAG

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]3.4e-4844.98Show/hide
Query:  EVGDSQ-----QVSPFGDLVDDPEARIGGTSDLEMRFKIEPSSAGRSGIGHPGLLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAA-
        EVG+ +     ++ P    V D  +RI   S L+   +         G      +DY AE   A+ Q+AL VKAEL+GR++   +E+E  SAALETA++ 
Subjt:  EVGDSQ-----QVSPFGDLVDDPEARIGGTSDLEMRFKIEPSSAGRSGIGHPGLLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAA-

Query:  LEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLE
        ++ +L +A  E +  K+  +++ +  + E  R    LR A+A+ + LE+EKF L+K+       +DD+   L+A+D E+    A+LE +K RLSNGVLLE
Subjt:  LEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLE

Query:  EAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSPEL--DLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDAELEEEEDEDPSSQD
        EAFRQHPDFDGFAKDFSDAGF+FLMKG+    P+L  DL+ +K RY EKWASG  GTPGPQ  VDQY++DLDSD   + EED+  S+Q+
Subjt:  EAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSPEL--DLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDAELEEEEDEDPSSQD

XP_022150867.1 uncharacterized protein LOC111018913 [Momordica charantia]1.6e-5345Show/hide
Query:  DLPAEVEVVEADNAAASKRTSP-KKSKKKKRRTHHFGDEVREVGDSQQVSPFGDLVDDPEARIGGTSDLEMRFKIEPSSAG-------------------
        D+   +EV +       +R SP  KSK  KR+T    D V EV    +V     L +DP+AR+G T D+ MRFKIEPSSAG                   
Subjt:  DLPAEVEVVEADNAAASKRTSP-KKSKKKKRRTHHFGDEVREVGDSQQVSPFGDLVDDPEARIGGTSDLEMRFKIEPSSAG-------------------

Query:  --------RSGIGHPGLLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLR
                RSGI    ++DYT +VHA +C  A+++K++L+ RDL  V EREA S ALE A  LE +LKEARVE +  KS  +AK KS + EV    E  +
Subjt:  --------RSGIGHPGLLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLR

Query:  GAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSPELDLA
          Y + K LE EKF LM++ND   HL  D     K   +E+ EL+ ++EL K++LSNGVLLEEAF+ H DFD F  DFSD  F+FLMKG+ EV+ +LDL 
Subjt:  GAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSPELDLA

Query:  PIKLRYVEKWASGHNGTPGP
        P+K  Y +KWASG   T GP
Subjt:  PIKLRYVEKWASGHNGTPGP

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.8e-4945.86Show/hide
Query:  IGGTSDLEMRFKIEPSSAG--------------------RSGIGHPG-----LLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALE
        +GGT D+  RF++EPSS+G                       +  PG      +D  AE   A+  +A+MVKAEL+GR+    KERE SSAALE A  L+
Subjt:  IGGTSDLEMRFKIEPSSAG--------------------RSGIGHPG-----LLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALE

Query:  GKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEA
        G+L +A+ E    ++  DAK +  + E  +H   LR A+A+ K LEKEKF L+K+ DDL  +   LEGK    D  +  L A+L+  K RL+NG LLEE+
Subjt:  GKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEA

Query:  FRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDAELEEEEDEDPSSQDADE
        FRQH DFDGFAKDFSDAGF+FLMKG+    P  ++DL+ +K +Y EKWASG NGTPGPQ  V +Y+++LDSD    + E+ED  SQ+ +E
Subjt:  FRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDAELEEEEDEDPSSQDADE

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.5e-5640.37Show/hide
Query:  MVCGFSQSVKRK---RPSAKKAAKNTEAPSPTVADLPAE----------VEVVEAD---NAAASKRT------------------SPKKSKKKKRRTHHF
        MVCGF+ SVKRK   R  A K    TE  +PTV    A+            V+E D     +  KR+                  SP + ++KK++T   
Subjt:  MVCGFSQSVKRK---RPSAKKAAKNTEAPSPTVADLPAE----------VEVVEAD---NAAASKRT------------------SPKKSKKKKRRTHHF

Query:  GDEVREVGDSQQV-SPFGDLVDDPEARIGGTSDLEMRFKIEPSSAG--------------------RSGIGHPG-----LLDYTAEVHAAACQTALMVKA
             E G    + +   DLVDDPEAR+ GTS++ MRF +EPSS+G                       +  PG      +D  AE   A+   A+MVKA
Subjt:  GDEVREVGDSQQV-SPFGDLVDDPEARIGGTSDLEMRFKIEPSSAG--------------------RSGIGHPG-----LLDYTAEVHAAACQTALMVKA

Query:  ELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKAR
        EL+GR+    KERE S AALE A  L+G+L +A+ E    ++  DAK+   + E  +H   LR A+A+ K LEKEKF L+K+ DDL  +       L+ +
Subjt:  ELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKAR

Query:  DAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDA
        DA +  L  +L+  K RL+NG LLEE+FRQHPDFDGFAKDFSDAGF+FLMKG+    P  ++DL  +K +Y EKWASG NGTP PQ  VD+Y+++LDSD 
Subjt:  DAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDA

Query:  ELEEEEDEDPSSQDAD-----ETLPSAAGAN
           EEED  PS +  +     E +PS  G +
Subjt:  ELEEEEDEDPSSQDAD-----ETLPSAAGAN

TrEMBL top hitse value%identityAlignment
A0A6J1D1N9 uncharacterized protein LOC1110161931.7e-4547.58Show/hide
Query:  LDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLM
        +D  AE   A+  +A+MVKAEL+GR+  T KERE  S  LE A  L+G+L +A+ E    ++  DAK    + E  +H   LR A+A+ K LEKEKF L+
Subjt:  LDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLM

Query:  KQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHN
        K+ DDL  +       L+ +DA +  L  +L+  K RL++G LLEE+FRQHP+FDGFAKDFSDAGF+FLMKG+    P  ++DL+ +K RY E WASG N
Subjt:  KQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHN

Query:  GTPGPQYRVDQYLKDLDSDAELEEEED----EDPSSQDADETLPSAAG
        GTPGPQ  VD+Y+++LDSD    EEED    E        E  PS  G
Subjt:  GTPGPQYRVDQYLKDLDSDAELEEEED----EDPSSQDADETLPSAAG

A0A6J1D971 uncharacterized protein LOC1110185381.7e-4844.98Show/hide
Query:  EVGDSQ-----QVSPFGDLVDDPEARIGGTSDLEMRFKIEPSSAGRSGIGHPGLLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAA-
        EVG+ +     ++ P    V D  +RI   S L+   +         G      +DY AE   A+ Q+AL VKAEL+GR++   +E+E  SAALETA++ 
Subjt:  EVGDSQ-----QVSPFGDLVDDPEARIGGTSDLEMRFKIEPSSAGRSGIGHPGLLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAA-

Query:  LEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLE
        ++ +L +A  E +  K+  +++ +  + E  R    LR A+A+ + LE+EKF L+K+       +DD+   L+A+D E+    A+LE +K RLSNGVLLE
Subjt:  LEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLE

Query:  EAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSPEL--DLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDAELEEEEDEDPSSQD
        EAFRQHPDFDGFAKDFSDAGF+FLMKG+    P+L  DL+ +K RY EKWASG  GTPGPQ  VDQY++DLDSD   + EED+  S+Q+
Subjt:  EAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSPEL--DLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDAELEEEEDEDPSSQD

A0A6J1DBX9 uncharacterized protein LOC1110189137.7e-5445Show/hide
Query:  DLPAEVEVVEADNAAASKRTSP-KKSKKKKRRTHHFGDEVREVGDSQQVSPFGDLVDDPEARIGGTSDLEMRFKIEPSSAG-------------------
        D+   +EV +       +R SP  KSK  KR+T    D V EV    +V     L +DP+AR+G T D+ MRFKIEPSSAG                   
Subjt:  DLPAEVEVVEADNAAASKRTSP-KKSKKKKRRTHHFGDEVREVGDSQQVSPFGDLVDDPEARIGGTSDLEMRFKIEPSSAG-------------------

Query:  --------RSGIGHPGLLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLR
                RSGI    ++DYT +VHA +C  A+++K++L+ RDL  V EREA S ALE A  LE +LKEARVE +  KS  +AK KS + EV    E  +
Subjt:  --------RSGIGHPGLLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLR

Query:  GAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSPELDLA
          Y + K LE EKF LM++ND   HL  D     K   +E+ EL+ ++EL K++LSNGVLLEEAF+ H DFD F  DFSD  F+FLMKG+ EV+ +LDL 
Subjt:  GAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSPELDLA

Query:  PIKLRYVEKWASGHNGTPGP
        P+K  Y +KWASG   T GP
Subjt:  PIKLRYVEKWASGHNGTPGP

A0A6J1DF31 uncharacterized protein LOC1110199098.8e-5045.86Show/hide
Query:  IGGTSDLEMRFKIEPSSAG--------------------RSGIGHPG-----LLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALE
        +GGT D+  RF++EPSS+G                       +  PG      +D  AE   A+  +A+MVKAEL+GR+    KERE SSAALE A  L+
Subjt:  IGGTSDLEMRFKIEPSSAG--------------------RSGIGHPG-----LLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALE

Query:  GKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEA
        G+L +A+ E    ++  DAK +  + E  +H   LR A+A+ K LEKEKF L+K+ DDL  +   LEGK    D  +  L A+L+  K RL+NG LLEE+
Subjt:  GKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEA

Query:  FRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDAELEEEEDEDPSSQDADE
        FRQH DFDGFAKDFSDAGF+FLMKG+    P  ++DL+ +K +Y EKWASG NGTPGPQ  V +Y+++LDSD    + E+ED  SQ+ +E
Subjt:  FRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDAELEEEEDEDPSSQDADE

A0A6J1DZB3 uncharacterized protein LOC1110256652.2e-5640.37Show/hide
Query:  MVCGFSQSVKRK---RPSAKKAAKNTEAPSPTVADLPAE----------VEVVEAD---NAAASKRT------------------SPKKSKKKKRRTHHF
        MVCGF+ SVKRK   R  A K    TE  +PTV    A+            V+E D     +  KR+                  SP + ++KK++T   
Subjt:  MVCGFSQSVKRK---RPSAKKAAKNTEAPSPTVADLPAE----------VEVVEAD---NAAASKRT------------------SPKKSKKKKRRTHHF

Query:  GDEVREVGDSQQV-SPFGDLVDDPEARIGGTSDLEMRFKIEPSSAG--------------------RSGIGHPG-----LLDYTAEVHAAACQTALMVKA
             E G    + +   DLVDDPEAR+ GTS++ MRF +EPSS+G                       +  PG      +D  AE   A+   A+MVKA
Subjt:  GDEVREVGDSQQV-SPFGDLVDDPEARIGGTSDLEMRFKIEPSSAG--------------------RSGIGHPG-----LLDYTAEVHAAACQTALMVKA

Query:  ELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKAR
        EL+GR+    KERE S AALE A  L+G+L +A+ E    ++  DAK+   + E  +H   LR A+A+ K LEKEKF L+K+ DDL  +       L+ +
Subjt:  ELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAKCLEKEKFVLMKQNDDLEHLRDDLEGKLKAR

Query:  DAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDA
        DA +  L  +L+  K RL+NG LLEE+FRQHPDFDGFAKDFSDAGF+FLMKG+    P  ++DL  +K +Y EKWASG NGTP PQ  VD+Y+++LDSD 
Subjt:  DAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSP--ELDLAPIKLRYVEKWASGHNGTPGPQYRVDQYLKDLDSDA

Query:  ELEEEEDEDPSSQDAD-----ETLPSAAGAN
           EEED  PS +  +     E +PS  G +
Subjt:  ELEEEEDEDPSSQDAD-----ETLPSAAGAN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGTGGATTTTCCCAAAGCGTTAAGCGCAAGCGCCCGAGTGCTAAGAAGGCTGCCAAAAACACTGAGGCGCCCAGCCCCACGGTAGCCGACCTTCCTGCT
GAGGTCGAGGTGGTTGAAGCTGACAATGCGGCTGCTTCGAAGAGGACTTCTCCAAAAAAGTCCAAAAAGAAGAAGAGAAGGACCCACCATTTCGGGGACGAAGTG
AGGGAGGTGGGTGACAGTCAGCAGGTCAGTCCTTTTGGGGACCTGGTGGATGACCCTGAGGCTAGGATAGGCGGCACCTCCGACCTCGAGATGAGGTTCAAGATC
GAACCCTCAAGTGCTGGGCGCTCCGGGATCGGCCATCCAGGATTGTTGGACTACACTGCCGAGGTCCATGCCGCAGCTTGCCAGACGGCCCTCATGGTGAAGGCC
GAGCTAGAAGGGCGTGACCTGCGCACTGTGAAGGAGCGAGAAGCCTCCTCTGCTGCTTTGGAAACTGCAGCTGCTCTGGAGGGGAAGCTCAAAGAAGCTCGGGTT
GAAGCCCAAGCATGGAAATCTACTTCTGACGCCAAGCTCAAAAGTGCTCAAGCAGAGGTGGCCCGTCATCTGGAGACCTTGCGAGGTGCGTACGCCGTGGCCAAG
TGCCTGGAGAAGGAGAAGTTCGTGCTGATGAAGCAGAACGACGACCTTGAACATCTCCGAGATGACCTGGAGGGCAAACTGAAGGCTCGAGACGCCGAGATGGCA
GAGCTGAGGGCCAAGCTTGAGCTATCCAAGTCAAGGCTCAGCAACGGAGTCCTGCTGGAGGAAGCTTTTCGTCAACACCCTGACTTCGATGGGTTCGCCAAAGAT
TTCAGCGATGCTGGCTTCCAGTTCTTAATGAAAGGGGTCCAGGAAGTGTCTCCCGAGCTCGACCTTGCACCCATAAAGCTGCGATATGTAGAGAAGTGGGCTTCG
GGTCACAATGGGACTCCTGGCCCCCAGTACCGCGTTGATCAGTACCTGAAGGACCTTGACTCCGACGCCGAGCTCGAGGAGGAAGAGGATGAGGATCCTTCTTCC
CAAGACGCTGACGAGACTCTTCCTTCTGCTGCCGGTGCGAACTCCACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTGTGGATTTTCCCAAAGCGTTAAGCGCAAGCGCCCGAGTGCTAAGAAGGCTGCCAAAAACACTGAGGCGCCCAGCCCCACGGTAGCCGACCTTCCTGCT
GAGGTCGAGGTGGTTGAAGCTGACAATGCGGCTGCTTCGAAGAGGACTTCTCCAAAAAAGTCCAAAAAGAAGAAGAGAAGGACCCACCATTTCGGGGACGAAGTG
AGGGAGGTGGGTGACAGTCAGCAGGTCAGTCCTTTTGGGGACCTGGTGGATGACCCTGAGGCTAGGATAGGCGGCACCTCCGACCTCGAGATGAGGTTCAAGATC
GAACCCTCAAGTGCTGGGCGCTCCGGGATCGGCCATCCAGGATTGTTGGACTACACTGCCGAGGTCCATGCCGCAGCTTGCCAGACGGCCCTCATGGTGAAGGCC
GAGCTAGAAGGGCGTGACCTGCGCACTGTGAAGGAGCGAGAAGCCTCCTCTGCTGCTTTGGAAACTGCAGCTGCTCTGGAGGGGAAGCTCAAAGAAGCTCGGGTT
GAAGCCCAAGCATGGAAATCTACTTCTGACGCCAAGCTCAAAAGTGCTCAAGCAGAGGTGGCCCGTCATCTGGAGACCTTGCGAGGTGCGTACGCCGTGGCCAAG
TGCCTGGAGAAGGAGAAGTTCGTGCTGATGAAGCAGAACGACGACCTTGAACATCTCCGAGATGACCTGGAGGGCAAACTGAAGGCTCGAGACGCCGAGATGGCA
GAGCTGAGGGCCAAGCTTGAGCTATCCAAGTCAAGGCTCAGCAACGGAGTCCTGCTGGAGGAAGCTTTTCGTCAACACCCTGACTTCGATGGGTTCGCCAAAGAT
TTCAGCGATGCTGGCTTCCAGTTCTTAATGAAAGGGGTCCAGGAAGTGTCTCCCGAGCTCGACCTTGCACCCATAAAGCTGCGATATGTAGAGAAGTGGGCTTCG
GGTCACAATGGGACTCCTGGCCCCCAGTACCGCGTTGATCAGTACCTGAAGGACCTTGACTCCGACGCCGAGCTCGAGGAGGAAGAGGATGAGGATCCTTCTTCC
CAAGACGCTGACGAGACTCTTCCTTCTGCTGCCGGTGCGAACTCCACTTAG
Protein sequenceShow/hide protein sequence
MVCGFSQSVKRKRPSAKKAAKNTEAPSPTVADLPAEVEVVEADNAAASKRTSPKKSKKKKRRTHHFGDEVREVGDSQQVSPFGDLVDDPEARIGGTSDLEMRFKI
EPSSAGRSGIGHPGLLDYTAEVHAAACQTALMVKAELEGRDLRTVKEREASSAALETAAALEGKLKEARVEAQAWKSTSDAKLKSAQAEVARHLETLRGAYAVAK
CLEKEKFVLMKQNDDLEHLRDDLEGKLKARDAEMAELRAKLELSKSRLSNGVLLEEAFRQHPDFDGFAKDFSDAGFQFLMKGVQEVSPELDLAPIKLRYVEKWAS
GHNGTPGPQYRVDQYLKDLDSDAELEEEEDEDPSSQDADETLPSAAGANST