CuGenDBv2

Moc06g14960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g14960
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BAHD acyltransferase At3g29680-like
Genome location: chr6:11727307..11728543
RNA-Seq Expression: Moc06g14960
Synteny: Moc06g14960
Gene Ontology terms: GO:0016740 - transferase activity (molecular function)
InterPro domains: NA
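The genome location field follows a `chrom:start..end` pattern. A minimal Python sketch for splitting it into parts is shown here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("chr6:11727307..11728543")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 1237 bp on chr6.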


Homology

GenBank top hits (hit; e value; % identity; alignment)

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]; e value 9.0e-78; identity 67.07%
Query:  NDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPSSELPYAI
        +DPGSVLQRTID AAEAF+ASI SA+ VKAELDGRE L A+E+E  S  LEAA +T+K ELLKA  EV+IL+AEV+ K +LLKK+ E   +      +AI
Subjt:  NDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPSSELPYAI

Query:  TKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAKQW
        TKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KERL++GALLEESFRQHP+F+GFAKDFSDAGFKFLMKGIA+DMP LQIDL  LKKRY++ W
Subjt:  TKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAKQW

Query:  ASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP
        ASGP+GT GPQ+LVDKYVR+LD DYSD+EE+        +VGTTQE AP
Subjt:  ASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]; e value 1.7e-100; identity 83.95%
Query:  PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPSSELPYAITK
        PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKE FSAALE ASSTMKDELLKAHSEVE LKAEVE++AELL KKEE   +      +AIT+
Subjt:  PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPSSELPYAITK

Query:  GLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAKQWAS
        GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDF+GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYA++WAS
Subjt:  GLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAKQWAS

Query:  GPSGTLGPQALVDKYVRDLDFDYSDLEEDQVGTTQEGAPQAGS
        GP GT GPQALVD+YVRDLD DYSD EEDQVG+TQEGA   GS
Subjt:  GPSGTLGPQALVDKYVRDLDFDYSDLEEDQVGTTQEGAPQAGS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]; e value 4.8e-79; identity 66.8%
Query:  KEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPS
        K   +  +DPGSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+K ELLKA  EV IL+AEV+ KAELLKK+ E   +  
Subjt:  KEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPS

Query:  SELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLK
            +AITKGLEKEKFQLLKEKDD+ Q LE K+  +   TAEL+ +KERL+NG+LLEESFRQH DF+GFAKDFSDAGFKFLMKGIA+DMP LQIDL  LK
Subjt:  SELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLK

Query:  KRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP
        K+Y+++WASGP+GT GPQ+LV KYVR+LD DYSD+EE+        ++GTTQE  P
Subjt:  KRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]; e value 1.9e-72; identity 62.09%
Query:  SSWGLACELRRSGVPHLGRKFGPLPKEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEV
        SS G+  ++ R     L R      +   +  +DPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKE FSAALEAA  TMKDELLKAHSEV
Subjt:  SSWGLACELRRSGVPHLGRKFGPLPKEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEV

Query:  EILKAEVETKAELLKKKEETDARPSSELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSD
        E LKAEVE++AELL KKEE   +      +AIT+GLE+EKFQLLKEKDDMLQALE K++EL+HATAELET KERLSN                       
Subjt:  EILKAEVETKAELLKKKEETDARPSSELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSD

Query:  AGFKFLMKGIASDMPDLQIDLGGLKKRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEEDQVGTTQEGAPQAGS
                         +IDL GLK+RYA++WASGP GT GPQALVD+YVRDLD DYSD +EDQVG+TQEGAP AGS
Subjt:  AGFKFLMKGIASDMPDLQIDLGGLKKRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEEDQVGTTQEGAPQAGS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e value 1.5e-80; identity 51.72%
Query:  GPASEDPAPVIELESSEGPSREKRPEIRPRRWTL-PLPWARRWGGSPSEAKEEEEEDHLPLGGRSSWGLA-------CELRRSGVPHLGRKFGPLP----
        GP+S  P PVIEL+ S G S EKR         + PL   R         K+++       G R +   +        E R  G  ++  +FG  P    
Subjt:  GPASEDPAPVIELESSEGPSREKRPEIRPRRWTL-PLPWARRWGGSPSEAKEEEEEDHLPLGGRSSWGLA-------CELRRSGVPHLGRKFGPLP----

Query:  -----------------KEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVE
                         +   +  +DPGSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+
Subjt:  -----------------KEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVE

Query:  TKAELLKKKEETDARPSSELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMK
         K +LLKK+ E   +      +AITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KERL+NG LLEESFRQHPDF+GFAKDFSDAGFKFLMK
Subjt:  TKAELLKKKEETDARPSSELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMK

Query:  GIASDMPDLQIDLGGLKKRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP--QAGS
        GIA+DMP LQIDL GLKK+Y+++WASGP+GT  PQ+LVDKYVR+LD DYSD+EE+        +VGTTQE  P  Q GS
Subjt:  GIASDMPDLQIDLGGLKKRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP--QAGS

TrEMBL top hits (hit; e value; % identity; alignment)

A0A6J1D1N9 uncharacterized protein LOC111016193; e value 4.4e-78; identity 67.07%
Query:  NDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPSSELPYAI
        +DPGSVLQRTID AAEAF+ASI SA+ VKAELDGRE L A+E+E  S  LEAA +T+K ELLKA  EV+IL+AEV+ K +LLKK+ E   +      +AI
Subjt:  NDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPSSELPYAI

Query:  TKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAKQW
        TKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KERL++GALLEESFRQHP+F+GFAKDFSDAGFKFLMKGIA+DMP LQIDL  LKKRY++ W
Subjt:  TKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAKQW

Query:  ASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP
        ASGP+GT GPQ+LVDKYVR+LD DYSD+EE+        +VGTTQE AP
Subjt:  ASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP

A0A6J1D971 uncharacterized protein LOC111018538; e value 8.2e-101; identity 83.95%
Query:  PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPSSELPYAITK
        PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKE FSAALE ASSTMKDELLKAHSEVE LKAEVE++AELL KKEE   +      +AIT+
Subjt:  PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPSSELPYAITK

Query:  GLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAKQWAS
        GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDF+GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYA++WAS
Subjt:  GLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAKQWAS

Query:  GPSGTLGPQALVDKYVRDLDFDYSDLEEDQVGTTQEGAPQAGS
        GP GT GPQALVD+YVRDLD DYSD EEDQVG+TQEGA   GS
Subjt:  GPSGTLGPQALVDKYVRDLDFDYSDLEEDQVGTTQEGAPQAGS

A0A6J1DF31 uncharacterized protein LOC111019909; e value 2.3e-79; identity 66.8%
Query:  KEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPS
        K   +  +DPGSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+K ELLKA  EV IL+AEV+ KAELLKK+ E   +  
Subjt:  KEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPS

Query:  SELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLK
            +AITKGLEKEKFQLLKEKDD+ Q LE K+  +   TAEL+ +KERL+NG+LLEESFRQH DF+GFAKDFSDAGFKFLMKGIA+DMP LQIDL  LK
Subjt:  SELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLK

Query:  KRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP
        K+Y+++WASGP+GT GPQ+LV KYVR+LD DYSD+EE+        ++GTTQE  P
Subjt:  KRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP

A0A6J1DVF6 uncharacterized protein LOC111024740; e value 9.4e-73; identity 62.09%
Query:  SSWGLACELRRSGVPHLGRKFGPLPKEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEV
        SS G+  ++ R     L R      +   +  +DPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKE FSAALEAA  TMKDELLKAHSEV
Subjt:  SSWGLACELRRSGVPHLGRKFGPLPKEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEV

Query:  EILKAEVETKAELLKKKEETDARPSSELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSD
        E LKAEVE++AELL KKEE   +      +AIT+GLE+EKFQLLKEKDDMLQALE K++EL+HATAELET KERLSN                       
Subjt:  EILKAEVETKAELLKKKEETDARPSSELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSD

Query:  AGFKFLMKGIASDMPDLQIDLGGLKKRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEEDQVGTTQEGAPQAGS
                         +IDL GLK+RYA++WASGP GT GPQALVD+YVRDLD DYSD +EDQVG+TQEGAP AGS
Subjt:  AGFKFLMKGIASDMPDLQIDLGGLKKRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEEDQVGTTQEGAPQAGS

A0A6J1DZB3 uncharacterized protein LOC111025665; e value 7.2e-81; identity 51.72%
Query:  GPASEDPAPVIELESSEGPSREKRPEIRPRRWTL-PLPWARRWGGSPSEAKEEEEEDHLPLGGRSSWGLA-------CELRRSGVPHLGRKFGPLP----
        GP+S  P PVIEL+ S G S EKR         + PL   R         K+++       G R +   +        E R  G  ++  +FG  P    
Subjt:  GPASEDPAPVIELESSEGPSREKRPEIRPRRWTL-PLPWARRWGGSPSEAKEEEEEDHLPLGGRSSWGLA-------CELRRSGVPHLGRKFGPLP----

Query:  -----------------KEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVE
                         +   +  +DPGSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+
Subjt:  -----------------KEGVQICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVE

Query:  TKAELLKKKEETDARPSSELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMK
         K +LLKK+ E   +      +AITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KERL+NG LLEESFRQHPDF+GFAKDFSDAGFKFLMK
Subjt:  TKAELLKKKEETDARPSSELPYAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMK

Query:  GIASDMPDLQIDLGGLKKRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP--QAGS
        GIA+DMP LQIDL GLKK+Y+++WASGP+GT  PQ+LVDKYVR+LD DYSD+EE+        +VGTTQE  P  Q GS
Subjt:  GIASDMPDLQIDLGGLKKRYAKQWASGPSGTLGPQALVDKYVRDLDFDYSDLEED--------QVGTTQEGAP--QAGS
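In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A small Python sketch for counting these in one midline segment is given here; `midline_stats` is a hypothetical helper name, and the input is only the first ten columns of the first GenBank hit, so the result is not expected to reproduce the % identity values reported on this page, which cover full alignments.

```python
def midline_stats(midline):
    """Return (identities, positives, length) for one BLAST midline segment."""
    # Letters on the midline are exact matches between query and subject.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are exact matches plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First ten alignment columns of the XP_022147182.1 hit (illustrative only).
midline = "+DPGSVLQRT"
identities, positives, length = midline_stats(midline)
print(f"{identities}/{length} identical ({100 * identities / length:.1f}%), "
      f"{positives}/{length} positive")
```

Note that trailing spaces on a midline may have been stripped from this text, so the length is better taken from the corresponding query line when whole segments are processed.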

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences

CDS sequence
ATGCTCTGGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCCTGTTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGAGGGTCCTTC
GAGGGAGAAGCGCCCAGAGATCAGACCGAGGCGGTGGACGCTCCCCTTGCCTTGGGCGAGGAGGTGGGGAGGAAGTCCCTCTGAAGCGAAGGAAGAAGAAGAAGAAGACC
ACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAA
ATTTGTAATGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAG
GGAAGTTCTGGCAGCGAGGGAGAAAGAGGGGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGA
AGGCCGAGGTGGAGACCAAAGCCGAGCTGCTGAAGAAGAAGGAAGAGACAGACGCAAGGCCCAGCTCCGAGCTGCCCTATGCTATCACCAAGGGCTTGGAGAAGGAGAAG
TTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAA
TGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCAATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACA
TGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTAAGCAGTGGGCGTCTGGGCCTAGCGGCACCCTTGGCCCCCAAGCGTTGGTGGATAAGTACGTC
AGAGATCTGGACTTTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG

mRNA sequence
ATGCTCTGGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCCTGTTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGAGGGTCCTTC
GAGGGAGAAGCGCCCAGAGATCAGACCGAGGCGGTGGACGCTCCCCTTGCCTTGGGCGAGGAGGTGGGGAGGAAGTCCCTCTGAAGCGAAGGAAGAAGAAGAAGAAGACC
ACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAA
ATTTGTAATGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAG
GGAAGTTCTGGCAGCGAGGGAGAAAGAGGGGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGA
AGGCCGAGGTGGAGACCAAAGCCGAGCTGCTGAAGAAGAAGGAAGAGACAGACGCAAGGCCCAGCTCCGAGCTGCCCTATGCTATCACCAAGGGCTTGGAGAAGGAGAAG
TTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAA
TGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCAATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACA
TGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTAAGCAGTGGGCGTCTGGGCCTAGCGGCACCCTTGGCCCCCAAGCGTTGGTGGATAAGTACGTC
AGAGATCTGGACTTTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG

Protein sequence
MLWRPPRVRNLPPPVVVGPASEDPAPVIELESSEGPSREKRPEIRPRRWTLPLPWARRWGGSPSEAKEEEEEDHLPLGGRSSWGLACELRRSGVPHLGRKFGPLPKEGVQ
ICNDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEGFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKKEETDARPSSELPYAITKGLEKEK
FQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAKQWASGPSGTLGPQALVDKYV
RDLDFDYSDLEEDQVGTTQEGAPQAGS
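
The protein sequence can be checked against the CDS by translating it codon by codon. The Python sketch below assumes the standard genetic code (the record does not say which table was used) and uses a hypothetical `translate` helper; it is applied only to the first 30 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCTCTGGAGGCCGCCCAGAGTTCGAAAC"))  # start of the CDS
print(translate("CAAGCAGGCTCTTAG"))                 # end of the CDS
```

The two fragments translate to `MLWRPPRVRN` and `QAGS`, which agree with the first and last residues of the protein sequence listed above.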