; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g32580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g32580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr8:23600185..23602043
RNA-Seq ExpressionMoc08g32580
SyntenyMoc08g32580
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]3.9e-7467.24Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AF+ASI SA+++KAELDGREAL A+E+   S  LEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        + Q LE KD  +   T EL+  KERL++G LLEESFRQHP+FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK+RY+E WASGP+GTPGPQ+LVD+Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEED--------RVDSTQEGAP
        VR+LDSDYSD EE+        +V +TQE AP
Subjt:  VRDLDSDYSDPEED--------RVDSTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]2.7e-10793.86Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AFVASIQSAL +KAELDGRE LAAREK EFSAALE ASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        MLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGP GTPGPQALVD+Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEEDRVDSTQEGAPLAGS
        VRDLDSDYSDPEED+V STQEGA   GS
Subjt:  VRDLDSDYSDPEEDRVDSTQEGAPLAGS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]3.5e-7569.4Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AFVASI SA+++KAELDGREALAA+E+   SAALEAA +T+K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        + Q LE KD  +   TAEL+  KERL+NG LLEESFRQH DFDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWASGP+GTPGPQ+LV +Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEED--------RVDSTQEGAP
        VR+LDSDYSD EE+         + +TQE  P
Subjt:  VRDLDSDYSDPEED--------RVDSTQEGAP

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]4.9e-7776.32Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AFVASIQSAL +KAELDGRE LAAREK EFSAALEAA  TMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        MLQALE KDKELEHATAELETAKERLSN                                        +IDLSGLKRRYAEKWASGP GTPGPQALVD+Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEEDRVDSTQEGAPLAGS
        VRDLDSDYSDP+ED+V STQEGAP AGS
Subjt:  VRDLDSDYSDPEEDRVDSTQEGAPLAGS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.0e-7467.67Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AF+ASI  A+++KAELDGREALAA+E+    AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        + Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP+GTP PQ+LVD+Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEED--------RVDSTQEGAP
        VR+LDSDYSD EE+         V +TQE  P
Subjt:  VRDLDSDYSDPEED--------RVDSTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1D1N9 uncharacterized protein LOC1110161931.9e-7467.24Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AF+ASI SA+++KAELDGREAL A+E+   S  LEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        + Q LE KD  +   T EL+  KERL++G LLEESFRQHP+FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK+RY+E WASGP+GTPGPQ+LVD+Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEED--------RVDSTQEGAP
        VR+LDSDYSD EE+        +V +TQE AP
Subjt:  VRDLDSDYSDPEED--------RVDSTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185381.3e-10793.86Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AFVASIQSAL +KAELDGRE LAAREK EFSAALE ASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        MLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGP GTPGPQALVD+Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEEDRVDSTQEGAPLAGS
        VRDLDSDYSDPEED+V STQEGA   GS
Subjt:  VRDLDSDYSDPEEDRVDSTQEGAPLAGS

A0A6J1DF31 uncharacterized protein LOC1110199091.7e-7569.4Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AFVASI SA+++KAELDGREALAA+E+   SAALEAA +T+K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        + Q LE KD  +   TAEL+  KERL+NG LLEESFRQH DFDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWASGP+GTPGPQ+LV +Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEED--------RVDSTQEGAP
        VR+LDSDYSD EE+         + +TQE  P
Subjt:  VRDLDSDYSDPEED--------RVDSTQEGAP

A0A6J1DVF6 uncharacterized protein LOC1110247402.4e-7776.32Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AFVASIQSAL +KAELDGRE LAAREK EFSAALEAA  TMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        MLQALE KDKELEHATAELETAKERLSN                                        +IDLSGLKRRYAEKWASGP GTPGPQALVD+Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEEDRVDSTQEGAPLAGS
        VRDLDSDYSDP+ED+V STQEGAP AGS
Subjt:  VRDLDSDYSDPEEDRVDSTQEGAPLAGS

A0A6J1DZB3 uncharacterized protein LOC1110256655.0e-7567.67Show/hide
Query:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD
        AF+ASI  A+++KAELDGREALAA+E+    AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD
Subjt:  AFVASIQSALVIKAELDGREALAAREKGEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDD

Query:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY
        + Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP+GTP PQ+LVD+Y
Subjt:  MLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRY

Query:  VRDLDSDYSDPEED--------RVDSTQEGAP
        VR+LDSDYSD EE+         V +TQE  P
Subjt:  VRDLDSDYSDPEED--------RVDSTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related1.8e-0531.5Show/hide
Query:  PENILLRLPEEGERADHPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIG
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    +D D         R+ 
Subjt:  PENILLRLPEEGERADHPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIG

Query:  KKPGRFYMCARKGAGGIVKGPTSKSRG
        + PG +Y  A K    IV G  SK  G
Subjt:  KKPGRFYMCARKGAGGIVKGPTSKSRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCATCTTGGGGCACCAATAGGGGTCTTCCACGTGTCCCGAGTATACCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAG
AAGTTCATTCGACCTGCTTTGGACACGTGGCAACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTC
GGGTCTCAGAGAGGATCCCAGCCACTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCT
ATTAGCAGCAACCTAGGATCCGATCTAGCTCGTAGGATACCTGAGCACTACCTTGGATCCCTCCGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTCAGGCTT
CCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTTCGAGCTCGGGATAGT
GAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGGTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCA
GGTGGTATAGTTAAGGGGCCGACCTCCAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGAAGGGCGTCCAAGTT
TGTGAGCGCCCCGGGGCGTTTGTTGCCTCCATTCAATCGGCTCTGGTTATAAAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAAGGAGAGTTC
TCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTTCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTA
CTGAAGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGAC
ATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCA
TTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATC
GATTTAAGTGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCGGTATGTCAGGGATCTGGAT
TCTGACTACTCCGATCCCGAAGAGGACCGGGTCGACTCCACTCAGGAGGGCGCTCCCCTAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCATCTTGGGGCACCAATAGGGGTCTTCCACGTGTCCCGAGTATACCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAG
AAGTTCATTCGACCTGCTTTGGACACGTGGCAACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTC
GGGTCTCAGAGAGGATCCCAGCCACTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCT
ATTAGCAGCAACCTAGGATCCGATCTAGCTCGTAGGATACCTGAGCACTACCTTGGATCCCTCCGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTCAGGCTT
CCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTTCGAGCTCGGGATAGT
GAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGGTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCA
GGTGGTATAGTTAAGGGGCCGACCTCCAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGAAGGGCGTCCAAGTT
TGTGAGCGCCCCGGGGCGTTTGTTGCCTCCATTCAATCGGCTCTGGTTATAAAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAAGGAGAGTTC
TCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTTCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTA
CTGAAGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGAC
ATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCA
TTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATC
GATTTAAGTGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCGGTATGTCAGGGATCTGGAT
TCTGACTACTCCGATCCCGAAGAGGACCGGGTCGACTCCACTCAGGAGGGCGCTCCCCTAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSHLGAPIGVFHVSRVYPSPNIGPLSVWSDLDLAEKFIRPALDTWQLPIRGKIQPSRKIYRRNIQIFRRFGSQRGSQPLVDYTSRTLGRSVSSLSLSNIIAMSSS
ISSNLGSDLARRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDS
EEAELLDVDQLLACFEAKRIGKKPGRFYMCARKGAGGIVKGPTSKSRGEGAGDPHLSCEFGPLPKKGVQVCERPGAFVASIQSALVIKAELDGREALAAREKGEF
SAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEES
FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDRYVRDLDSDYSDPEEDRVDSTQEGAPLAGS