; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g12840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g12840
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein FAR1-RELATED SEQUENCE 4-like
Genome locationchr6:9920628..9921461
RNA-Seq ExpressionMoc06g12840
SyntenyMoc06g12840
Gene Ontology termsGO:0006313 - transposition, DNA-mediated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004803 - transposase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]1.3e-11373.38Show/hide
Query:  MNLLAKF--TTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQ
        MNLLAKF     ALE LF KAAKA+RESYFN  W QL  +PGVREYL+ IGKERWARCFQT+LRY+QMT+N AES+NALFRHARKL VTALLDHIRG+LQ
Subjt:  MNLLAKF--TTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQ

Query:  RWIYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVN
         W Y+RRTLASSR +TLS   E  +AE SDNARRH+V+NIDQF+ +VRDGNL+G VD  S+TC CREFDYFK+PCSHAIA A  R+INPYTLCDEAYT N
Subjt:  RWIYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVN

Query:  SWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT
        SW++AYAEPIFP+G  STW SSP F++  V+ P  V RVGRR+TVRIPSTGEVR  RKC RCGTSGHN KTC EPLNT
Subjt:  SWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT

XP_022154610.1 uncharacterized protein LOC111021833 [Momordica charantia]6.7e-11372.04Show/hide
Query:  MNLLAKF--TTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQ
        +NL+ KF     A+E LF KA KA+RESYFN  W QL  +PGVREYL+ IGKERWARCFQT+LRY+QMTTNIAES+NA FRHARKL VTALLDHIRG L 
Subjt:  MNLLAKF--TTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQ

Query:  RWIYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVN
         W Y+RRTLA+SR +TLSD  E M AE SD+ARRH+V NIDQF+F+VRDGN +G VDL + TC+CREFDYFK+PCSH IAAA  R+INPY+LCDEAYT N
Subjt:  RWIYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVN

Query:  SWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        SW+LAYAEPIFPVG  STW SSP F+NI V+PPK V RVGRR T RIPSTGEVR  RKC RCG  GHNRKTC EPL T+
Subjt:  SWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

XP_022154964.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]2.3e-12186.82Show/hide
Query:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW
        MNLLAKF T  LE LFFKAAKA RESYFNENWVQLC HPGVREY+E IGKERWARCFQTKLRY QMTTNIAES+NALFRHARKL VTALLDHIRGVLQRW
Subjt:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW

Query:  IYERRTLASSRQSTL------------------SDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANS
         YERRTLASSRQSTL                  SD  EEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAA S
Subjt:  IYERRTLASSRQSTL------------------SDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANS

Query:  RSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRR
        RSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGF+NIDVQPPKKVVRVGRR
Subjt:  RSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRR

XP_022156122.1 uncharacterized protein LOC111023087 [Momordica charantia]5.1e-12984.84Show/hide
Query:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW
        M+LLAKF TPALE LFFKAAKAFRESYFNENWVQLC +PGVREYLEAI KERWARCFQ KLRYSQMTTNIAES+NALFRHARKL VTALLDHIR      
Subjt:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW

Query:  IYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSW
                           EEMIA+ASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKV CS AIAAA+SRSINPYTLCDE YTVNSW
Subjt:  IYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSW

Query:  MLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        MLAYAEPIFPVGSSSTWKSSPGF+NIDVQPPKKVVRVGRRQTV+IPSTGEVRPPR CSRCGTSGHNRKTCREPLNTV
Subjt:  MLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

XP_022158655.1 uncharacterized protein LOC111025117 [Momordica charantia]4.5e-13386.28Show/hide
Query:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW
        MNLLAKF TPALE LFFKAAKAF E YFNENWVQLC HPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAES+NAL                   + RW
Subjt:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW

Query:  IYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSW
         YER+TLASSRQSTLSD  EEMIAEA+DN+RRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKV CSHAIAAANSRSINPYTLCDEAYTVNSW
Subjt:  IYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSW

Query:  MLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        MLA+AEPIF VGSS+TWKSSPGF+NIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
Subjt:  MLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

TrEMBL top hitse value%identityAlignment
A0A6J1DJT1 uncharacterized protein LOC1110207156.5e-11473.38Show/hide
Query:  MNLLAKF--TTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQ
        MNLLAKF     ALE LF KAAKA+RESYFN  W QL  +PGVREYL+ IGKERWARCFQT+LRY+QMT+N AES+NALFRHARKL VTALLDHIRG+LQ
Subjt:  MNLLAKF--TTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQ

Query:  RWIYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVN
         W Y+RRTLASSR +TLS   E  +AE SDNARRH+V+NIDQF+ +VRDGNL+G VD  S+TC CREFDYFK+PCSHAIA A  R+INPYTLCDEAYT N
Subjt:  RWIYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVN

Query:  SWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT
        SW++AYAEPIFP+G  STW SSP F++  V+ P  V RVGRR+TVRIPSTGEVR  RKC RCGTSGHN KTC EPLNT
Subjt:  SWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT

A0A6J1DK35 uncharacterized protein LOC1110218333.2e-11372.04Show/hide
Query:  MNLLAKF--TTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQ
        +NL+ KF     A+E LF KA KA+RESYFN  W QL  +PGVREYL+ IGKERWARCFQT+LRY+QMTTNIAES+NA FRHARKL VTALLDHIRG L 
Subjt:  MNLLAKF--TTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQ

Query:  RWIYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVN
         W Y+RRTLA+SR +TLSD  E M AE SD+ARRH+V NIDQF+F+VRDGN +G VDL + TC+CREFDYFK+PCSH IAAA  R+INPY+LCDEAYT N
Subjt:  RWIYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVN

Query:  SWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        SW+LAYAEPIFPVG  STW SSP F+NI V+PPK V RVGRR T RIPSTGEVR  RKC RCG  GHNRKTC EPL T+
Subjt:  SWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

A0A6J1DNT3 protein FAR1-RELATED SEQUENCE 4-like1.1e-12186.82Show/hide
Query:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW
        MNLLAKF T  LE LFFKAAKA RESYFNENWVQLC HPGVREY+E IGKERWARCFQTKLRY QMTTNIAES+NALFRHARKL VTALLDHIRGVLQRW
Subjt:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW

Query:  IYERRTLASSRQSTL------------------SDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANS
         YERRTLASSRQSTL                  SD  EEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAA S
Subjt:  IYERRTLASSRQSTL------------------SDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANS

Query:  RSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRR
        RSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGF+NIDVQPPKKVVRVGRR
Subjt:  RSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRR

A0A6J1DR67 uncharacterized protein LOC1110230872.5e-12984.84Show/hide
Query:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW
        M+LLAKF TPALE LFFKAAKAFRESYFNENWVQLC +PGVREYLEAI KERWARCFQ KLRYSQMTTNIAES+NALFRHARKL VTALLDHIR      
Subjt:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW

Query:  IYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSW
                           EEMIA+ASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKV CS AIAAA+SRSINPYTLCDE YTVNSW
Subjt:  IYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSW

Query:  MLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        MLAYAEPIFPVGSSSTWKSSPGF+NIDVQPPKKVVRVGRRQTV+IPSTGEVRPPR CSRCGTSGHNRKTCREPLNTV
Subjt:  MLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

A0A6J1DWF8 uncharacterized protein LOC1110251172.2e-13386.28Show/hide
Query:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW
        MNLLAKF TPALE LFFKAAKAF E YFNENWVQLC HPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAES+NAL                   + RW
Subjt:  MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRW

Query:  IYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSW
         YER+TLASSRQSTLSD  EEMIAEA+DN+RRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKV CSHAIAAANSRSINPYTLCDEAYTVNSW
Subjt:  IYERRTLASSRQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSW

Query:  MLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        MLA+AEPIF VGSS+TWKSSPGF+NIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
Subjt:  MLAYAEPIFPVGSSSTWKSSPGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase1.9e-0934.74Show/hide
Query:  NGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFMN----IDVQPPKKVVRVGRRQ
        +G V L   TCTC EF   K PC HA+A  +   INP    D+ YTV  +   Y+    PV   S W  + G       +   PP KV   G+ +
Subjt:  NGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFMN----IDVQPPKKVVRVGRRQ

AT1G64255.1 MuDR family transposase5.1e-1026.63Show/hide
Query:  HPGVREYLEAIGKERWARCFQTKLRYSQMTTN---IAESINALFR--HARKLSVTALLDHIRGVLQRWIYERRTLASSRQS-TLSDCTEEMIAEASDNAR
        +P  R++L+   + RWA       RY  M  N   +    NA  +  H    SV  L D +R          ++ + SR S    D   E + +  +  R
Subjt:  HPGVREYLEAIGKERWARCFQTKLRYSQMTTN---IAESINALFR--HARKLSVTALLDHIRGVLQRWIYERRTLASSRQS-TLSDCTEEMIAEASDNAR

Query:  ------RHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG
               +IV  +D   F+V      G+  V L   +CTC +F  +K PC HA+A       NP    D+ YT+      YA     V   S W  + G
Subjt:  ------RHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG

AT1G64260.1 MuDR family transposase1.2e-1125.65Show/hide
Query:  HPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRH------ARKLSVTALLDHIRGVLQR---WIYERRTLASSRQSTLSDCTEEMIAEASD
        +P   ++L+ I + +WA    + LRY  +  +  E++ A+ R       A    V  + D +R    +    IY              D  EE + ++  
Subjt:  HPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRH------ARKLSVTALLDHIRGVLQR---WIYERRTLASSRQSTLSDCTEEMIAEASD

Query:  NARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW
            +++  +++ +F+V + +   +  V L   TCTCR+F  +K PC HA+A      INP    DE YTV  +   YA    PV   + W
Subjt:  NARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTTGCTGGCCAAATTTACAACGCCCGCGCTGGAGGCATTATTTTTTAAGGCTGCGAAGGCATTTCGCGAGTCATATTTCAATGAGAACTGGGTCCAACTGTGCAC
ACACCCAGGAGTTAGGGAATATCTGGAAGCTATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAAGATACTCACAAATGACCACTAATATTGCAGAGTCCA
TTAATGCACTTTTCAGGCATGCACGTAAGTTGTCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAGAGGTGGATCTACGAACGTCGGACGCTTGCTTCTTCA
CGTCAGAGTACGTTGTCTGACTGCACAGAGGAAATGATTGCCGAAGCTTCGGATAATGCACGGAGACACATTGTGATGAACATCGACCAGTTTAATTTTGAGGTACGCGA
CGGGAACCTGAATGGGGATGTTGACTTGCAATCGCAGACGTGTACTTGTCGGGAGTTCGATTATTTTAAAGTCCCGTGCTCCCATGCTATTGCTGCAGCCAATTCTCGTA
GCATAAATCCGTACACACTATGCGATGAGGCGTACACAGTCAACAGTTGGATGTTGGCATATGCAGAACCAATATTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCT
CCGGGGTTTATGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGACAGACGGTGAGGATTCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTG
CAGTCGATGTGGTACATCGGGACACAATCGTAAAACTTGTCGCGAACCACTAAATACTGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTTGCTGGCCAAATTTACAACGCCCGCGCTGGAGGCATTATTTTTTAAGGCTGCGAAGGCATTTCGCGAGTCATATTTCAATGAGAACTGGGTCCAACTGTGCAC
ACACCCAGGAGTTAGGGAATATCTGGAAGCTATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAAGATACTCACAAATGACCACTAATATTGCAGAGTCCA
TTAATGCACTTTTCAGGCATGCACGTAAGTTGTCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAGAGGTGGATCTACGAACGTCGGACGCTTGCTTCTTCA
CGTCAGAGTACGTTGTCTGACTGCACAGAGGAAATGATTGCCGAAGCTTCGGATAATGCACGGAGACACATTGTGATGAACATCGACCAGTTTAATTTTGAGGTACGCGA
CGGGAACCTGAATGGGGATGTTGACTTGCAATCGCAGACGTGTACTTGTCGGGAGTTCGATTATTTTAAAGTCCCGTGCTCCCATGCTATTGCTGCAGCCAATTCTCGTA
GCATAAATCCGTACACACTATGCGATGAGGCGTACACAGTCAACAGTTGGATGTTGGCATATGCAGAACCAATATTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCT
CCGGGGTTTATGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGACAGACGGTGAGGATTCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTG
CAGTCGATGTGGTACATCGGGACACAATCGTAAAACTTGTCGCGAACCACTAAATACTGTGTAG
Protein sequenceShow/hide protein sequence
MNLLAKFTTPALEALFFKAAKAFRESYFNENWVQLCTHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESINALFRHARKLSVTALLDHIRGVLQRWIYERRTLASS
RQSTLSDCTEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAANSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSS
PGFMNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV