; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g14290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g14290
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMuDRA-like transposase
Genome locationchr2:10591256..10593724
RNA-Seq ExpressionMoc02g14290
SyntenyMoc02g14290
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]4.3e-7564.26Show/hide
Query:  EGHSEVEYGNEEHDDALDDELEPDVEQVH-TEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKK
        EG  E EYGNE   D LD + E +   +H T      + V     N +TG    ++LQ +VQS+ T+DV E DVFD+KKEL +KMHL+A+RKNFQF+VKK
Subjt:  EGHSEVEYGNEEHDDALDDELEPDVEQVH-TEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKK

Query:  STPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCGGA-LKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSS
        STP+LY++RC D  CTWRLR TK+++C LFKIKKY A H+ C GA +K DHRQAKSWVV HLVQ KFTDVSRTYRPKDI+QD+R+EY VN+SYD+AWRSS
Subjt:  STPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCGGA-LKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSS

Query:  EEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD
        EEALRLIRGDPASSY LLPAYGEA+KIMNP + F+
Subjt:  EEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD

XP_022148135.1 uncharacterized protein LOC111016888 [Momordica charantia]2.3e-8483.25Show/hide
Query:  GCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCG
        GC+GLTG PNDEKLQ +VQSSGTNDV EG+VFD KKELSL+MHLVAMR NFQFKVKKSTPELYILRC D  CTWRLRATKL++C +FKIKKY + HTC G
Subjt:  GCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCG

Query:  GALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD
        G LK DHRQAKSWVV HLVQ KFTDVSRTYRPKDIIQDMRKEY VNLSYD+AWRSSEEALRLIR DPASSYGLL AYGEALKIMNP + F+
Subjt:  GALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]2.0e-9679.06Show/hide
Query:  EEGHSEVEYGNEEHDDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKK
        EEG  E E+ N+++DDALD+E EPDVEQVH EI RDE AV+  GC+GLTG  N E LQLIVQSSGTNDV EG+VFD KKELSL+MHLV MR NFQFKVKK
Subjt:  EEGHSEVEYGNEEHDDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKK

Query:  STPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCGGALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSE
        STPELYIL C D  CTWRLRATKL++C LFKIKKY + HTC GG LK DHRQAKSWVV HLVQ KFTDVSRTYRPKDIIQDMRKEY VNLSYD+AWRSSE
Subjt:  STPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCGGALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSE

Query:  EALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD
        EALRLIRGDPASSYGLLP YGEALKIMNP + F+
Subjt:  EALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia]3.5e-12560.48Show/hide
Query:  DMMPRVFITFGGEWNDSEKDYVGGRTRGLTVDSTITYREFLGHVYRLSNINPLQFDIIIRRVYHFKTKVCVMEITDDDDLRFFYWPKCVLDSYVRSLYSP
        +M  RVFITFGGEWNDSEKDYVGGR RGLTVDS               N++     I            C+           F  P   + S+  S  +P
Subjt:  DMMPRVFITFGGEWNDSEKDYVGGRTRGLTVDSTITYREFLGHVYRLSNINPLQFDIIIRRVYHFKTKVCVMEITDDDDLRFFYWPKCVLDSYVRSLYSP

Query:  IPETHISYTLISFLIIEPLFFPTATPSYGHIGHDVEGLTTLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLAESEEGHSEVEYGNEEHDDALD
                        +P  +      YGH+GHD+ GLT L SDVVPCNLGDDRVC W++PG+WNDN+DES ESYD L +SEEG  E E+ N+++DDA D
Subjt:  IPETHISYTLISFLIIEPLFFPTATPSYGHIGHDVEGLTTLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLAESEEGHSEVEYGNEEHDDALD

Query:  DELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRCAD--CTWRL
        ++ EPDVEQV  EIRRDE  V   GC+GL G PNDEKLQLIVQSSGTNDV EG VFD KKELSL+ HLVAM  NFQFKVKKSTPELYILRC D  CTWRL
Subjt:  DELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRCAD--CTWRL

Query:  RATKLKECTLFKIKKYCAAHTCCGGALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPA
        RA KL +C LFKIKKY + HTC G  LK DHRQAK+WVVRHLVQ KFTDVS TYRPKDIIQDMRKEY VNLSYD+AW+S+EEALRLIRGDP +SYGLLPA
Subjt:  RATKLKECTLFKIKKYCAAHTCCGGALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPA

Query:  YGEALKIMNPVSSFD
        YG  L  + PV   D
Subjt:  YGEALKIMNPVSSFD

XP_022157017.1 uncharacterized protein LOC111023843 [Momordica charantia]2.0e-14993.66Show/hide
Query:  GHDVEGLTTLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLAESEEGHSEVEYGNEEHDDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTG
        GHDVEGLT LGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLA SEEGHS+ EYGNEEHDDALDDELE DVEQVHTEIRRDEEAVR PGCNGLTG
Subjt:  GHDVEGLTTLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLAESEEGHSEVEYGNEEHDDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTG

Query:  DPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRC--ADCTWRLRATKLKECTLFKIKKYCAAHTCCGGALKHDH
        DPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTP+LYILRC  ADCTWRLRATKLKECTLFKIKKYCA HTC GGALKHDH
Subjt:  DPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRC--ADCTWRLRATKLKECTLFKIKKYCAAHTCCGGALKHDH

Query:  RQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD
        RQAKSWVV HLVQEKFTDVSRTYRPKDIIQDMRKEY VNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYG+ALKIMNP + F+
Subjt:  RQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD

TrEMBL top hitse value%identityAlignment
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like2.1e-7564.26Show/hide
Query:  EGHSEVEYGNEEHDDALDDELEPDVEQVH-TEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKK
        EG  E EYGNE   D LD + E +   +H T      + V     N +TG    ++LQ +VQS+ T+DV E DVFD+KKEL +KMHL+A+RKNFQF+VKK
Subjt:  EGHSEVEYGNEEHDDALDDELEPDVEQVH-TEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKK

Query:  STPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCGGA-LKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSS
        STP+LY++RC D  CTWRLR TK+++C LFKIKKY A H+ C GA +K DHRQAKSWVV HLVQ KFTDVSRTYRPKDI+QD+R+EY VN+SYD+AWRSS
Subjt:  STPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCGGA-LKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSS

Query:  EEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD
        EEALRLIRGDPASSY LLPAYGEA+KIMNP + F+
Subjt:  EEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD

A0A6J1D234 uncharacterized protein LOC1110168881.1e-8483.25Show/hide
Query:  GCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCG
        GC+GLTG PNDEKLQ +VQSSGTNDV EG+VFD KKELSL+MHLVAMR NFQFKVKKSTPELYILRC D  CTWRLRATKL++C +FKIKKY + HTC G
Subjt:  GCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCG

Query:  GALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD
        G LK DHRQAKSWVV HLVQ KFTDVSRTYRPKDIIQDMRKEY VNLSYD+AWRSSEEALRLIR DPASSYGLL AYGEALKIMNP + F+
Subjt:  GALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD

A0A6J1DJT1 uncharacterized protein LOC1110207159.6e-9779.06Show/hide
Query:  EEGHSEVEYGNEEHDDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKK
        EEG  E E+ N+++DDALD+E EPDVEQVH EI RDE AV+  GC+GLTG  N E LQLIVQSSGTNDV EG+VFD KKELSL+MHLV MR NFQFKVKK
Subjt:  EEGHSEVEYGNEEHDDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKK

Query:  STPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCGGALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSE
        STPELYIL C D  CTWRLRATKL++C LFKIKKY + HTC GG LK DHRQAKSWVV HLVQ KFTDVSRTYRPKDIIQDMRKEY VNLSYD+AWRSSE
Subjt:  STPELYILRCAD--CTWRLRATKLKECTLFKIKKYCAAHTCCGGALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSE

Query:  EALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD
        EALRLIRGDPASSYGLLP YGEALKIMNP + F+
Subjt:  EALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD

A0A6J1DP00 uncharacterized protein LOC1110229541.7e-12560.48Show/hide
Query:  DMMPRVFITFGGEWNDSEKDYVGGRTRGLTVDSTITYREFLGHVYRLSNINPLQFDIIIRRVYHFKTKVCVMEITDDDDLRFFYWPKCVLDSYVRSLYSP
        +M  RVFITFGGEWNDSEKDYVGGR RGLTVDS               N++     I            C+           F  P   + S+  S  +P
Subjt:  DMMPRVFITFGGEWNDSEKDYVGGRTRGLTVDSTITYREFLGHVYRLSNINPLQFDIIIRRVYHFKTKVCVMEITDDDDLRFFYWPKCVLDSYVRSLYSP

Query:  IPETHISYTLISFLIIEPLFFPTATPSYGHIGHDVEGLTTLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLAESEEGHSEVEYGNEEHDDALD
                        +P  +      YGH+GHD+ GLT L SDVVPCNLGDDRVC W++PG+WNDN+DES ESYD L +SEEG  E E+ N+++DDA D
Subjt:  IPETHISYTLISFLIIEPLFFPTATPSYGHIGHDVEGLTTLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLAESEEGHSEVEYGNEEHDDALD

Query:  DELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRCAD--CTWRL
        ++ EPDVEQV  EIRRDE  V   GC+GL G PNDEKLQLIVQSSGTNDV EG VFD KKELSL+ HLVAM  NFQFKVKKSTPELYILRC D  CTWRL
Subjt:  DELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRCAD--CTWRL

Query:  RATKLKECTLFKIKKYCAAHTCCGGALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPA
        RA KL +C LFKIKKY + HTC G  LK DHRQAK+WVVRHLVQ KFTDVS TYRPKDIIQDMRKEY VNLSYD+AW+S+EEALRLIRGDP +SYGLLPA
Subjt:  RATKLKECTLFKIKKYCAAHTCCGGALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPA

Query:  YGEALKIMNPVSSFD
        YG  L  + PV   D
Subjt:  YGEALKIMNPVSSFD

A0A6J1DTG5 uncharacterized protein LOC1110238439.8e-15093.66Show/hide
Query:  GHDVEGLTTLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLAESEEGHSEVEYGNEEHDDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTG
        GHDVEGLT LGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLA SEEGHS+ EYGNEEHDDALDDELE DVEQVHTEIRRDEEAVR PGCNGLTG
Subjt:  GHDVEGLTTLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLAESEEGHSEVEYGNEEHDDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTG

Query:  DPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRC--ADCTWRLRATKLKECTLFKIKKYCAAHTCCGGALKHDH
        DPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTP+LYILRC  ADCTWRLRATKLKECTLFKIKKYCA HTC GGALKHDH
Subjt:  DPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRC--ADCTWRLRATKLKECTLFKIKKYCAAHTCCGGALKHDH

Query:  RQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD
        RQAKSWVV HLVQEKFTDVSRTYRPKDIIQDMRKEY VNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYG+ALKIMNP + F+
Subjt:  RQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYGEALKIMNPVSSFD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCTCGGACCAGTCTTGGTCCGAGACATGTCCTGGACTAATCTCGAAGCATATTTAGTCTGGGACATGATGCCTCGTGTTTTCATAACATTCGGTGGAGAA
TGGAATGATAGTGAAAAAGATTATGTCGGCGGTCGTACGAGGGGATTGACAGTGGATAGTACAATCACGTACAGAGAATTTCTAGGTCACGTGTATAGATTGAGT
AACATTAACCCCCTACAGTTTGATATTATAATTAGACGTGTGTATCATTTTAAAACTAAAGTTTGTGTGATGGAAATAACTGATGACGATGACTTGCGTTTTTTT
TACTGGCCTAAATGTGTCCTCGATTCTTATGTCCGCAGCTTGTATTCCCCCATTCCAGAGACCCACATATCCTATACCCTCATTTCCTTCCTCATCATCGAACCC
CTCTTCTTCCCGACAGCCACACCCTCCTACGGGCATATAGGTCATGATGTAGAGGGTTTAACAACATTGGGGTCAGATGTTGTTCCATGTAATCTAGGAGATGAT
AGAGTGTGTGATTGGGATGTGCCGGGAGTATGGAATGATAACGAAGATGAAAGTGGTGAATCATATGACCCGTTGGCAGAGTCTGAAGAAGGACACTCTGAAGTA
GAATATGGGAACGAAGAGCATGACGATGCGCTTGATGATGAGCTTGAGCCTGATGTCGAACAAGTGCACACTGAGATTCGCAGGGATGAAGAAGCGGTCCGGCCA
CCAGGATGTAATGGTCTCACCGGAGACCCTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATGTATTTGATAAT
AAGAAGGAGTTGAGTTTGAAAATGCATTTAGTTGCAATGCGGAAGAATTTTCAGTTTAAAGTAAAGAAGTCGACGCCGGAGCTATATATACTGCGGTGCGCTGAT
TGCACGTGGAGACTTCGAGCTACCAAGCTAAAGGAATGCACTTTGTTCAAGATAAAAAAATATTGTGCTGCCCATACGTGCTGTGGTGGAGCTTTAAAACATGAT
CATAGGCAAGCAAAAAGTTGGGTGGTAAGACATCTAGTGCAAGAGAAGTTCACAGACGTCTCCCGCACGTATAGACCGAAGGACATTATACAAGACATGAGGAAG
GAGTATGATGTCAATTTAAGTTATGATAGAGCATGGCGTTCTAGTGAAGAAGCACTCCGGCTTATTAGAGGTGATCCAGCATCGTCATATGGTCTACTTCCAGCT
TATGGTGAAGCTTTGAAAATCATGAACCCAGTGTCCAGCTTTGACCCGATAACATCGTACTCGAAGAAGCGTACACAGAAAATTCCGCAATCTGTGGCGTTACTC
TGTTGGGGTGTGCGAACCCGACGCACCCTCCATGGCACCACTGGCAGGTCCGGTCGTGATGCAAATATCCCGCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATCTCGGACCAGTCTTGGTCCGAGACATGTCCTGGACTAATCTCGAAGCATATTTAGTCTGGGACATGATGCCTCGTGTTTTCATAACATTCGGTGGAGAA
TGGAATGATAGTGAAAAAGATTATGTCGGCGGTCGTACGAGGGGATTGACAGTGGATAGTACAATCACGTACAGAGAATTTCTAGGTCACGTGTATAGATTGAGT
AACATTAACCCCCTACAGTTTGATATTATAATTAGACGTGTGTATCATTTTAAAACTAAAGTTTGTGTGATGGAAATAACTGATGACGATGACTTGCGTTTTTTT
TACTGGCCTAAATGTGTCCTCGATTCTTATGTCCGCAGCTTGTATTCCCCCATTCCAGAGACCCACATATCCTATACCCTCATTTCCTTCCTCATCATCGAACCC
CTCTTCTTCCCGACAGCCACACCCTCCTACGGGCATATAGGTCATGATGTAGAGGGTTTAACAACATTGGGGTCAGATGTTGTTCCATGTAATCTAGGAGATGAT
AGAGTGTGTGATTGGGATGTGCCGGGAGTATGGAATGATAACGAAGATGAAAGTGGTGAATCATATGACCCGTTGGCAGAGTCTGAAGAAGGACACTCTGAAGTA
GAATATGGGAACGAAGAGCATGACGATGCGCTTGATGATGAGCTTGAGCCTGATGTCGAACAAGTGCACACTGAGATTCGCAGGGATGAAGAAGCGGTCCGGCCA
CCAGGATGTAATGGTCTCACCGGAGACCCTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATGTATTTGATAAT
AAGAAGGAGTTGAGTTTGAAAATGCATTTAGTTGCAATGCGGAAGAATTTTCAGTTTAAAGTAAAGAAGTCGACGCCGGAGCTATATATACTGCGGTGCGCTGAT
TGCACGTGGAGACTTCGAGCTACCAAGCTAAAGGAATGCACTTTGTTCAAGATAAAAAAATATTGTGCTGCCCATACGTGCTGTGGTGGAGCTTTAAAACATGAT
CATAGGCAAGCAAAAAGTTGGGTGGTAAGACATCTAGTGCAAGAGAAGTTCACAGACGTCTCCCGCACGTATAGACCGAAGGACATTATACAAGACATGAGGAAG
GAGTATGATGTCAATTTAAGTTATGATAGAGCATGGCGTTCTAGTGAAGAAGCACTCCGGCTTATTAGAGGTGATCCAGCATCGTCATATGGTCTACTTCCAGCT
TATGGTGAAGCTTTGAAAATCATGAACCCAGTGTCCAGCTTTGACCCGATAACATCGTACTCGAAGAAGCGTACACAGAAAATTCCGCAATCTGTGGCGTTACTC
TGTTGGGGTGTGCGAACCCGACGCACCCTCCATGGCACCACTGGCAGGTCCGGTCGTGATGCAAATATCCCGCCGTGA
Protein sequenceShow/hide protein sequence
MYLGPVLVRDMSWTNLEAYLVWDMMPRVFITFGGEWNDSEKDYVGGRTRGLTVDSTITYREFLGHVYRLSNINPLQFDIIIRRVYHFKTKVCVMEITDDDDLRFF
YWPKCVLDSYVRSLYSPIPETHISYTLISFLIIEPLFFPTATPSYGHIGHDVEGLTTLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDPLAESEEGHSEV
EYGNEEHDDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPELYILRCAD
CTWRLRATKLKECTLFKIKKYCAAHTCCGGALKHDHRQAKSWVVRHLVQEKFTDVSRTYRPKDIIQDMRKEYDVNLSYDRAWRSSEEALRLIRGDPASSYGLLPA
YGEALKIMNPVSSFDPITSYSKKRTQKIPQSVALLCWGVRTRRTLHGTTGRSGRDANIPP