; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g10630 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g10630
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMuDRA-like transposase
Genome locationchr5:8350557..8353719
RNA-Seq ExpressionMoc05g10630
SyntenyMoc05g10630
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145824.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Momordica charantia]1.8e-11097.99Show/hide
Query:  KEFGDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ
        K + DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ
Subjt:  KEFGDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ

Query:  NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP
        NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAE ILP
Subjt:  NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP

XP_022154925.1 uncharacterized protein LOC111022071 [Momordica charantia]1.1e-16858.12Show/hide
Query:  MFITDDDDLRFALVQEQVSKVPLFVSTVLRESIDLQLSRLNQDGEQSIPNGESVPEESSTRVDEWGLNDVYMQDMYSSDGIYTQDSELVTPATRMPSVMH
        MF+TDDDDLRFALVQEQVSKVPLFVST+ RESID+Q SRLNQDGEQSIPNG SVPEE STRVDEWGLNDVYMQDMYSSD                     
Subjt:  MFITDDDDLRFALVQEQVSKVPLFVSTVLRESIDLQLSRLNQDGEQSIPNGESVPEESSTRVDEWGLNDVYMQDMYSSDGIYTQDSELVTPATRMPSVMH

Query:  VPSDEINNTAQVQHASSRTEHVFPFDELNNTVYVQHARPLTDHVFPLVEPSSTEPGNVGDHQIPVPQSASTVTIKQATRYNLLPSGSELHVGKIFVSKQD
                                         ++HA PLT      VEPSS++P NV   QIPVPQS STVTI+QA RYNLLP GSELHVGKIFVSKQD
Subjt:  VPSDEINNTAQVQHASSRTEHVFPFDELNNTVYVQHARPLTDHVFPLVEPSSTEPGNVGDHQIPVPQSASTVTIKQATRYNLLPSGSELHVGKIFVSKQD

Query:  LRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCMIDTVNHDHKQASSWVVANLIKDRVAGTDRIYKIKHIKE
        LRMVLSNAAM+SNREYKVS+STKSKF VRCIDNTCNWRV AHSVGKSS+FCISKYVDAHTC ID+VNHDHKQASSWVVANLIKD VAGT RIYKIKHIKE
Subjt:  LRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCMIDTVNHDHKQASSWVVANLIKDRVAGTDRIYKIKHIKE

Query:  DVRKEFG---------------------------------------------------------------------------------------------
        DVRKE+G                                                                                             
Subjt:  DVRKEFG---------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRY
                                                            DA YAYRKSQFTYYWNQILS+GSG+LAKYLQEIG+ERWARCYQVGRRY
Subjt:  ----------------------------------------------------DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRY

Query:  ENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDG
        EN TTNSAESVNALLR+ARELPITKIVEFIR+LLQRWFHERR HWSTQNTSHSDYAEERLA+QFEKSRRYTVKPVDWCMF VEDG
Subjt:  ENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDG

XP_022154997.1 uncharacterized protein LOC111022140 [Momordica charantia]3.6e-11154.42Show/hide
Query:  GSELHVGKIFVSKQDLRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCMIDTVNHDHKQASSWVVANLIKDR
        GSELHVGKIFVSKQDL MV+SNAAM+SNREYKVS+STKSKFVVRCI+NTCN RV  HSVGKSS+FCISKYVDAHTCMIDTVNHDHKQASS +VANLIKDR
Subjt:  GSELHVGKIFVSKQDLRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCMIDTVNHDHKQASSWVVANLIKDR

Query:  VAGTDRIYKIKHIKEDVRKEFG------------------------------------------------------------------------------
        VAG  RIY IKHIKEDVRKEFG                                                                              
Subjt:  VAGTDRIYKIKHIKEDVRKEFG------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEI
                                                                           DA YAYRKSQFTYYWNQILSVGSG+LAKYLQEI
Subjt:  -------------------------------------------------------------------DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEI

Query:  GVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDG
        GVERWARCYQVGRRYENMTTNS +S                             ERRTHWSTQNTSHSDYA+ERLALQFEKSRRYTVKPVDWCMF VED 
Subjt:  GVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDG

Query:  GLDGTVDLNARTCTCMEFQYMGIPCSHAIA
        GLDGTVDLNA TCTCMEFQYMGIPCSHAIA
Subjt:  GLDGTVDLNARTCTCMEFQYMGIPCSHAIA

XP_022156308.1 uncharacterized protein LOC111023235 [Momordica charantia]2.5e-10492.46Show/hide
Query:  KEFGDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ
        K + DA YAYRKSQFTYYWNQILSVGSG+LAKYLQEIG+ERWARCYQVGRRYENMTTNSAESVNALLR+ARELPITKIVEFI DLLQRWFHERRTHWSTQ
Subjt:  KEFGDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ

Query:  NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP
        NTSHSDYAEERLALQFEKSRRYTVKPVDWCMF VED GLD TVDLNARTCTCMEFQYMGIPCSHAIA  RHKNINCHTLIDPCY+VDSLIGAYAE ILP
Subjt:  NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP

XP_022159086.1 uncharacterized protein LOC111025530 [Momordica charantia]6.0e-14362.22Show/hide
Query:  IPVPQSASTVTIKQATRYNLLPSGSELHVGKIFVSKQDLRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCM
        IP+  S S+   ++  R     S S+L++G+I   K +L   +       NREYK S+STKSKFVVRCIDNTCNWRV AHSVGKSS+FC SKYVDAHTC 
Subjt:  IPVPQSASTVTIKQATRYNLLPSGSELHVGKIFVSKQDLRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCM

Query:  IDTVNHDHKQASSWVVANLIKDRVAGTDRIYKIKHIKEDVRKEF--------------------------------------------------------
        IDTVNHDHKQASSWVVANLIKDRVA T RIYKIKHIKEDVR+EF                                                        
Subjt:  IDTVNHDHKQASSWVVANLIKDRVAGTDRIYKIKHIKEDVRKEF--------------------------------------------------------

Query:  ------------------------------------------------------GDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVG
                                                               DA YAYRKSQFTYYWNQILSVGSG+LAKYLQEIGVERWARCYQVG
Subjt:  ------------------------------------------------------GDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVG

Query:  RRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNART
        RRYENMTTNSAESVN LLR+A ELPITKIVEFIRDLLQRWFH RRTHWSTQNTSHSDYAEE+LALQFEKSRRYTVKPVDWCMF VEDGGLDGTVDLNART
Subjt:  RRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNART

Query:  CTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP
        CTCMEFQYMGIPCSHAIA  RHKNINCHTLIDPCYSVDSLI AYAE ILP
Subjt:  CTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP

TrEMBL top hitse value%identityAlignment
A0A6J1CWE6 protein FAR-RED ELONGATED HYPOCOTYL 3-like8.5e-11197.99Show/hide
Query:  KEFGDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ
        K + DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ
Subjt:  KEFGDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ

Query:  NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP
        NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAE ILP
Subjt:  NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP

A0A6J1DL08 uncharacterized protein LOC1110220715.3e-16958.12Show/hide
Query:  MFITDDDDLRFALVQEQVSKVPLFVSTVLRESIDLQLSRLNQDGEQSIPNGESVPEESSTRVDEWGLNDVYMQDMYSSDGIYTQDSELVTPATRMPSVMH
        MF+TDDDDLRFALVQEQVSKVPLFVST+ RESID+Q SRLNQDGEQSIPNG SVPEE STRVDEWGLNDVYMQDMYSSD                     
Subjt:  MFITDDDDLRFALVQEQVSKVPLFVSTVLRESIDLQLSRLNQDGEQSIPNGESVPEESSTRVDEWGLNDVYMQDMYSSDGIYTQDSELVTPATRMPSVMH

Query:  VPSDEINNTAQVQHASSRTEHVFPFDELNNTVYVQHARPLTDHVFPLVEPSSTEPGNVGDHQIPVPQSASTVTIKQATRYNLLPSGSELHVGKIFVSKQD
                                         ++HA PLT      VEPSS++P NV   QIPVPQS STVTI+QA RYNLLP GSELHVGKIFVSKQD
Subjt:  VPSDEINNTAQVQHASSRTEHVFPFDELNNTVYVQHARPLTDHVFPLVEPSSTEPGNVGDHQIPVPQSASTVTIKQATRYNLLPSGSELHVGKIFVSKQD

Query:  LRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCMIDTVNHDHKQASSWVVANLIKDRVAGTDRIYKIKHIKE
        LRMVLSNAAM+SNREYKVS+STKSKF VRCIDNTCNWRV AHSVGKSS+FCISKYVDAHTC ID+VNHDHKQASSWVVANLIKD VAGT RIYKIKHIKE
Subjt:  LRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCMIDTVNHDHKQASSWVVANLIKDRVAGTDRIYKIKHIKE

Query:  DVRKEFG---------------------------------------------------------------------------------------------
        DVRKE+G                                                                                             
Subjt:  DVRKEFG---------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRY
                                                            DA YAYRKSQFTYYWNQILS+GSG+LAKYLQEIG+ERWARCYQVGRRY
Subjt:  ----------------------------------------------------DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRY

Query:  ENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDG
        EN TTNSAESVNALLR+ARELPITKIVEFIR+LLQRWFHERR HWSTQNTSHSDYAEERLA+QFEKSRRYTVKPVDWCMF VEDG
Subjt:  ENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDG

A0A6J1DN69 uncharacterized protein LOC1110221401.7e-11154.42Show/hide
Query:  GSELHVGKIFVSKQDLRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCMIDTVNHDHKQASSWVVANLIKDR
        GSELHVGKIFVSKQDL MV+SNAAM+SNREYKVS+STKSKFVVRCI+NTCN RV  HSVGKSS+FCISKYVDAHTCMIDTVNHDHKQASS +VANLIKDR
Subjt:  GSELHVGKIFVSKQDLRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCMIDTVNHDHKQASSWVVANLIKDR

Query:  VAGTDRIYKIKHIKEDVRKEFG------------------------------------------------------------------------------
        VAG  RIY IKHIKEDVRKEFG                                                                              
Subjt:  VAGTDRIYKIKHIKEDVRKEFG------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEI
                                                                           DA YAYRKSQFTYYWNQILSVGSG+LAKYLQEI
Subjt:  -------------------------------------------------------------------DAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEI

Query:  GVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDG
        GVERWARCYQVGRRYENMTTNS +S                             ERRTHWSTQNTSHSDYA+ERLALQFEKSRRYTVKPVDWCMF VED 
Subjt:  GVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDG

Query:  GLDGTVDLNARTCTCMEFQYMGIPCSHAIA
        GLDGTVDLNA TCTCMEFQYMGIPCSHAIA
Subjt:  GLDGTVDLNARTCTCMEFQYMGIPCSHAIA

A0A6J1DQ99 uncharacterized protein LOC1110232351.2e-10492.46Show/hide
Query:  KEFGDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ
        K + DA YAYRKSQFTYYWNQILSVGSG+LAKYLQEIG+ERWARCYQVGRRYENMTTNSAESVNALLR+ARELPITKIVEFI DLLQRWFHERRTHWSTQ
Subjt:  KEFGDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQ

Query:  NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP
        NTSHSDYAEERLALQFEKSRRYTVKPVDWCMF VED GLD TVDLNARTCTCMEFQYMGIPCSHAIA  RHKNINCHTLIDPCY+VDSLIGAYAE ILP
Subjt:  NTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP

A0A6J1E2V3 uncharacterized protein LOC1110255302.9e-14362.22Show/hide
Query:  IPVPQSASTVTIKQATRYNLLPSGSELHVGKIFVSKQDLRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCM
        IP+  S S+   ++  R     S S+L++G+I   K +L   +       NREYK S+STKSKFVVRCIDNTCNWRV AHSVGKSS+FC SKYVDAHTC 
Subjt:  IPVPQSASTVTIKQATRYNLLPSGSELHVGKIFVSKQDLRMVLSNAAMQSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCM

Query:  IDTVNHDHKQASSWVVANLIKDRVAGTDRIYKIKHIKEDVRKEF--------------------------------------------------------
        IDTVNHDHKQASSWVVANLIKDRVA T RIYKIKHIKEDVR+EF                                                        
Subjt:  IDTVNHDHKQASSWVVANLIKDRVAGTDRIYKIKHIKEDVRKEF--------------------------------------------------------

Query:  ------------------------------------------------------GDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVG
                                                               DA YAYRKSQFTYYWNQILSVGSG+LAKYLQEIGVERWARCYQVG
Subjt:  ------------------------------------------------------GDAVYAYRKSQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVG

Query:  RRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNART
        RRYENMTTNSAESVN LLR+A ELPITKIVEFIRDLLQRWFH RRTHWSTQNTSHSDYAEE+LALQFEKSRRYTVKPVDWCMF VEDGGLDGTVDLNART
Subjt:  RRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFEKSRRYTVKPVDWCMFPVEDGGLDGTVDLNART

Query:  CTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP
        CTCMEFQYMGIPCSHAIA  RHKNINCHTLIDPCYSVDSLI AYAE ILP
Subjt:  CTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase1.1e-0642.37Show/hide
Query:  GTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP
        G V LN  TCTC EFQ    PC HA+AV     IN    +D CY+V+     Y+    P
Subjt:  GTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCATAACGGATGACGATGACCTTCGATTTGCATTAGTCCAAGAACAGGTTTCTAAAGTCCCACTGTTTGTATCGACCGTCCTTCGCGAAAGCATTGACCTA
CAGTTATCTAGATTAAACCAAGATGGAGAGCAATCCATTCCTAATGGAGAATCTGTACCGGAAGAAAGTTCCACGAGAGTAGATGAGTGGGGGTTGAACGACGTA
TACATGCAGGACATGTACAGTTCAGACGGTATATACACTCAAGACTCGGAATTGGTCACCCCCGCGACACGCATGCCTTCCGTAATGCACGTCCCATCAGATGAG
ATAAATAATACAGCACAAGTACAACATGCAAGTTCTCGTACTGAACATGTTTTCCCATTTGATGAATTAAATAATACAGTATATGTACAACATGCAAGGCCTTTG
ACAGACCATGTTTTCCCATTAGTAGAACCCTCATCCACTGAGCCCGGTAATGTTGGTGATCACCAAATACCAGTACCTCAATCCGCAAGCACCGTGACTATAAAA
CAGGCCACAAGGTACAACCTTTTACCAAGCGGATCGGAACTGCACGTGGGGAAGATATTCGTTTCAAAGCAAGACCTGCGTATGGTGTTGTCAAATGCAGCGATG
CAATCTAACCGTGAATACAAGGTCAGTAAGTCAACGAAATCAAAGTTCGTTGTTCGCTGCATCGACAATACATGCAATTGGAGGGTTGTAGCCCACTCCGTCGGT
AAATCTTCGATGTTTTGTATATCAAAGTATGTAGATGCACACACCTGCATGATTGACACTGTGAACCACGACCACAAACAGGCGAGCAGCTGGGTCGTAGCGAAT
CTTATTAAAGATAGAGTTGCTGGAACCGATCGTATTTACAAGATAAAGCATATTAAGGAGGATGTCCGTAAAGAGTTTGGGGATGCAGTGTATGCGTATCGGAAG
TCACAGTTCACGTACTACTGGAACCAGATACTATCGGTTGGATCGGGTACACTTGCCAAATACTTACAAGAAATTGGGGTAGAACGGTGGGCCCGATGCTACCAA
GTTGGTAGAAGATATGAAAACATGACGACAAACAGCGCTGAGTCGGTAAATGCCCTCCTTCGAAAGGCTAGAGAGTTACCTATCACTAAGATTGTCGAGTTCATC
CGCGACTTGCTACAAAGATGGTTCCACGAAAGAAGGACTCACTGGTCCACCCAAAACACCTCTCATTCAGACTATGCAGAAGAGCGACTTGCACTACAATTTGAG
AAGTCTCGTCGCTACACAGTCAAACCAGTCGACTGGTGCATGTTTCCTGTTGAGGACGGTGGCTTGGACGGGACGGTTGATTTGAATGCCCGTACATGTACATGC
ATGGAGTTCCAGTACATGGGCATTCCATGTTCGCACGCAATTGCAGTAGTGAGGCACAAGAATATAAATTGCCACACGTTGATCGATCCATGCTACAGTGTGGAC
TCCCTAATTGGTGCCTACGCCGAATCAATCTTACCGTCGGACGGTTCCTCTTCCGACGGAGCAACCAGTCCTATCTCAAACCATCCCTCCTCCTTTGCGGAGAAC
AGAGTGCCAAGTAACAGTCCTCTTCAACAAGCACGGGAAGAAAACGACCAGCTCAGGAGAGAGCTACGTCAAACGCAACACGAGCTCAACAACACTAGGTATAGG
TTAGCCCGGGTTGAAGAAACGCGGGAATTGCTGGAGGAAGTGCTGAAGGAGGAGAAGGAGGAACGACTTCGTCTGGAGGACAGGGTGGCTCTGTTACTGGCCCGT
CTACGCCGATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCATAACGGATGACGATGACCTTCGATTTGCATTAGTCCAAGAACAGGTTTCTAAAGTCCCACTGTTTGTATCGACCGTCCTTCGCGAAAGCATTGACCTA
CAGTTATCTAGATTAAACCAAGATGGAGAGCAATCCATTCCTAATGGAGAATCTGTACCGGAAGAAAGTTCCACGAGAGTAGATGAGTGGGGGTTGAACGACGTA
TACATGCAGGACATGTACAGTTCAGACGGTATATACACTCAAGACTCGGAATTGGTCACCCCCGCGACACGCATGCCTTCCGTAATGCACGTCCCATCAGATGAG
ATAAATAATACAGCACAAGTACAACATGCAAGTTCTCGTACTGAACATGTTTTCCCATTTGATGAATTAAATAATACAGTATATGTACAACATGCAAGGCCTTTG
ACAGACCATGTTTTCCCATTAGTAGAACCCTCATCCACTGAGCCCGGTAATGTTGGTGATCACCAAATACCAGTACCTCAATCCGCAAGCACCGTGACTATAAAA
CAGGCCACAAGGTACAACCTTTTACCAAGCGGATCGGAACTGCACGTGGGGAAGATATTCGTTTCAAAGCAAGACCTGCGTATGGTGTTGTCAAATGCAGCGATG
CAATCTAACCGTGAATACAAGGTCAGTAAGTCAACGAAATCAAAGTTCGTTGTTCGCTGCATCGACAATACATGCAATTGGAGGGTTGTAGCCCACTCCGTCGGT
AAATCTTCGATGTTTTGTATATCAAAGTATGTAGATGCACACACCTGCATGATTGACACTGTGAACCACGACCACAAACAGGCGAGCAGCTGGGTCGTAGCGAAT
CTTATTAAAGATAGAGTTGCTGGAACCGATCGTATTTACAAGATAAAGCATATTAAGGAGGATGTCCGTAAAGAGTTTGGGGATGCAGTGTATGCGTATCGGAAG
TCACAGTTCACGTACTACTGGAACCAGATACTATCGGTTGGATCGGGTACACTTGCCAAATACTTACAAGAAATTGGGGTAGAACGGTGGGCCCGATGCTACCAA
GTTGGTAGAAGATATGAAAACATGACGACAAACAGCGCTGAGTCGGTAAATGCCCTCCTTCGAAAGGCTAGAGAGTTACCTATCACTAAGATTGTCGAGTTCATC
CGCGACTTGCTACAAAGATGGTTCCACGAAAGAAGGACTCACTGGTCCACCCAAAACACCTCTCATTCAGACTATGCAGAAGAGCGACTTGCACTACAATTTGAG
AAGTCTCGTCGCTACACAGTCAAACCAGTCGACTGGTGCATGTTTCCTGTTGAGGACGGTGGCTTGGACGGGACGGTTGATTTGAATGCCCGTACATGTACATGC
ATGGAGTTCCAGTACATGGGCATTCCATGTTCGCACGCAATTGCAGTAGTGAGGCACAAGAATATAAATTGCCACACGTTGATCGATCCATGCTACAGTGTGGAC
TCCCTAATTGGTGCCTACGCCGAATCAATCTTACCGTCGGACGGTTCCTCTTCCGACGGAGCAACCAGTCCTATCTCAAACCATCCCTCCTCCTTTGCGGAGAAC
AGAGTGCCAAGTAACAGTCCTCTTCAACAAGCACGGGAAGAAAACGACCAGCTCAGGAGAGAGCTACGTCAAACGCAACACGAGCTCAACAACACTAGGTATAGG
TTAGCCCGGGTTGAAGAAACGCGGGAATTGCTGGAGGAAGTGCTGAAGGAGGAGAAGGAGGAACGACTTCGTCTGGAGGACAGGGTGGCTCTGTTACTGGCCCGT
CTACGCCGATAA
Protein sequenceShow/hide protein sequence
MFITDDDDLRFALVQEQVSKVPLFVSTVLRESIDLQLSRLNQDGEQSIPNGESVPEESSTRVDEWGLNDVYMQDMYSSDGIYTQDSELVTPATRMPSVMHVPSDE
INNTAQVQHASSRTEHVFPFDELNNTVYVQHARPLTDHVFPLVEPSSTEPGNVGDHQIPVPQSASTVTIKQATRYNLLPSGSELHVGKIFVSKQDLRMVLSNAAM
QSNREYKVSKSTKSKFVVRCIDNTCNWRVVAHSVGKSSMFCISKYVDAHTCMIDTVNHDHKQASSWVVANLIKDRVAGTDRIYKIKHIKEDVRKEFGDAVYAYRK
SQFTYYWNQILSVGSGTLAKYLQEIGVERWARCYQVGRRYENMTTNSAESVNALLRKARELPITKIVEFIRDLLQRWFHERRTHWSTQNTSHSDYAEERLALQFE
KSRRYTVKPVDWCMFPVEDGGLDGTVDLNARTCTCMEFQYMGIPCSHAIAVVRHKNINCHTLIDPCYSVDSLIGAYAESILPSDGSSSDGATSPISNHPSSFAEN
RVPSNSPLQQAREENDQLRRELRQTQHELNNTRYRLARVEETRELLEEVLKEEKEERLRLEDRVALLLARLRR