CuGenDBv2

Moc09g27330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g27330
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr9:20481433..20483782
RNA-Seq Expression: Moc09g27330
Synteny: Moc09g27330
Gene Ontology terms:
GO:0006313 - transposition, DNA-mediated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004803 - transposase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR007527 - Zinc finger, SWIM-type
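
The genome location in this record is written as a `chromosome:start..end` string. As a minimal sketch, the Python below splits such a string into its parts. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
    return chrom, start, end, end - start + 1


print(parse_location("chr9:20481433..20483782"))
# ('chr9', 20481433, 20483782, 2350)
```

Under that assumption the gene spans 2350 bp on chr9.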


Homology
GenBank top hits | e value | % identity
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] | 2.8e-65 | 34.14
Query:  EGDYEAEFVNNDYDDALD---EESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQ---
        EGD E E+ N    D LD   E  K  +    AE   D   V +M  + + GQ   ++LQ +VQS+ T+DVKE + FD+KKEL ++MHL+A++ NFQ   
Subjt:  EGDYEAEFVNNDYDDALD---EESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQ---

Query:  -------------------------------FKASQELGGRTSCP------------------------SEVYRSF--PDIIQDMRKEYGVNLSYDKAWR
                                       FK  + +   + C                         ++V R++   DI+QD+R+EYGVN+SYDKAWR
Subjt:  -------------------------------FKASQELGGRTSCP------------------------SEVYRSF--PDIIQDMRKEYGVNLSYDKAWR

Query:  SSEEALQLIRGDPPSSYGLLPAYG----------------------------------------------------------------------------
        SSEEAL+LIRGDP SSY LLPAYG                                                                            
Subjt:  SSEEALQLIRGDPPSSYGLLPAYG----------------------------------------------------------------------------

Query:  --------------------------------------CEGILRVL-----------FQLNL-------------------------GSTW---CV-PWC
                                              C+ I  V             ++NL                            W   C  P  
Subjt:  --------------------------------------CEGILRVL-----------FQLNL-------------------------GSTW---CV-PWC

Query:  EEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNIDQF
         EYL+  GKERWARCFQT+LRY+QMT+N AES+N LFRHA +L +TALLDHI+G+LQ WFY+ RT ASSR +TLSDYAE   AE  D+ARRH+V+NIDQF
Subjt:  EEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNIDQF

Query:  HVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAI
        + +V DGNL+G VD   +TC+C +FDYFK+PC HAI
Subjt:  HVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAI

XP_022148136.1 uncharacterized protein LOC111016889 [Momordica charantia] | 8.3e-65 | 86.52
Query:  PWCEEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNI
        P   EYLDD GKERWA CFQTELRYTQMTSNNAES+N LFRH  RLPVTALLDHI+GLLQTWFYDRRT ASSRSTTLSDYAENK AEY D+ARRHVVVNI
Subjt:  PWCEEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNI

Query:  DQFHVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAIVG
        DQFHVQVRDGNLDGIVDFN RTCSC +FDYFKIPC HAI G
Subjt:  DQFHVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAIVG

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] | 1.2e-111 | 47.95
Query:  ESEGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQFKAS
        + EGDYEAEFVN+DYDDALDEES+PDVEQVHAEI  DEAAV+QMGCDGL GQ N E LQLIVQSS TNDVKEGE FDTKKELSLRMHLV M++NFQFK  
Subjt:  ESEGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQFKAS

Query:  QELGG-------RTSCP--------------------------------------------------SEVYRSF--PDIIQDMRKEYGVNLSYDKAWRSS
        +            TSC                                                   ++V R++   DIIQDMRKEYGVNLSYDKAWRSS
Subjt:  QELGG-------RTSCP--------------------------------------------------SEVYRSF--PDIIQDMRKEYGVNLSYDKAWRSS

Query:  EEALQLIRGDPPSSYGLLPAYG------------------------------------------------------------------------------
        EEAL+LIRGDP SSYGLLP YG                                                                              
Subjt:  EEALQLIRGDPPSSYGLLPAYG------------------------------------------------------------------------------

Query:  ------------------------------------CEGILRVL-----------FQLNL---------------------------GSTW----CVPWC
                                            C+ I +V             ++NL                            S W      P  
Subjt:  ------------------------------------CEGILRVL-----------FQLNL---------------------------GSTW----CVPWC

Query:  EEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNIDQF
         EYLDD GKERWARCFQTELRYTQMTSNNAES+N LFRHA +LPVTALLDHI+GLLQTWFYDRRT ASSRSTTLS YAENK AEYSD+ARRHVVVNIDQF
Subjt:  EEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNIDQF

Query:  HVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAI
        HVQVRDGNLDGIVDFN RTC+C +FDYFKIPC HAI
Subjt:  HVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAI

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia] | 6.4e-118 | 50.28
Query:  VFIKFGGEWNDSEKDYVGGRTRGLIV-------------------RPTYLILSFPSSSSNPYSSQQPHPYYGHLGHDIAGLTPLESGVVPCNLVDDRVCY
        VFI FGGEWNDSEKDYVGGR RGL V                   RPT  I SFPSSSSNP SSQQPH YYGHLGHDIAGLTPLES VVPCNL DDRVC 
Subjt:  VFIKFGGEWNDSEKDYVGGRTRGLIV-------------------RPTYLILSFPSSSSNPYSSQQPHPYYGHLGHDIAGLTPLESGVVPCNLVDDRVCY

Query:  WNVPRLWNDNEDESDESYDPLGES-EGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFD
        WN+P LWNDN+DESDESYD LG+S EGDYEAEF+N+DYDDA DE+ +PDVEQV  EIR DE  V QMGCDGLIGQPNDEKLQLIVQSS TNDVKEG+ FD
Subjt:  WNVPRLWNDNEDESDESYDPLGES-EGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFD

Query:  TKKELSLRMHLVAMQMNFQFKAS---------------------------------QELGGRTSCPSEV---------------------------YRSF
        TKKELSLR HLVAM +NFQFK                                   ++     +C  EV                           YR  
Subjt:  TKKELSLRMHLVAMQMNFQFKAS---------------------------------QELGGRTSCPSEV---------------------------YRSF

Query:  PDIIQDMRKEYGVNLSYDKAWRSSEEALQLIRGDPPSSYGLLPAYG------------------------------------------------------
         DIIQDMRKEYGVNLSYDKAW+S+EEAL+LIRGDP +SYGLLPAYG                                                      
Subjt:  PDIIQDMRKEYGVNLSYDKAWRSSEEALQLIRGDPPSSYGLLPAYG------------------------------------------------------

Query:  ----------CEGILRVL-----------FQLNL---------------------------GSTW----CVPWCEEYLDDNGKERWARCFQTELRYTQMT
                  C+ I +V             + NL                            S W      P   EYLDD GKERW RCFQTELRYTQMT
Subjt:  ----------CEGILRVL-----------FQLNL---------------------------GSTW----CVPWCEEYLDDNGKERWARCFQTELRYTQMT

Query:  SNNAESINTLFRHAHRLPVTALLDHIK
        SNNAES+N LFRHA  LPVTALLDHI+
Subjt:  SNNAESINTLFRHAHRLPVTALLDHIK

XP_022157017.1 uncharacterized protein LOC111023843 [Momordica charantia] | 1.2e-68 | 56.25
Query:  GHDIAGLTPLESGVVPCNLVDDRVCYWNVPRLWNDNEDESDESYDPLGES-EGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIG
        GHD+ GLTPL S VVPCNL DDRVC W+VP +WNDNEDES ESYDPL  S EG  +AE+ N ++DDALD+E + DVEQVH EIR DE AVR  GC+GL G
Subjt:  GHDIAGLTPLESGVVPCNLVDDRVCYWNVPRLWNDNEDESDESYDPLGES-EGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIG

Query:  QPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQFKASQE----------------------------------------LGG------
         PNDEKLQLIVQSS TNDV EG+ FD KKELSL+MHLVAM+ NFQFK  +                                          GG      
Subjt:  QPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQFKASQE----------------------------------------LGG------

Query:  --------------RTSCPSEVYRSFPDIIQDMRKEYGVNLSYDKAWRSSEEALQLIRGDPPSSYGLLPAYG
                      + +  S  YR   DIIQDMRKEYGVNLSYD+AWRSSEEAL+LIRGDP SSYGLLPAYG
Subjt:  --------------RTSCPSEVYRSFPDIIQDMRKEYGVNLSYDKAWRSSEEALQLIRGDPPSSYGLLPAYG

TrEMBL top hits | e value | % identity
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like | 1.4e-65 | 34.14
Query:  EGDYEAEFVNNDYDDALD---EESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQ---
        EGD E E+ N    D LD   E  K  +    AE   D   V +M  + + GQ   ++LQ +VQS+ T+DVKE + FD+KKEL ++MHL+A++ NFQ   
Subjt:  EGDYEAEFVNNDYDDALD---EESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQ---

Query:  -------------------------------FKASQELGGRTSCP------------------------SEVYRSF--PDIIQDMRKEYGVNLSYDKAWR
                                       FK  + +   + C                         ++V R++   DI+QD+R+EYGVN+SYDKAWR
Subjt:  -------------------------------FKASQELGGRTSCP------------------------SEVYRSF--PDIIQDMRKEYGVNLSYDKAWR

Query:  SSEEALQLIRGDPPSSYGLLPAYG----------------------------------------------------------------------------
        SSEEAL+LIRGDP SSY LLPAYG                                                                            
Subjt:  SSEEALQLIRGDPPSSYGLLPAYG----------------------------------------------------------------------------

Query:  --------------------------------------CEGILRVL-----------FQLNL-------------------------GSTW---CV-PWC
                                              C+ I  V             ++NL                            W   C  P  
Subjt:  --------------------------------------CEGILRVL-----------FQLNL-------------------------GSTW---CV-PWC

Query:  EEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNIDQF
         EYL+  GKERWARCFQT+LRY+QMT+N AES+N LFRHA +L +TALLDHI+G+LQ WFY+ RT ASSR +TLSDYAE   AE  D+ARRH+V+NIDQF
Subjt:  EEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNIDQF

Query:  HVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAI
        + +V DGNL+G VD   +TC+C +FDYFK+PC HAI
Subjt:  HVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAI

A0A6J1D4G8 uncharacterized protein LOC111016889 | 4.0e-65 | 86.52
Query:  PWCEEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNI
        P   EYLDD GKERWA CFQTELRYTQMTSNNAES+N LFRH  RLPVTALLDHI+GLLQTWFYDRRT ASSRSTTLSDYAENK AEY D+ARRHVVVNI
Subjt:  PWCEEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNI

Query:  DQFHVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAIVG
        DQFHVQVRDGNLDGIVDFN RTCSC +FDYFKIPC HAI G
Subjt:  DQFHVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAIVG

A0A6J1DJT1 uncharacterized protein LOC111020715 | 5.7e-112 | 47.95
Query:  ESEGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQFKAS
        + EGDYEAEFVN+DYDDALDEES+PDVEQVHAEI  DEAAV+QMGCDGL GQ N E LQLIVQSS TNDVKEGE FDTKKELSLRMHLV M++NFQFK  
Subjt:  ESEGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQFKAS

Query:  QELGG-------RTSCP--------------------------------------------------SEVYRSF--PDIIQDMRKEYGVNLSYDKAWRSS
        +            TSC                                                   ++V R++   DIIQDMRKEYGVNLSYDKAWRSS
Subjt:  QELGG-------RTSCP--------------------------------------------------SEVYRSF--PDIIQDMRKEYGVNLSYDKAWRSS

Query:  EEALQLIRGDPPSSYGLLPAYG------------------------------------------------------------------------------
        EEAL+LIRGDP SSYGLLP YG                                                                              
Subjt:  EEALQLIRGDPPSSYGLLPAYG------------------------------------------------------------------------------

Query:  ------------------------------------CEGILRVL-----------FQLNL---------------------------GSTW----CVPWC
                                            C+ I +V             ++NL                            S W      P  
Subjt:  ------------------------------------CEGILRVL-----------FQLNL---------------------------GSTW----CVPWC

Query:  EEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNIDQF
         EYLDD GKERWARCFQTELRYTQMTSNNAES+N LFRHA +LPVTALLDHI+GLLQTWFYDRRT ASSRSTTLS YAENK AEYSD+ARRHVVVNIDQF
Subjt:  EEYLDDNGKERWARCFQTELRYTQMTSNNAESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNIDQF

Query:  HVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAI
        HVQVRDGNLDGIVDFN RTC+C +FDYFKIPC HAI
Subjt:  HVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAI

A0A6J1DP00 uncharacterized protein LOC111022954 | 3.1e-118 | 50.28
Query:  VFIKFGGEWNDSEKDYVGGRTRGLIV-------------------RPTYLILSFPSSSSNPYSSQQPHPYYGHLGHDIAGLTPLESGVVPCNLVDDRVCY
        VFI FGGEWNDSEKDYVGGR RGL V                   RPT  I SFPSSSSNP SSQQPH YYGHLGHDIAGLTPLES VVPCNL DDRVC 
Subjt:  VFIKFGGEWNDSEKDYVGGRTRGLIV-------------------RPTYLILSFPSSSSNPYSSQQPHPYYGHLGHDIAGLTPLESGVVPCNLVDDRVCY

Query:  WNVPRLWNDNEDESDESYDPLGES-EGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFD
        WN+P LWNDN+DESDESYD LG+S EGDYEAEF+N+DYDDA DE+ +PDVEQV  EIR DE  V QMGCDGLIGQPNDEKLQLIVQSS TNDVKEG+ FD
Subjt:  WNVPRLWNDNEDESDESYDPLGES-EGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFD

Query:  TKKELSLRMHLVAMQMNFQFKAS---------------------------------QELGGRTSCPSEV---------------------------YRSF
        TKKELSLR HLVAM +NFQFK                                   ++     +C  EV                           YR  
Subjt:  TKKELSLRMHLVAMQMNFQFKAS---------------------------------QELGGRTSCPSEV---------------------------YRSF

Query:  PDIIQDMRKEYGVNLSYDKAWRSSEEALQLIRGDPPSSYGLLPAYG------------------------------------------------------
         DIIQDMRKEYGVNLSYDKAW+S+EEAL+LIRGDP +SYGLLPAYG                                                      
Subjt:  PDIIQDMRKEYGVNLSYDKAWRSSEEALQLIRGDPPSSYGLLPAYG------------------------------------------------------

Query:  ----------CEGILRVL-----------FQLNL---------------------------GSTW----CVPWCEEYLDDNGKERWARCFQTELRYTQMT
                  C+ I +V             + NL                            S W      P   EYLDD GKERW RCFQTELRYTQMT
Subjt:  ----------CEGILRVL-----------FQLNL---------------------------GSTW----CVPWCEEYLDDNGKERWARCFQTELRYTQMT

Query:  SNNAESINTLFRHAHRLPVTALLDHIK
        SNNAES+N LFRHA  LPVTALLDHI+
Subjt:  SNNAESINTLFRHAHRLPVTALLDHIK

A0A6J1DTG5 uncharacterized protein LOC111023843 | 6.0e-69 | 56.25
Query:  GHDIAGLTPLESGVVPCNLVDDRVCYWNVPRLWNDNEDESDESYDPLGES-EGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIG
        GHD+ GLTPL S VVPCNL DDRVC W+VP +WNDNEDES ESYDPL  S EG  +AE+ N ++DDALD+E + DVEQVH EIR DE AVR  GC+GL G
Subjt:  GHDIAGLTPLESGVVPCNLVDDRVCYWNVPRLWNDNEDESDESYDPLGES-EGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIG

Query:  QPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQFKASQE----------------------------------------LGG------
         PNDEKLQLIVQSS TNDV EG+ FD KKELSL+MHLVAM+ NFQFK  +                                          GG      
Subjt:  QPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQFKASQE----------------------------------------LGG------

Query:  --------------RTSCPSEVYRSFPDIIQDMRKEYGVNLSYDKAWRSSEEALQLIRGDPPSSYGLLPAYG
                      + +  S  YR   DIIQDMRKEYGVNLSYD+AWRSSEEAL+LIRGDP SSYGLLPAYG
Subjt:  --------------RTSCPSEVYRSFPDIIQDMRKEYGVNLSYDKAWRSSEEALQLIRGDPPSSYGLLPAYG

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGTTTCGGACCAAGACTAGTTCGAGACGGATGCCTCATGTTTTCATAAAATTTGGTGGAGAATGGAACGATAGTGAAAAAGATTATGTTGGTGGTCGCACGAGAGGATT
GATAGTGAGACCCACATATCTGATTCTCTCATTTCCTTCCTCATCATCGAACCCATATTCTTCCCAACAACCACACCCCTACTACGGGCATTTAGGTCATGATATAGCGG
GTTTAACACCATTGGAATCAGGTGTTGTTCCATGTAACCTAGTTGATGACAGGGTATGTTATTGGAATGTGCCGAGATTATGGAATGATAATGAAGATGAAAGTGATGAA
TCATATGACCCATTGGGCGAGTCAGAAGGAGATTATGAAGCAGAATTTGTCAATAATGACTATGACGATGCACTTGATGAAGAGTCTAAGCCCGATGTAGAACAGGTACA
CGCTGAGATTCGTAGTGATGAAGCAGCGGTCCGACAAATGGGGTGTGATGGTCTCATTGGACAGCCTAATGATGAGAAGTTGCAACTCATAGTACAGTCTTCTGTAACAA
ATGATGTTAAGGAGGGTGAAGCATTTGATACAAAGAAGGAGTTGAGTTTGAGAATGCATTTAGTTGCAATGCAGATGAATTTTCAGTTTAAAGCAAGCCAAGAGTTGGGT
GGTCGGACATCTTGTCCAAGCGAAGTTTATAGATCTTTCCCAGACATCATACAAGACATGAGGAAGGAGTATGGTGTCAATTTAAGTTATGACAAAGCTTGGCGGTCCAG
TGAAGAAGCACTTCAACTTATTAGAGGGGATCCACCATCGTCATACGGTCTACTACCAGCTTATGGCTGCGAAGGCATATTGAGAGTCTTATTTCAACTCAATCTGGGCT
CAACTTGGTGCGTACCCTGGTGTGAGGAGTATTTGGATGACAATGGGAAGGAGCGTTGGGCTCGTTGTTTCCAAACAGAATTGAGGTACACACAGATGACCAGTAATAAT
GCGGAGTCCATAAATACCCTCTTTAGACACGCGCATAGGTTGCCAGTAACTGCTTTATTGGACCATATCAAAGGTCTGTTACAGACTTGGTTCTACGATCGACGGACGTT
TGCCTCTTCCCGATCAACCACATTGTCCGACTACGCAGAAAATAAGTTTGCTGAATACTCGGACAGTGCCAGGAGACACGTTGTAGTGAACATCGACCAATTTCATGTCC
AGGTACGCGATGGCAACCTTGACGGGATTGTTGATTTCAACTTTAGGACATGTAGTTGTCCGCAGTTTGATTATTTTAAAATTCCATGCTTTCATGCTATTGTTGGGCGG
TAA
mRNA sequence
ATGTTTCGGACCAAGACTAGTTCGAGACGGATGCCTCATGTTTTCATAAAATTTGGTGGAGAATGGAACGATAGTGAAAAAGATTATGTTGGTGGTCGCACGAGAGGATT
GATAGTGAGACCCACATATCTGATTCTCTCATTTCCTTCCTCATCATCGAACCCATATTCTTCCCAACAACCACACCCCTACTACGGGCATTTAGGTCATGATATAGCGG
GTTTAACACCATTGGAATCAGGTGTTGTTCCATGTAACCTAGTTGATGACAGGGTATGTTATTGGAATGTGCCGAGATTATGGAATGATAATGAAGATGAAAGTGATGAA
TCATATGACCCATTGGGCGAGTCAGAAGGAGATTATGAAGCAGAATTTGTCAATAATGACTATGACGATGCACTTGATGAAGAGTCTAAGCCCGATGTAGAACAGGTACA
CGCTGAGATTCGTAGTGATGAAGCAGCGGTCCGACAAATGGGGTGTGATGGTCTCATTGGACAGCCTAATGATGAGAAGTTGCAACTCATAGTACAGTCTTCTGTAACAA
ATGATGTTAAGGAGGGTGAAGCATTTGATACAAAGAAGGAGTTGAGTTTGAGAATGCATTTAGTTGCAATGCAGATGAATTTTCAGTTTAAAGCAAGCCAAGAGTTGGGT
GGTCGGACATCTTGTCCAAGCGAAGTTTATAGATCTTTCCCAGACATCATACAAGACATGAGGAAGGAGTATGGTGTCAATTTAAGTTATGACAAAGCTTGGCGGTCCAG
TGAAGAAGCACTTCAACTTATTAGAGGGGATCCACCATCGTCATACGGTCTACTACCAGCTTATGGCTGCGAAGGCATATTGAGAGTCTTATTTCAACTCAATCTGGGCT
CAACTTGGTGCGTACCCTGGTGTGAGGAGTATTTGGATGACAATGGGAAGGAGCGTTGGGCTCGTTGTTTCCAAACAGAATTGAGGTACACACAGATGACCAGTAATAAT
GCGGAGTCCATAAATACCCTCTTTAGACACGCGCATAGGTTGCCAGTAACTGCTTTATTGGACCATATCAAAGGTCTGTTACAGACTTGGTTCTACGATCGACGGACGTT
TGCCTCTTCCCGATCAACCACATTGTCCGACTACGCAGAAAATAAGTTTGCTGAATACTCGGACAGTGCCAGGAGACACGTTGTAGTGAACATCGACCAATTTCATGTCC
AGGTACGCGATGGCAACCTTGACGGGATTGTTGATTTCAACTTTAGGACATGTAGTTGTCCGCAGTTTGATTATTTTAAAATTCCATGCTTTCATGCTATTGTTGGGCGG
TAA
Protein sequence
MFRTKTSSRRMPHVFIKFGGEWNDSEKDYVGGRTRGLIVRPTYLILSFPSSSSNPYSSQQPHPYYGHLGHDIAGLTPLESGVVPCNLVDDRVCYWNVPRLWNDNEDESDE
SYDPLGESEGDYEAEFVNNDYDDALDEESKPDVEQVHAEIRSDEAAVRQMGCDGLIGQPNDEKLQLIVQSSVTNDVKEGEAFDTKKELSLRMHLVAMQMNFQFKASQELG
GRTSCPSEVYRSFPDIIQDMRKEYGVNLSYDKAWRSSEEALQLIRGDPPSSYGLLPAYGCEGILRVLFQLNLGSTWCVPWCEEYLDDNGKERWARCFQTELRYTQMTSNN
AESINTLFRHAHRLPVTALLDHIKGLLQTWFYDRRTFASSRSTTLSDYAENKFAEYSDSARRHVVVNIDQFHVQVRDGNLDGIVDFNFRTCSCPQFDYFKIPCFHAIVGR
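
The protein sequence above is expected to be the conceptual translation of the CDS. The Python sketch below shows one way to check that, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical, and the example input is only the first 30 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT symbols
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS in this record.
print(translate("ATGTTTCGGACCAAGACTAGTTCGAGACGG"))  # MFRTKTSSRR
```

The printed residues match the start of the listed protein sequence (MFRTKTSSRR...). Passing the full CDS, with or without its line breaks, allows the same comparison over the whole sequence.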