CuGenDBv2

Moc05g28060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g28060
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr5:20571195..20577659
RNA-Seq Expression: Moc05g28060
Synteny: Moc05g28060
Gene Ontology terms:
GO:0006313 - transposition, DNA-mediated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004803 - transposase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR004332 - Transposase, MuDR, plant
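
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below parses such a string and reports the span length, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention). The names `Region` and `parse_location` are hypothetical and not part of any database API.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Region(seqid, start, end)


region = parse_location("chr5:20571195..20577659")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the locus spans more bases than the CDS listed further down, which would be consistent with the gene model containing introns.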


Homology
GenBank top hits
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]; e value: 5.2e-119; % identity: 56.39
Query:  MGEGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRV
        MGEGDDE EYGNEYASDRLDVQHEHEKVTIHNT+ EYPVD VHEMASNRVTGQSEGDRLQAMVQSA TDDVKE DVFDSKKELVM+MHLLALRKNFQF+V
Subjt:  MGEGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRV

Query:  KKSTPELYLLRCIDPTCTWRLRATKIRDCNL----------------------------------------------PKDIMQDIREEYGVNMSYDKAWR
        KKSTP+LYL+RCIDPTCTWRLR TKIRDCNL                                              PKDIMQDIREEYGVNMSYDKAWR
Subjt:  KKSTPELYLLRCIDPTCTWRLRATKIRDCNL----------------------------------------------PKDIMQDIREEYGVNMSYDKAWR

Query:  SSEEALRLIRRDPASSYGLLPAYGEA--------------------------------------------------------------------------
        SSEEALRLIR DPASSY LLPAYGEA                                                                          
Subjt:  SSEEALRLIRRDPASSYGLLPAYGEA--------------------------------------------------------------------------

Query:  -----------------------------------------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGV
                                                                   MNLLAKFKT ALE LFFK AKAFRESYFNENWVQL AHPGV
Subjt:  -----------------------------------------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGV

Query:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRG
        REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKL +TALLDHIRG
Subjt:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRG

XP_022142677.1 uncharacterized protein LOC111012733 [Momordica charantia]; e value: 1.3e-101; % identity: 54.05
Query:  VVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL--
        + EYPVDAVHEMA+NRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVM+MH  ALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL  
Subjt:  VVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL--

Query:  --------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYGEA-------
                                                    PKDIMQDIREEYGVNMSYDKAWR SEEALRLIR DPASSY LLPAYGEA       
Subjt:  --------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYGEA-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV
                                  MNLLAKFKTP LE LFFK AKAFRESYFNENWVQL AHP VREYLEAIGKERWARCFQTKLRYSQMTTNIAESV
Subjt:  --------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV

Query:  NALFRHARKLPVTALLDHIR
        NALFRHARKLPVTALLDHIR
Subjt:  NALFRHARKLPVTALLDHIR

XP_022151512.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]; e value: 7.8e-99; % identity: 52.73
Query:  VVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL--
        + EYPVD VHE A NR+TGQS  DRLQAMVQSAGT+DVKEGDVFDSKKELVM+MHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL  
Subjt:  VVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL--

Query:  --------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYGEA-------
                                                     KDIMQDIREEYGVNMSYDK W SSEEALRL R DPASSYGLL AYGEA       
Subjt:  --------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYGEA-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV
                                  +NLLAKFKTPA+EELFFK AKAFRESYFNENWVQL AHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV
Subjt:  --------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV

Query:  NALFRHARKLPVTALLDHIRG
        NALFRHARKLPVTALLDHIRG
Subjt:  NALFRHARKLPVTALLDHIRG

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]; e value: 6.8e-79; % identity: 42.6
Query:  EGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKK
        EGD E E+ N+   D LD + E +   +H  +      AV +M  + +TGQ   + LQ +VQS+GT+DVKEG+VFD+KKEL +RMHL+ +R NFQF+VKK
Subjt:  EGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKK

Query:  STPELYLLRCIDPTCTWRLRATKIRDCNL---------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSE
        STPELY+L C+D +CTWRLRATK+RDCNL                                             PKDI+QD+R+EYGVN+SYDKAWRSSE
Subjt:  STPELYLLRCIDPTCTWRLRATKIRDCNL---------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSE

Query:  EALRLIRRDPASSYGLLPAYGEA-----------------------------------------------------------------------------
        EALRLIR DPASSYGLLP YGEA                                                                             
Subjt:  EALRLIRRDPASSYGLLPAYGEA-----------------------------------------------------------------------------

Query:  --------------------------------------------------------MNLLAKFK--TPALEELFFKVAKAFRESYFNENWVQLYAHPGVR
                                                                MNLLAKFK    ALEELF K AKA+RESYFN  W QL A+PGVR
Subjt:  --------------------------------------------------------MNLLAKFK--TPALEELFFKVAKAFRESYFNENWVQLYAHPGVR

Query:  EYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRG
        EYL+ IGKERWARCFQT+LRY+QMT+N AESVNALFRHARKLPVTALLDHIRG
Subjt:  EYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRG

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia]; e value: 1.2e-94; % identity: 45.57
Query:  YFGHIGHDITSLTPLGSDVVPCNLGDDKAYDWDVPGLWNGSENVDEDSDESYRPMTDMGEGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVD--A
        Y+GH+GHDI  LTPL SDVVPCNLGDD+   W++PGLWN ++   ++SDESY  +    EGD E E+ N+   D  D + E +   +    VE   D   
Subjt:  YFGHIGHDITSLTPLGSDVVPCNLGDDKAYDWDVPGLWNGSENVDEDSDESYRPMTDMGEGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVD--A

Query:  VHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL----------
        V +M  + + GQ   ++LQ +VQS+GT+DVKEG VFD+KKEL +R HL+A+  NFQF+VKKSTPELY+LRC+D +CTWRLRA K+ DCNL          
Subjt:  VHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL----------

Query:  -----------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYG------------------
                                           PKDI+QD+R+EYGVN+SYDKAW+S+EEALRLIR DP +SYGLLPAYG                  
Subjt:  -----------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYG------------------

Query:  -----------------------------------------------------------------EAMNLLAKFK--TPALEELFFKVAKAFRESYFNEN
                                                                            NLLAKFK    ALEELF K AKA++ESYFN  
Subjt:  -----------------------------------------------------------------EAMNLLAKFK--TPALEELFFKVAKAFRESYFNEN

Query:  WVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIR
        W QL A+PG+REYL+ IGKERW RCFQT+LRY+QMT+N AESVNALFRHA  LPVTALLDHIR
Subjt:  WVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIR

TrEMBL top hits
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like; e value: 2.5e-119; % identity: 56.39
Query:  MGEGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRV
        MGEGDDE EYGNEYASDRLDVQHEHEKVTIHNT+ EYPVD VHEMASNRVTGQSEGDRLQAMVQSA TDDVKE DVFDSKKELVM+MHLLALRKNFQF+V
Subjt:  MGEGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRV

Query:  KKSTPELYLLRCIDPTCTWRLRATKIRDCNL----------------------------------------------PKDIMQDIREEYGVNMSYDKAWR
        KKSTP+LYL+RCIDPTCTWRLR TKIRDCNL                                              PKDIMQDIREEYGVNMSYDKAWR
Subjt:  KKSTPELYLLRCIDPTCTWRLRATKIRDCNL----------------------------------------------PKDIMQDIREEYGVNMSYDKAWR

Query:  SSEEALRLIRRDPASSYGLLPAYGEA--------------------------------------------------------------------------
        SSEEALRLIR DPASSY LLPAYGEA                                                                          
Subjt:  SSEEALRLIRRDPASSYGLLPAYGEA--------------------------------------------------------------------------

Query:  -----------------------------------------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGV
                                                                   MNLLAKFKT ALE LFFK AKAFRESYFNENWVQL AHPGV
Subjt:  -----------------------------------------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGV

Query:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRG
        REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKL +TALLDHIRG
Subjt:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRG

A0A6J1CNJ2 uncharacterized protein LOC111012733; e value: 6.2e-102; % identity: 54.05
Query:  VVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL--
        + EYPVDAVHEMA+NRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVM+MH  ALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL  
Subjt:  VVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL--

Query:  --------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYGEA-------
                                                    PKDIMQDIREEYGVNMSYDKAWR SEEALRLIR DPASSY LLPAYGEA       
Subjt:  --------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYGEA-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV
                                  MNLLAKFKTP LE LFFK AKAFRESYFNENWVQL AHP VREYLEAIGKERWARCFQTKLRYSQMTTNIAESV
Subjt:  --------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV

Query:  NALFRHARKLPVTALLDHIR
        NALFRHARKLPVTALLDHIR
Subjt:  NALFRHARKLPVTALLDHIR

A0A6J1DDQ3 protein FAR1-RELATED SEQUENCE 4-like; e value: 3.8e-99; % identity: 52.73
Query:  VVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL--
        + EYPVD VHE A NR+TGQS  DRLQAMVQSAGT+DVKEGDVFDSKKELVM+MHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL  
Subjt:  VVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL--

Query:  --------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYGEA-------
                                                     KDIMQDIREEYGVNMSYDK W SSEEALRL R DPASSYGLL AYGEA       
Subjt:  --------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYGEA-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV
                                  +NLLAKFKTPA+EELFFK AKAFRESYFNENWVQL AHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV
Subjt:  --------------------------MNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESV

Query:  NALFRHARKLPVTALLDHIRG
        NALFRHARKLPVTALLDHIRG
Subjt:  NALFRHARKLPVTALLDHIRG

A0A6J1DJT1 uncharacterized protein LOC111020715; e value: 3.3e-79; % identity: 42.6
Query:  EGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKK
        EGD E E+ N+   D LD + E +   +H  +      AV +M  + +TGQ   + LQ +VQS+GT+DVKEG+VFD+KKEL +RMHL+ +R NFQF+VKK
Subjt:  EGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKK

Query:  STPELYLLRCIDPTCTWRLRATKIRDCNL---------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSE
        STPELY+L C+D +CTWRLRATK+RDCNL                                             PKDI+QD+R+EYGVN+SYDKAWRSSE
Subjt:  STPELYLLRCIDPTCTWRLRATKIRDCNL---------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSSE

Query:  EALRLIRRDPASSYGLLPAYGEA-----------------------------------------------------------------------------
        EALRLIR DPASSYGLLP YGEA                                                                             
Subjt:  EALRLIRRDPASSYGLLPAYGEA-----------------------------------------------------------------------------

Query:  --------------------------------------------------------MNLLAKFK--TPALEELFFKVAKAFRESYFNENWVQLYAHPGVR
                                                                MNLLAKFK    ALEELF K AKA+RESYFN  W QL A+PGVR
Subjt:  --------------------------------------------------------MNLLAKFK--TPALEELFFKVAKAFRESYFNENWVQLYAHPGVR

Query:  EYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRG
        EYL+ IGKERWARCFQT+LRY+QMT+N AESVNALFRHARKLPVTALLDHIRG
Subjt:  EYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRG

A0A6J1DP00 uncharacterized protein LOC111022954; e value: 5.6e-95; % identity: 45.57
Query:  YFGHIGHDITSLTPLGSDVVPCNLGDDKAYDWDVPGLWNGSENVDEDSDESYRPMTDMGEGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVD--A
        Y+GH+GHDI  LTPL SDVVPCNLGDD+   W++PGLWN ++   ++SDESY  +    EGD E E+ N+   D  D + E +   +    VE   D   
Subjt:  YFGHIGHDITSLTPLGSDVVPCNLGDDKAYDWDVPGLWNGSENVDEDSDESYRPMTDMGEGDDEGEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVD--A

Query:  VHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL----------
        V +M  + + GQ   ++LQ +VQS+GT+DVKEG VFD+KKEL +R HL+A+  NFQF+VKKSTPELY+LRC+D +CTWRLRA K+ DCNL          
Subjt:  VHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNL----------

Query:  -----------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYG------------------
                                           PKDI+QD+R+EYGVN+SYDKAW+S+EEALRLIR DP +SYGLLPAYG                  
Subjt:  -----------------------------------PKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYG------------------

Query:  -----------------------------------------------------------------EAMNLLAKFK--TPALEELFFKVAKAFRESYFNEN
                                                                            NLLAKFK    ALEELF K AKA++ESYFN  
Subjt:  -----------------------------------------------------------------EAMNLLAKFK--TPALEELFFKVAKAFRESYFNEN

Query:  WVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIR
        W QL A+PG+REYL+ IGKERW RCFQT+LRY+QMT+N AESVNALFRHA  LPVTALLDHIR
Subjt:  WVQLYAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIR
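
The middle line of each alignment block above can be summarised numerically. The following is a minimal sketch, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assuming percentages are taken over the aligned columns. The database may compute its % identity differently, for example over the full query length, so this is not expected to reproduce the listed values. The function name `midline_stats` is hypothetical.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarise one alignment block from its query row and midline row.

    ASSUMPTION: standard BLAST midline semantics, with the denominator
    being the number of aligned columns (the query row length, gaps included).
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    # Trailing spaces are often stripped from scraped text, so pad the midline.
    midline = midline.ljust(columns)[:columns]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "columns": columns,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100.0 * identities / columns,
        "pct_positives": 100.0 * positives / columns,
    }


# A ten-column excerpt taken from the first GenBank alignment on this page.
stats = midline_stats("ARKLPVTALL", "ARKL +TALL")
print(stats)
```

To summarise a whole hit, the per-block counts would be summed across all blocks before dividing, rather than averaging the per-block percentages.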

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTGGATTGTTAGAAAAGATTGCTGTGCTAATGATCTTTCTTCAGAATCTCGGAACACAACGTCATCGACCTTGAAGCTCGACCAAGAGGTGACTTGGGCTAGGCGGCA
TCTTGAGTGGTCCTCAAAGGCGCGGAGTGGTTTGAGTAAAGGATGTGCAGGAGCGCTTTGCCCTGTTATTGGTGCGCTCGAATCCTTTCCAGTGCACTTAGTTGTGCCTG
ATTTTTTGACCTGGCCTATGCGGCTTATGTCGTATCAATGTCACATAAATGCCACGAACACCTTCGTCTCCAAGGTATTGGTTCTCTGCACACAGAAATCTAACACTTGG
CCAACACGTGTCCAATTTTGCTATTCCCTCAGCTTGTGCCCCCCCATTTGCAAAACCCTTATTTCCGAGACCCTCATTTTCAAGTCCGTCAGTTCCGTCCTCGTCGTCGA
ACCCCTCTTCTTCCCGCCCACCACCCCCTACTTTGGTCATATTGGTCATGATATAACATCTCTCACACCGTTAGGGTCAGATGTTGTTCCTTGTAATTTGGGAGATGATA
AGGCATATGATTGGGATGTGCCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGTGATGAATCATATCGTCCAATGACCGACATGGGAGAAGGAGACGACGAA
GGGGAATATGGAAATGAGTACGCCAGTGATCGACTTGATGTGCAACATGAGCATGAAAAGGTAACAATTCATAATACAGTGGTTGAATATCCTGTAGATGCCGTCCATGA
AATGGCAAGCAATAGAGTCACCGGTCAGTCAGAAGGTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGACCGATGATGTTAAGGAGGGTGACGTATTCGACTCAAAGA
AGGAACTAGTTATGAGAATGCATTTACTTGCATTGCGGAAGAACTTTCAGTTTCGAGTGAAGAAGTCTACGCCGGAACTATACTTGCTGCGATGCATCGATCCTACTTGC
ACGTGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGCCGAAGGACATCATGCAAGATATTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGGCCTGGCG
TTCGAGCGAAGAAGCACTCCGACTTATCAGAAGGGATCCAGCTTCATCGTACGGGCTACTACCCGCTTATGGAGAAGCCATGAACTTGCTGGCCAAATTTAAAACGCCCG
CGTTGGAGGAATTATTTTTTAAGGTTGCGAAGGCATTTCGCGAGTCATATTTCAATGAGAACTGGGTCCAACTGTACGCACACCCAGGAGTGAGGGAATATCTGGAAGCT
ATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAAGATACTCACAAATGACCACCAATATTGCAGAGTCTGTTAATGCCCTTTTCAGGCATGCACGTAAGTT
GCCAGTCACCGCATTACTTGATCATATCAGAGGTGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGA
mRNA sequence
ATGTGGATTGTTAGAAAAGATTGCTGTGCTAATGATCTTTCTTCAGAATCTCGGAACACAACGTCATCGACCTTGAAGCTCGACCAAGAGGTGACTTGGGCTAGGCGGCA
TCTTGAGTGGTCCTCAAAGGCGCGGAGTGGTTTGAGTAAAGGATGTGCAGGAGCGCTTTGCCCTGTTATTGGTGCGCTCGAATCCTTTCCAGTGCACTTAGTTGTGCCTG
ATTTTTTGACCTGGCCTATGCGGCTTATGTCGTATCAATGTCACATAAATGCCACGAACACCTTCGTCTCCAAGGTATTGGTTCTCTGCACACAGAAATCTAACACTTGG
CCAACACGTGTCCAATTTTGCTATTCCCTCAGCTTGTGCCCCCCCATTTGCAAAACCCTTATTTCCGAGACCCTCATTTTCAAGTCCGTCAGTTCCGTCCTCGTCGTCGA
ACCCCTCTTCTTCCCGCCCACCACCCCCTACTTTGGTCATATTGGTCATGATATAACATCTCTCACACCGTTAGGGTCAGATGTTGTTCCTTGTAATTTGGGAGATGATA
AGGCATATGATTGGGATGTGCCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGTGATGAATCATATCGTCCAATGACCGACATGGGAGAAGGAGACGACGAA
GGGGAATATGGAAATGAGTACGCCAGTGATCGACTTGATGTGCAACATGAGCATGAAAAGGTAACAATTCATAATACAGTGGTTGAATATCCTGTAGATGCCGTCCATGA
AATGGCAAGCAATAGAGTCACCGGTCAGTCAGAAGGTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGACCGATGATGTTAAGGAGGGTGACGTATTCGACTCAAAGA
AGGAACTAGTTATGAGAATGCATTTACTTGCATTGCGGAAGAACTTTCAGTTTCGAGTGAAGAAGTCTACGCCGGAACTATACTTGCTGCGATGCATCGATCCTACTTGC
ACGTGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGCCGAAGGACATCATGCAAGATATTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGGCCTGGCG
TTCGAGCGAAGAAGCACTCCGACTTATCAGAAGGGATCCAGCTTCATCGTACGGGCTACTACCCGCTTATGGAGAAGCCATGAACTTGCTGGCCAAATTTAAAACGCCCG
CGTTGGAGGAATTATTTTTTAAGGTTGCGAAGGCATTTCGCGAGTCATATTTCAATGAGAACTGGGTCCAACTGTACGCACACCCAGGAGTGAGGGAATATCTGGAAGCT
ATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAAGATACTCACAAATGACCACCAATATTGCAGAGTCTGTTAATGCCCTTTTCAGGCATGCACGTAAGTT
GCCAGTCACCGCATTACTTGATCATATCAGAGGTGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGA
Protein sequence
MWIVRKDCCANDLSSESRNTTSSTLKLDQEVTWARRHLEWSSKARSGLSKGCAGALCPVIGALESFPVHLVVPDFLTWPMRLMSYQCHINATNTFVSKVLVLCTQKSNTW
PTRVQFCYSLSLCPPICKTLISETLIFKSVSSVLVVEPLFFPPTTPYFGHIGHDITSLTPLGSDVVPCNLGDDKAYDWDVPGLWNGSENVDEDSDESYRPMTDMGEGDDE
GEYGNEYASDRLDVQHEHEKVTIHNTVVEYPVDAVHEMASNRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMRMHLLALRKNFQFRVKKSTPELYLLRCIDPTC
TWRLRATKIRDCNLPKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRRDPASSYGLLPAYGEAMNLLAKFKTPALEELFFKVAKAFRESYFNENWVQLYAHPGVREYLEA
IGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGGSTNVGRLLLHVRVRCLTTQRK
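
The relationship between the CDS and protein sequences above can be checked with a small translation routine. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 1 from the first base; the function name `translate` is hypothetical and the routine is not the database's own pipeline. It is applied here only to the first ten codons of the CDS.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), amino acids ordered by codon with
# each position cycling through T, C, A, G; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, a trailing partial codon is
    dropped, and codons with unrecognised bases are reported as 'X'.
    """
    seq = "".join(cds.split()).upper().replace("U", "T")
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the CDS sequence listed above.
print(translate("ATGTGGATTGTTAGAAAAGATTGCTGTGCT"))
```

Passing the complete CDS, with its line breaks left in, should allow a direct comparison against the protein sequence listed above.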