; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g16150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g16150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr2:12132380..12133964
RNA-Seq ExpressionMoc02g16150
SyntenyMoc02g16150
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032583.1 reverse transcriptase [Cucumis melo var. makuwa]2.9e-6636.08Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--
        MS++   GK+  DRL+E+EEQ+L L E+PD++RY++SRL+EIS K + ID V  R+ G  I++ M RV+ LE  V   R  N ERG+SS+ S+AH+EE  
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--

Query:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK
                                                                   RVK+PEPKPFCGARDAKALEN+IFDLEQY++AT+TVTEE+K
Subjt:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK

Query:  VTLATMHLADDASC------------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSN
        VTLATMHL++DA                                                        ++PWA+ KLYEQ+VQD+ +AYA  ERLFDLSN
Subjt:  VTLATMHLADDASC------------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSN

Query:  DAPPRNRYFSQ-----------------------------------------------------------------------------------------
        D+    R+ S                                                                                          
Subjt:  DAPPRNRYFSQ-----------------------------------------------------------------------------------------

Query:  -DVN-----DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA
         +VN     DNPRMGALKFLS LQ+K  E   P+ERGL+ V+ W+NQ+ TKSTMVD GATHNF+TE EA RLNLRW+KD G+MKA
Subjt:  -DVN-----DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA

KAA0060480.1 reverse transcriptase [Cucumis melo var. makuwa]2.9e-6636.82Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--
        MS++   GK+  DRLVE+EEQ+L L E+PD++RY++SRLDEIS K + ID V  R+ G  I++ M RV+ LE  +   R  N ERG+SS+ S+AH+EE  
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--

Query:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK
                                                                   RVK+PEPKPFCGARDAKALEN+IFDLEQY++AT+TVTEE+K
Subjt:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK

Query:  VTLATMHLADDASC---------------------------------------------------------------------------------LKPWA
        VTLATMHL++DA                                                                                   LKPWA
Subjt:  VTLATMHLADDASC---------------------------------------------------------------------------------LKPWA

Query:  RAKLYEQKVQDIATAYATTERLFDLSND--------------------APPR----NRYFSQDVN-----------------------------------
        + KLYEQ+VQD+ +AYA  ERLFDLSND                    + P+    +R F+ D                                     
Subjt:  RAKLYEQKVQDIATAYATTERLFDLSND--------------------APPR----NRYFSQDVN-----------------------------------

Query:  --DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA
          DNPRMGALKFLS LQ+K  E   P+ERGL+ V+ W+N++ TKSTMVD GATHNF+ E EA RLNLRW+KD G+MKA
Subjt:  --DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA

KAA0060640.1 uncharacterized protein E6C27_scaffold22G005260 [Cucumis melo var. makuwa]4.0e-6839.35Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSI-------
        MS++   GK+  DRLVE+EEQ+L L E+PD++RY++SRL+EIS K + ID V  R+ G  I++ M RV+ LE  +   R  N ERG+SS+ S        
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSI-------

Query:  --------AHMEERVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESKVTLATMHLADDASC--------------------------------
                A    RVK+PEPKPFCGARDAKALEN+I+D+EQY++AT+TVTEE+KVTLATMHL++DA                                  
Subjt:  --------AHMEERVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESKVTLATMHLADDASC--------------------------------

Query:  -------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSNDAPPRNRYFSQD-------
                                                         LKPWA+ KLYEQ+VQD+ +AYA  ERLFDLSND+    R+ S         
Subjt:  -------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSNDAPPRNRYFSQD-------

Query:  --------------------VN----------------------------------DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKST
                            +N                                  DNPRMGALKFLS LQ+K  E   P+ERGL+ V+ W+NQ+ TKST
Subjt:  --------------------VN----------------------------------DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKST

Query:  MVDFGATHNFMTETEACRLNLRWDKDPGKMKA
        MVD GATHNF+TE EA  LNLRW+KD G+MKA
Subjt:  MVDFGATHNFMTETEACRLNLRWDKDPGKMKA

TYK18566.1 uncharacterized protein E5676_scaffold119G00720 [Cucumis melo var. makuwa]4.9e-6636.48Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--
        MS++   GK+  DRLVE+EEQ+L L E+PD++RY++SRLDEIS K + ID V  R+ G  I++ M RV+ LE  +   R  N ERG+SS+ S+AH+EE  
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--

Query:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK
                                                                   RVK+PEPKPFCGARDAKALEN+IFDLEQY++AT+TVTEE+K
Subjt:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK

Query:  VTLATMHLADDASC---------------------------------------------------------------------------------LKPWA
        VTLATMHL++DA                                                                                   LKPWA
Subjt:  VTLATMHLADDASC---------------------------------------------------------------------------------LKPWA

Query:  RAKLYEQKVQDIATAYATTERLFDLSNDAPPRNRY---------------------------------------------FSQDVN--------------
        + KLYEQ+VQD+ +AYA  ERLFDLSND+    R+                                             F  D +              
Subjt:  RAKLYEQKVQDIATAYATTERLFDLSNDAPPRNRY---------------------------------------------FSQDVN--------------

Query:  --DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMK
          DNPRMGALKFLS LQ+K  E   P+ERGLI V+ W+N++ TKSTMVD GATHNF  E EA R NLRW+KD G+MK
Subjt:  --DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMK

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]7.8e-9649.79Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVKRANNLERGESSSSSIAHMEERVK-
        MS TKQLGKSH+DRLVEIEE+LL LREIPDNLRYV+SRLDEISTKADGIDVVNARI+GLAIR+ MLRVETLE KVKR +NLERGESSSSSIAHMEERV+ 
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVKRANNLERGESSSSSIAHMEERVK-

Query:  ----------------------VPE---------PKPFCGARDAKALENFIFDLEQYYKATSTVTEESKVTLATMHLADDASC-----------------
                              V E          KPFCGARDAKALENFIFDLEQY+KATSTVTEESKVTLATMHLADDA                   
Subjt:  ----------------------VPE---------PKPFCGARDAKALENFIFDLEQYYKATSTVTEESKVTLATMHLADDASC-----------------

Query:  -----------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSNDA-------------
                                                             LKPWARAKLYEQKVQDI TAYAT E+LF+LS+D              
Subjt:  -----------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSNDA-------------

Query:  --------------------------------------PPRN--------------------------RYFS--------------------QDVNDNPR
                                              PP N                          R F                     +D +DNPR
Subjt:  --------------------------------------PPRN--------------------------RYFS--------------------QDVNDNPR

Query:  MGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA
        MGALKFLS LQ+KAEEVKEPLERGL+ VEAWVNQ+A KSTMVD GATHNFMTETEA RLNL WDKDPGKMKA
Subjt:  MGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA

TrEMBL top hitse value%identityAlignment
A0A5A7SQC7 Reverse transcriptase1.4e-6636.08Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--
        MS++   GK+  DRL+E+EEQ+L L E+PD++RY++SRL+EIS K + ID V  R+ G  I++ M RV+ LE  V   R  N ERG+SS+ S+AH+EE  
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--

Query:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK
                                                                   RVK+PEPKPFCGARDAKALEN+IFDLEQY++AT+TVTEE+K
Subjt:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK

Query:  VTLATMHLADDASC------------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSN
        VTLATMHL++DA                                                        ++PWA+ KLYEQ+VQD+ +AYA  ERLFDLSN
Subjt:  VTLATMHLADDASC------------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSN

Query:  DAPPRNRYFSQ-----------------------------------------------------------------------------------------
        D+    R+ S                                                                                          
Subjt:  DAPPRNRYFSQ-----------------------------------------------------------------------------------------

Query:  -DVN-----DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA
         +VN     DNPRMGALKFLS LQ+K  E   P+ERGL+ V+ W+NQ+ TKSTMVD GATHNF+TE EA RLNLRW+KD G+MKA
Subjt:  -DVN-----DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA

A0A5A7UZE9 Reverse transcriptase1.4e-6636.82Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--
        MS++   GK+  DRLVE+EEQ+L L E+PD++RY++SRLDEIS K + ID V  R+ G  I++ M RV+ LE  +   R  N ERG+SS+ S+AH+EE  
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--

Query:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK
                                                                   RVK+PEPKPFCGARDAKALEN+IFDLEQY++AT+TVTEE+K
Subjt:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK

Query:  VTLATMHLADDASC---------------------------------------------------------------------------------LKPWA
        VTLATMHL++DA                                                                                   LKPWA
Subjt:  VTLATMHLADDASC---------------------------------------------------------------------------------LKPWA

Query:  RAKLYEQKVQDIATAYATTERLFDLSND--------------------APPR----NRYFSQDVN-----------------------------------
        + KLYEQ+VQD+ +AYA  ERLFDLSND                    + P+    +R F+ D                                     
Subjt:  RAKLYEQKVQDIATAYATTERLFDLSND--------------------APPR----NRYFSQDVN-----------------------------------

Query:  --DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA
          DNPRMGALKFLS LQ+K  E   P+ERGL+ V+ W+N++ TKSTMVD GATHNF+ E EA RLNLRW+KD G+MKA
Subjt:  --DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA

A0A5A7UZV3 Reverse transcriptase1.9e-6839.35Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSI-------
        MS++   GK+  DRLVE+EEQ+L L E+PD++RY++SRL+EIS K + ID V  R+ G  I++ M RV+ LE  +   R  N ERG+SS+ S        
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSI-------

Query:  --------AHMEERVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESKVTLATMHLADDASC--------------------------------
                A    RVK+PEPKPFCGARDAKALEN+I+D+EQY++AT+TVTEE+KVTLATMHL++DA                                  
Subjt:  --------AHMEERVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESKVTLATMHLADDASC--------------------------------

Query:  -------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSNDAPPRNRYFSQD-------
                                                         LKPWA+ KLYEQ+VQD+ +AYA  ERLFDLSND+    R+ S         
Subjt:  -------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSNDAPPRNRYFSQD-------

Query:  --------------------VN----------------------------------DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKST
                            +N                                  DNPRMGALKFLS LQ+K  E   P+ERGL+ V+ W+NQ+ TKST
Subjt:  --------------------VN----------------------------------DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKST

Query:  MVDFGATHNFMTETEACRLNLRWDKDPGKMKA
        MVD GATHNF+TE EA  LNLRW+KD G+MKA
Subjt:  MVDFGATHNFMTETEACRLNLRWDKDPGKMKA

A0A5D3D4V1 Retrotrans_gag domain-containing protein2.4e-6636.48Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--
        MS++   GK+  DRLVE+EEQ+L L E+PD++RY++SRLDEIS K + ID V  R+ G  I++ M RV+ LE  +   R  N ERG+SS+ S+AH+EE  
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVK--RANNLERGESSSSSIAHMEE--

Query:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK
                                                                   RVK+PEPKPFCGARDAKALEN+IFDLEQY++AT+TVTEE+K
Subjt:  -----------------------------------------------------------RVKVPEPKPFCGARDAKALENFIFDLEQYYKATSTVTEESK

Query:  VTLATMHLADDASC---------------------------------------------------------------------------------LKPWA
        VTLATMHL++DA                                                                                   LKPWA
Subjt:  VTLATMHLADDASC---------------------------------------------------------------------------------LKPWA

Query:  RAKLYEQKVQDIATAYATTERLFDLSNDAPPRNRY---------------------------------------------FSQDVN--------------
        + KLYEQ+VQD+ +AYA  ERLFDLSND+    R+                                             F  D +              
Subjt:  RAKLYEQKVQDIATAYATTERLFDLSNDAPPRNRY---------------------------------------------FSQDVN--------------

Query:  --DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMK
          DNPRMGALKFLS LQ+K  E   P+ERGLI V+ W+N++ TKSTMVD GATHNF  E EA R NLRW+KD G+MK
Subjt:  --DNPRMGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMK

A0A6J1DLQ6 uncharacterized protein LOC1110223203.8e-9649.79Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVKRANNLERGESSSSSIAHMEERVK-
        MS TKQLGKSH+DRLVEIEE+LL LREIPDNLRYV+SRLDEISTKADGIDVVNARI+GLAIR+ MLRVETLE KVKR +NLERGESSSSSIAHMEERV+ 
Subjt:  MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVKRANNLERGESSSSSIAHMEERVK-

Query:  ----------------------VPE---------PKPFCGARDAKALENFIFDLEQYYKATSTVTEESKVTLATMHLADDASC-----------------
                              V E          KPFCGARDAKALENFIFDLEQY+KATSTVTEESKVTLATMHLADDA                   
Subjt:  ----------------------VPE---------PKPFCGARDAKALENFIFDLEQYYKATSTVTEESKVTLATMHLADDASC-----------------

Query:  -----------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSNDA-------------
                                                             LKPWARAKLYEQKVQDI TAYAT E+LF+LS+D              
Subjt:  -----------------------------------------------------LKPWARAKLYEQKVQDIATAYATTERLFDLSNDA-------------

Query:  --------------------------------------PPRN--------------------------RYFS--------------------QDVNDNPR
                                              PP N                          R F                     +D +DNPR
Subjt:  --------------------------------------PPRN--------------------------RYFS--------------------QDVNDNPR

Query:  MGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA
        MGALKFLS LQ+KAEEVKEPLERGL+ VEAWVNQ+A KSTMVD GATHNFMTETEA RLNL WDKDPGKMKA
Subjt:  MGALKFLSVLQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGCGACAAAACAGTTGGGCAAGTCCCACGTCGACAGACTCGTCGAGATCGAAGAACAGCTGTTGTTATTGAGGGAAATCCCTGACAACCTTAGATATGTG
AAATCTCGGTTGGATGAGATCTCCACCAAAGCTGACGGAATTGATGTCGTAAATGCTCGCATAAATGGGCTTGCTATACGCAAGTTCATGCTTCGGGTTGAGACC
CTTGAAGGCAAGGTTAAGCGTGCTAATAACCTTGAGCGTGGCGAAAGCTCATCGAGCTCAATCGCCCACATGGAGGAGCGTGTGAAAGTTCCCGAACCCAAGCCT
TTCTGTGGAGCGCGAGATGCTAAAGCTCTTGAGAACTTCATCTTCGACCTTGAGCAGTACTACAAGGCGACAAGCACTGTGACAGAAGAATCGAAAGTCACACTA
GCCACAATGCATCTTGCTGACGATGCGAGTTGCTTGAAACCATGGGCTCGGGCTAAGCTGTATGAACAGAAAGTGCAAGATATCGCTACCGCTTATGCCACAACC
GAACGGCTATTCGATCTTAGCAACGATGCACCGCCCCGAAACAGATACTTCTCCCAAGACGTCAATGACAATCCTCGTATGGGAGCGCTCAAATTCTTATCGGTG
CTACAGAGGAAAGCTGAGGAGGTGAAGGAACCTCTGGAACGTGGTTTGATATGTGTGGAAGCGTGGGTCAATCAGAGAGCAACAAAAAGCACCATGGTAGATTTT
GGTGCCACCCACAATTTCATGACTGAAACCGAAGCATGTCGGTTGAACTTGCGATGGGATAAAGACCCAGGAAAGATGAAAGCTGCAACTTGGCCGCCCTCCCCA
TCATGGGAGTTGCCAAGAAAGTCTCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCGCGACAAAACAGTTGGGCAAGTCCCACGTCGACAGACTCGTCGAGATCGAAGAACAGCTGTTGTTATTGAGGGAAATCCCTGACAACCTTAGATATGTG
AAATCTCGGTTGGATGAGATCTCCACCAAAGCTGACGGAATTGATGTCGTAAATGCTCGCATAAATGGGCTTGCTATACGCAAGTTCATGCTTCGGGTTGAGACC
CTTGAAGGCAAGGTTAAGCGTGCTAATAACCTTGAGCGTGGCGAAAGCTCATCGAGCTCAATCGCCCACATGGAGGAGCGTGTGAAAGTTCCCGAACCCAAGCCT
TTCTGTGGAGCGCGAGATGCTAAAGCTCTTGAGAACTTCATCTTCGACCTTGAGCAGTACTACAAGGCGACAAGCACTGTGACAGAAGAATCGAAAGTCACACTA
GCCACAATGCATCTTGCTGACGATGCGAGTTGCTTGAAACCATGGGCTCGGGCTAAGCTGTATGAACAGAAAGTGCAAGATATCGCTACCGCTTATGCCACAACC
GAACGGCTATTCGATCTTAGCAACGATGCACCGCCCCGAAACAGATACTTCTCCCAAGACGTCAATGACAATCCTCGTATGGGAGCGCTCAAATTCTTATCGGTG
CTACAGAGGAAAGCTGAGGAGGTGAAGGAACCTCTGGAACGTGGTTTGATATGTGTGGAAGCGTGGGTCAATCAGAGAGCAACAAAAAGCACCATGGTAGATTTT
GGTGCCACCCACAATTTCATGACTGAAACCGAAGCATGTCGGTTGAACTTGCGATGGGATAAAGACCCAGGAAAGATGAAAGCTGCAACTTGGCCGCCCTCCCCA
TCATGGGAGTTGCCAAGAAAGTCTCAGTAA
Protein sequenceShow/hide protein sequence
MSATKQLGKSHVDRLVEIEEQLLLLREIPDNLRYVKSRLDEISTKADGIDVVNARINGLAIRKFMLRVETLEGKVKRANNLERGESSSSSIAHMEERVKVPEPKP
FCGARDAKALENFIFDLEQYYKATSTVTEESKVTLATMHLADDASCLKPWARAKLYEQKVQDIATAYATTERLFDLSNDAPPRNRYFSQDVNDNPRMGALKFLSV
LQRKAEEVKEPLERGLICVEAWVNQRATKSTMVDFGATHNFMTETEACRLNLRWDKDPGKMKAATWPPSPSWELPRKSQ