CuGenDBv2

Lag0032387 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032387
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr11:31767726..31769148
RNA-Seq Expression: Lag0032387
Synteny: Lag0032387
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR005162 - Retrotransposon gag domain
  IPR036875 - Zinc finger, CCHC-type superfamily
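
The genome location field above uses a `chromosome:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field can be split into its parts and the locus span computed as follows. The names `Region` and `parse_location` are hypothetical helpers introduced for illustration only.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count toward the span.
        return self.end - self.start + 1


# Matches strings shaped like "chr11:31767726..31769148".
_LOCATION = re.compile(r"^(\w+):(\d+)\.\.(\d+)$")


def parse_location(text: str) -> Region:
    """Split a 'chrom:start..end' string into a Region."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Region(chrom, int(start), int(end))


region = parse_location("chr11:31767726..31769148")
print(region.chrom, region.start, region.end, region.length)
# prints: chr11 31767726 31769148 1423
```

Under the inclusive-coordinate assumption the locus spans 1423 bp on chr11.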


Homology
GenBank top hits (e value, % identity, alignment)
KAA0036945.1 gag protease polyprotein [Cucumis melo var. makuwa] (e value: 3.0e-36; % identity: 32.73)
Query:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV
        FKE F+ K++SA +++ K  EFL L+ G+ +VE+Y++EF  LSRFA  +V  EA +T++F+ G + D Q +V AL  A +A ALR A  +++   + +  
Subjt:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV

Query:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRN---------VGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSN-E
        A     S GQKRK +       Q +   R    R+           +E P+C  C R H G+CL G GV FRC + GH A  CP       R+PF +   
Subjt:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRN---------VGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSN-E

Query:  ASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV---------
          + SQ+G   A T ++A  +  +V G L +LGHYA         H  I              + S +++ ++      SK +++  ++ +         
Subjt:  ASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV---------

Query:  -----------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREVDFSI
                                PKV+SA+KA +L+S+G W  LASV + R  +  +SS  VV E+ DVFP++LPGLP  RE+DF+I
Subjt:  -----------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREVDFSI

KAA0037906.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.8e-36; % identity: 32.27)
Query:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV
        FKE F+ K++SA +++ K  EFL L+ G+ +VE+Y+ EF  LSRFA  +V  EA +T++F+ G + D Q +V AL  A +A ALR A  +++P  + +  
Subjt:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV

Query:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSNEASTN-SQKGS
        A     + GQKRK     +++Q     R         +E P C  C R H G+CL G GVCFRC + GH A  CP       R+PF +     + SQ+G 
Subjt:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSNEASTN-SQKGS

Query:  THAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV------------------
          A T ++A  +  VV GTL +LGHYA         H  I              + S++++ +       SK Q++  ++ +                  
Subjt:  THAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV------------------

Query:  -----------------------------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIR
                                                  PKV+SA+KA +L+S+G W  LASV + R  +  +SS  VV E+ DVFP++LPGLP  R
Subjt:  -----------------------------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIR

Query:  EVDFSI
        EVDF+I
Subjt:  EVDFSI

KAA0048134.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 6.1e-37; % identity: 34.88)
Query:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV
        FKE F+ K++SA +++ K  EFL L+ G+ +VE+Y+ EF  LSRFA  +V  EA +T++F+ G + D Q +V AL  + +A ALR A  +++   + +  
Subjt:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV

Query:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVG--------------KEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECP-KGINLDSRRP
        A   E + GQKRK +     +Q +    R+ RLR V               +E P C  C R H G+CL G GVCFRC +  H A  CP K       +P
Subjt:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVG--------------KEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECP-KGINLDSRRP

Query:  FSSNEASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRIMQSLI--------AMRSIIPATRRSKLQV------------QRDQI
        F+       SQ+G   A T ++   +  VV GTL +LG+YA         H  I    +         + S++  +  S  +V            +   +
Subjt:  FSSNEASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRIMQSLI--------AMRSIIPATRRSKLQV------------QRDQI

Query:  RVYPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREVDFSI
           PKV+ A+KA +L+S+G W  LASV + R  +  +SS  VV E+ DVFP++LPGLP  +EVDF+I
Subjt:  RVYPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREVDFSI

XP_022931734.1 uncharacterized protein LOC111437896 [Cucurbita moschata] (e value: 2.5e-43; % identity: 33.09)
Query:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMP---ATSA
        FKEA+  KYY  + R++ +  FL LK G+++VE+Y+LEF +L+RF  + V  E  K  RFI G + + Q  V     +DYA ALR AT M MP   A   
Subjt:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMP---ATSA

Query:  TPVAKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRP---FSSNEASTN
         PV    + + GQ+R+ +R+  +S +  + R   R R     +P CP C + HEG+C  G G CF CGK GH  A+CP   N +  RP   +     +T 
Subjt:  TPVAKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRP---FSSNEASTN

Query:  SQKGSTHAITSKKAGDSSAVVIGTLLVLGHYAHP-----------------------------------------------------------RIMQSLI
          +   HAIT++KA ++ AVV GTL +L H A                                                             +++ SLI
Subjt:  SQKGSTHAITSKKAGDSSAVVIGTLLVLGHYAHP-----------------------------------------------------------RIMQSLI

Query:  AM-----------------RSIIPATRR---------SKLQVQRDQIRVYPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPED
         +                 R++I    R            + + D  R  P+V++ALKAR+++++GAW  LASV+       ++SSV VV EF DVFPE+
Subjt:  AM-----------------RSIIPATRR---------SKLQVQRDQIRVYPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPED

Query:  LPGLPSIREVDFSI
        LPGLP  REVDF I
Subjt:  LPGLPSIREVDFSI

XP_031745635.1 uncharacterized protein LOC116406054 [Cucumis sativus] (e value: 4.6e-37; % identity: 36.12)
Query:  DCFKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSAT
        D FK+ F+ K++SA +R  K  EFL LK G  +VEEY+ EF  LSRFA ++V  E  +  RF+ G +D+ +  V AL     A ALR A  M++      
Subjt:  DCFKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSAT

Query:  PVAKKPEFSAGQKRKFDR--------------SFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRR
        P +     S+GQKRK ++              SF   QQ+S     T      +E+P+C  C +RH G+CL G  VC++C + GHMA  CP    L S  
Subjt:  PVAKKPEFSAGQKRKFDR--------------SFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRR

Query:  PFSSNEASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYAHPRIMQSLIAMRSIIPATRRSKLQVQRDQIRVYPKVVSALKARRLMSRGAWCFLASVS
          SS++     Q+G+  A    +A  +  VV G   +  ++A     +  +    +      S  + +     V PKV+SA+KA +L+S+G W  LASV 
Subjt:  PFSSNEASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYAHPRIMQSLIAMRSIIPATRRSKLQVQRDQIRVYPKVVSALKARRLMSRGAWCFLASVS

Query:  NARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREV
        + R  +T ++S  VV E+ DVFP+DLPGLP  REV
Subjt:  NARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T629 Gag protease polyprotein (e value: 1.5e-36; % identity: 32.73)
Query:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV
        FKE F+ K++SA +++ K  EFL L+ G+ +VE+Y++EF  LSRFA  +V  EA +T++F+ G + D Q +V AL  A +A ALR A  +++   + +  
Subjt:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV

Query:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRN---------VGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSN-E
        A     S GQKRK +       Q +   R    R+           +E P+C  C R H G+CL G GV FRC + GH A  CP       R+PF +   
Subjt:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRN---------VGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSN-E

Query:  ASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV---------
          + SQ+G   A T ++A  +  +V G L +LGHYA         H  I              + S +++ ++      SK +++  ++ +         
Subjt:  ASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV---------

Query:  -----------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREVDFSI
                                PKV+SA+KA +L+S+G W  LASV + R  +  +SS  VV E+ DVFP++LPGLP  RE+DF+I
Subjt:  -----------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREVDFSI

A0A5A7T6U3 Reverse transcriptase (e value: 8.6e-37; % identity: 32.27)
Query:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV
        FKE F+ K++SA +++ K  EFL L+ G+ +VE+Y+ EF  LSRFA  +V  EA +T++F+ G + D Q +V AL  A +A ALR A  +++P  + +  
Subjt:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV

Query:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSNEASTN-SQKGS
        A     + GQKRK     +++Q     R         +E P C  C R H G+CL G GVCFRC + GH A  CP       R+PF +     + SQ+G 
Subjt:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSNEASTN-SQKGS

Query:  THAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV------------------
          A T ++A  +  VV GTL +LGHYA         H  I              + S++++ +       SK Q++  ++ +                  
Subjt:  THAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV------------------

Query:  -----------------------------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIR
                                                  PKV+SA+KA +L+S+G W  LASV + R  +  +SS  VV E+ DVFP++LPGLP  R
Subjt:  -----------------------------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIR

Query:  EVDFSI
        EVDF+I
Subjt:  EVDFSI

A0A5A7TYM4 Ty3-gypsy retrotransposon protein (e value: 2.9e-37; % identity: 34.88)
Query:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV
        FKE F+ K++SA +++ K  EFL L+ G+ +VE+Y+ EF  LSRFA  +V  EA +T++F+ G + D Q +V AL  + +A ALR A  +++   + +  
Subjt:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV

Query:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVG--------------KEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECP-KGINLDSRRP
        A   E + GQKRK +     +Q +    R+ RLR V               +E P C  C R H G+CL G GVCFRC +  H A  CP K       +P
Subjt:  AKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVG--------------KEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECP-KGINLDSRRP

Query:  FSSNEASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRIMQSLI--------AMRSIIPATRRSKLQV------------QRDQI
        F+       SQ+G   A T ++   +  VV GTL +LG+YA         H  I    +         + S++  +  S  +V            +   +
Subjt:  FSSNEASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRIMQSLI--------AMRSIIPATRRSKLQV------------QRDQI

Query:  RVYPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREVDFSI
           PKV+ A+KA +L+S+G W  LASV + R  +  +SS  VV E+ DVFP++LPGLP  +EVDF+I
Subjt:  RVYPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPEDLPGLPSIREVDFSI

A0A5A7UJ81 Reverse transcriptase (e value: 4.2e-36; % identity: 31.81)
Query:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV
        FKE+F+ K++SA +++ K  EFL L+ G+ +VE+Y+ EF  LSRFA  +V  EA +T++F+ G + D Q +V AL  A +A ALR A  +++P  + +  
Subjt:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMPATSATPV

Query:  AKKPEFSAGQKRKFDRSFSKSQQNSQ------NRRSTRLRNVGK---EKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSNEA
        A     + GQKRK +     + Q +        R    L   G+   E P C  C R H G+CL G GVCFRC + GH A  CP       R+PF +   
Subjt:  AKKPEFSAGQKRKFDRSFSKSQQNSQ------NRRSTRLRNVGK---EKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRPFSSNEA

Query:  STN-SQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV---------
          + +Q+G   A T ++A  +  VV GTL +LGHYA         H  I              + S++++ +       SK Q++  ++ +         
Subjt:  STN-SQKGSTHAITSKKAGDSSAVVIGTLLVLGHYA---------HPRI--------------MQSLIAMRSIIPATRRSKLQVQRDQIRV---------

Query:  --------------------------------------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPE
                                                           PKV+SA+KA +L+S G W  LASV + R  +  +SS  VV E+ DVFP+
Subjt:  --------------------------------------------------YPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPE

Query:  DLPGLPSIREVDFSI
        +LPGLP  REVDF+I
Subjt:  DLPGLPSIREVDFSI

A0A6J1EV26 Reverse transcriptase (e value: 1.2e-43; % identity: 33.09)
Query:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMP---ATSA
        FKEA+  KYY  + R++ +  FL LK G+++VE+Y+LEF +L+RF  + V  E  K  RFI G + + Q  V     +DYA ALR AT M MP   A   
Subjt:  FKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKDDEQRVVGALALADYAAALRAATFMAMP---ATSA

Query:  TPVAKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRP---FSSNEASTN
         PV    + + GQ+R+ +R+  +S +  + R   R R     +P CP C + HEG+C  G G CF CGK GH  A+CP   N +  RP   +     +T 
Subjt:  TPVAKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPKGINLDSRRP---FSSNEASTN

Query:  SQKGSTHAITSKKAGDSSAVVIGTLLVLGHYAHP-----------------------------------------------------------RIMQSLI
          +   HAIT++KA ++ AVV GTL +L H A                                                             +++ SLI
Subjt:  SQKGSTHAITSKKAGDSSAVVIGTLLVLGHYAHP-----------------------------------------------------------RIMQSLI

Query:  AM-----------------RSIIPATRR---------SKLQVQRDQIRVYPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPED
         +                 R++I    R            + + D  R  P+V++ALKAR+++++GAW  LASV+       ++SSV VV EF DVFPE+
Subjt:  AM-----------------RSIIPATRR---------SKLQVQRDQIRVYPKVVSALKARRLMSRGAWCFLASVSNARVEKTRISSVLVVSEFMDVFPED

Query:  LPGLPSIREVDFSI
        LPGLP  REVDF I
Subjt:  LPGLPSIREVDFSI

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGATAAGATAAATTACTTATCACTTGTTGCATACTGTGTATCTTTTGCTTCAGCATTTTATCAAAATGAAGTGGAAGCTATTCAAAACCTCGTGGATACGATAAATTA
CTTATCACTTGTTGTAGATTGTTTTAAGGAAGCATTCTTCCTAAAGTATTACTCAGCGATTATCAGATACAGAAAGAAGGTAGAGTTCCTGGCCTTGAAGCATGGTGAAA
GGTCAGTGGAAGAGTATGAGCTGGAGTTCAAGCAGTTATCTCGCTTTGCCTCGAAAATAGTGGACACTGAAGCAAAGAAGACGAAGAGGTTCATCTGGGGCCCCAAGGAT
GATGAGCAAAGAGTGGTAGGGGCTCTTGCCCTAGCTGATTATGCGGCGGCCCTTCGAGCGGCCACGTTCATGGCCATGCCAGCTACGAGTGCAACCCCGGTGGCCAAGAA
GCCAGAATTCAGCGCAGGCCAAAAAAGGAAGTTTGATCGGAGTTTCTCCAAGTCGCAACAGAATTCTCAGAATCGACGATCCACGCGCCTAAGGAATGTAGGAAAGGAAA
AGCCCTTGTGTCCTAAGTGTGATCGACGTCATGAAGGTCAATGCTTGGATGGCATGGGAGTATGTTTCAGATGTGGAAAGGGTGGGCACATGGCAGCAGAGTGCCCCAAA
GGCATAAACTTAGATTCTAGACGGCCCTTCAGTTCAAATGAGGCCAGCACCAACAGCCAGAAGGGTTCAACCCATGCCATTACCAGCAAGAAGGCTGGTGATTCCAGTGC
AGTGGTGATAGGTACATTACTTGTTCTTGGGCACTATGCTCATCCGAGAATCATGCAATCATTGATTGCCATGAGAAGTATTATTCCAGCCACTAGAAGGAGCAAGCTTC
AAGTTCAGAGGGATCAAATCAGAGTCTATCCTAAGGTGGTATCTGCGTTAAAGGCAAGAAGACTCATGAGTCGAGGAGCGTGGTGCTTCTTAGCCAGTGTGTCAAATGCT
CGAGTTGAAAAAACGAGAATAAGTTCTGTACTGGTGGTCAGTGAGTTTATGGATGTTTTCCCTGAAGATCTTCCAGGTTTGCCTTCGATTCGAGAAGTGGATTTCAGCAT
CTAG
mRNA sequence
ATGGATAAGATAAATTACTTATCACTTGTTGCATACTGTGTATCTTTTGCTTCAGCATTTTATCAAAATGAAGTGGAAGCTATTCAAAACCTCGTGGATACGATAAATTA
CTTATCACTTGTTGTAGATTGTTTTAAGGAAGCATTCTTCCTAAAGTATTACTCAGCGATTATCAGATACAGAAAGAAGGTAGAGTTCCTGGCCTTGAAGCATGGTGAAA
GGTCAGTGGAAGAGTATGAGCTGGAGTTCAAGCAGTTATCTCGCTTTGCCTCGAAAATAGTGGACACTGAAGCAAAGAAGACGAAGAGGTTCATCTGGGGCCCCAAGGAT
GATGAGCAAAGAGTGGTAGGGGCTCTTGCCCTAGCTGATTATGCGGCGGCCCTTCGAGCGGCCACGTTCATGGCCATGCCAGCTACGAGTGCAACCCCGGTGGCCAAGAA
GCCAGAATTCAGCGCAGGCCAAAAAAGGAAGTTTGATCGGAGTTTCTCCAAGTCGCAACAGAATTCTCAGAATCGACGATCCACGCGCCTAAGGAATGTAGGAAAGGAAA
AGCCCTTGTGTCCTAAGTGTGATCGACGTCATGAAGGTCAATGCTTGGATGGCATGGGAGTATGTTTCAGATGTGGAAAGGGTGGGCACATGGCAGCAGAGTGCCCCAAA
GGCATAAACTTAGATTCTAGACGGCCCTTCAGTTCAAATGAGGCCAGCACCAACAGCCAGAAGGGTTCAACCCATGCCATTACCAGCAAGAAGGCTGGTGATTCCAGTGC
AGTGGTGATAGGTACATTACTTGTTCTTGGGCACTATGCTCATCCGAGAATCATGCAATCATTGATTGCCATGAGAAGTATTATTCCAGCCACTAGAAGGAGCAAGCTTC
AAGTTCAGAGGGATCAAATCAGAGTCTATCCTAAGGTGGTATCTGCGTTAAAGGCAAGAAGACTCATGAGTCGAGGAGCGTGGTGCTTCTTAGCCAGTGTGTCAAATGCT
CGAGTTGAAAAAACGAGAATAAGTTCTGTACTGGTGGTCAGTGAGTTTATGGATGTTTTCCCTGAAGATCTTCCAGGTTTGCCTTCGATTCGAGAAGTGGATTTCAGCAT
CTAG
Protein sequence
MDKINYLSLVAYCVSFASAFYQNEVEAIQNLVDTINYLSLVVDCFKEAFFLKYYSAIIRYRKKVEFLALKHGERSVEEYELEFKQLSRFASKIVDTEAKKTKRFIWGPKD
DEQRVVGALALADYAAALRAATFMAMPATSATPVAKKPEFSAGQKRKFDRSFSKSQQNSQNRRSTRLRNVGKEKPLCPKCDRRHEGQCLDGMGVCFRCGKGGHMAAECPK
GINLDSRRPFSSNEASTNSQKGSTHAITSKKAGDSSAVVIGTLLVLGHYAHPRIMQSLIAMRSIIPATRRSKLQVQRDQIRVYPKVVSALKARRLMSRGAWCFLASVSNA
RVEKTRISSVLVVSEFMDVFPEDLPGLPSIREVDFSI
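
A CDS and its protein translation are linked by simple arithmetic: three nucleotides per residue, plus one terminal stop codon. The sketch below is a hedged illustration of how the CDS and protein blocks of a record like this one could be cross-checked after copying them out of the page. It assumes the standard genetic code stop codons (TAA, TAG, TGA) and a CDS that includes its stop codon; `check_cds` is a hypothetical helper, not part of any database tooling.

```python
from typing import List

# Stop codons of the standard genetic code (an assumption for this sketch).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds: str, protein: str) -> List[str]:
    """Return a list of inconsistencies between a CDS and its protein."""
    # Line-wrapped sequences are accepted: all whitespace is removed first.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not begin with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")
    # One codon per residue, plus one codon for the stop.
    if len(cds) != 3 * (len(protein) + 1):
        problems.append("CDS length does not match protein length")
    return problems


# Toy input built from the first three codons of the CDS above plus a stop.
print(check_cds("ATGGATAAG\nTAG", "MDK"))
# prints: []
```

An empty list means the basic length, start codon and stop codon checks all pass; it does not verify the translation residue by residue.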