CuGenDBv2

Lag0035575 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035575
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposable element protein
Genome location: chr3:24396943..24404185
RNA-Seq Expression: Lag0035575
Synteny: Lag0035575
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
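
The genome location given above can be split into its parts and turned into a span length. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:24396943..24404185")
print(chrom, start, end, span_length(start, end))
# prints: chr3 24396943 24404185 7243
```

Under that assumption the gene spans 7,243 bp on chr3.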


Homology
GenBank top hits | e value | % identity | Alignment
XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo] | 5.1e-64 | 30.55
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        +PL+ ILEVE+FDVWG+DFMGPFPPS+GN+YIL+AVDYVS W+EA+A  +ND K V KF  +NIFTR+G PR +IS EG+HF+N+++ SL  +YN++HRV
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKA--------LITAQKIMLLSDWTEQILCCSKTGN--RTATSQLLANFMNR
        AT YHPQTNGQ E+   EIK+ILEK++ P  KDWS++ D+A+WA+RTA+K         L+  +   L  +   +     K  N   T    L    +N 
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKA--------LITAQKIMLLSDWTEQILCCSKTGN--RTATSQLLANFMNR

Query:  LLLSVILVHERS--------AWGRFELDPE------------------------------------IERTFRNRRREQRK---------TRWRTCRVFRR
        L     L +E +         W   ++ P+                                    ++   +N   E +K         T       F+ 
Subjt:  LLLSVILVHERS--------AWGRFELDPE------------------------------------IERTFRNRRREQRK---------TRWRTCRVFRR

Query:  VLKA---------------ENPILIANDRTRAIRAYDVPMFNVLNPG------IARPQIQAANFEMKPG-----------------DLAMIANALKNVTV
        +L+                 N + IA  +     A    +    N        IA    Q A+    PG                  LA + N L+N+ +
Subjt:  VLKA---------------ENPILIANDRTRAIRAYDVPMFNVLNPG------IARPQIQAANFEMKPG-----------------DLAMIANALKNVTV

Query:  ISHQQPPA-MEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSEL
               A +   AV+NQ   E+CVYCGE+H ++ CPSNPAS+F+VGNQ                                                   
Subjt:  ISHQQPPA-MEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSEL

Query:  ESGQGAGGSNKMLEHLVLCQIKAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPR---------------------------------
                                           D+LT +++  EF+ V L EECSAILKN +P +                                 
Subjt:  ESGQGAGGSNKMLEHLVLCQIKAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPR---------------------------------

Query:  ---LRIQGIGEARPTTVTLQLADRSITYLEDEMED
            +  GIGEARPTTVTLQLADRS TY E ++ED
Subjt:  ---LRIQGIGEARPTTVTLQLADRSITYLEDEMED

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] | 5.7e-55 | 67.97
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        +PL  ILEVE+FDVWG+DFMGPFPPSFG +YILLAVDYVS W+EA+AT TNDAKVV KF  KNIFTR+GTPR IIS EG+HF NK+  +L SKY ++H++
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI
        A AYHPQTNGQ E+S REIK ILEK ++   KDW+ + D+ALWA+RTA+K  I
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI

XP_023899824.1 LOW QUALITY PROTEIN: uncharacterized protein LOC112011709 [Quercus suber] | 3.7e-54 | 63.98
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        +PL  ILEVE+FDVWG+DFMGPFPPSFG +YILLAVDYVS W+EA+AT TND KVV KF  KNIFTR+GTPR IIS EG+HF NK+  +L SKY ++H++
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALITAQKIMLL
        A AYHPQTNGQ E+S REIK IL+K ++   KDW+ + D+ALWA+RTA+K  I      L+
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALITAQKIMLL

XP_030479372.1 uncharacterized protein LOC115696618 [Cannabis sativa] | 2.7e-57 | 68
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        MPL+ ILEVE+FDVWG+DFMGPFP SFGN+YIL+AVDYVS W+EA+A+  NDA+VV KF  K++FTR+GTPR +IS EG+HF+NK++A+L +KY+++H++
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYK
        ATAYHPQTNGQ E+S REIK ILEK+++P  KDWS R D+ALWA+RTAYK
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYK

XP_038902449.1 uncharacterized protein LOC120089096 [Benincasa hispida] | 1.7e-54 | 68.63
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        MPL+ ILEVE+FDVWG+DFM PFP S G  YILLAVDYVS W+EAVA A NDA  V KF  +NIFT YGTPR +IS EG+HF+N+II+ L +KYN+RH++
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI
        ATAYHPQTNGQTE+S REIK+ILEK ++ T KDW+ R D+ALWA+RTAYK  I
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI

TrEMBL top hits | e value | % identity | Alignment
A0A251UM01 Putative reverse transcriptase domain, Ribonuclease H-like domain protein | 2.3e-54 | 68.63
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        MPL  IL  EIFDVWG+DFMGPFP SFGN+YILLAVDYVS W+EA AT TND+KVV  F   NIF+R+GTP+  IS  GSHF N+ I +LF KY + HRV
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI
        +TAYHPQTNGQ E+S REIK+ILEK ++P  KDWSLR D+ALWA+RTAYK  I
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI

A0A2G9FWY3 Reverse transcriptase | 1.7e-52 | 66.01
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        MPL+ ILEVE+FDVWG+DFMGPF PSFGN+YIL+AVDYVS W+EA A   ND+KVV  F  KNIFTR+GTPR IIS  G+HF N+   +L SKY ++H++
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI
        +T YHPQT+GQ E+S REIK ILEK +  T KDWS R DEALWA+RTAYK  I
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI

A0A2G9G7U9 DNA-directed DNA polymerase | 2.9e-52 | 66.01
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        MPL  IL  EIFDVWG+DFMGPFPPS+GN YILLAVDYVS W+EA AT TNDAKVV  F   +IF R+G PR IIS  G+HF N+++  L  KY++ HRV
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI
        +TAYHPQTNGQ E+S RE+K+ILEK + P  KDWS R D+ALWA+RTAYK  I
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI

A0A5E4GGI6 PREDICTED: LOW QUALITY PROTEIN (Fragment) | 3.7e-52 | 67.32
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        +PL  IL VE+FDVWG+DFMGPFP SFG  YIL+AVDYVS W+EA+AT TND KVV KF   NIFTR+GTPR IIS  GSHF+N+  A+L  KY I H+V
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI
        AT YHPQT+GQ E+S REIK ILEK ++ T KDWS+R D+ALWA+RTAYK  I
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI

A0A6I9UKS2 uncharacterized protein LOC105179165 | 7.5e-53 | 67.32
Query:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV
        MPL  IL  EIFDVWG+DFMGPFP SFG  YI+L VDYVS WIEA AT T+DAK V  F   NIF+RYG PR IIS  G+HF NK++++LF KYN+ HRV
Subjt:  MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRV

Query:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI
        +TAYHPQTNGQ E+S REIK+ILEK ++P  KDWS+R D+ALWA+RTAYK  I
Subjt:  ATAYHPQTNGQTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALI

SwissProt top hits | e value | % identity | Alignment
A1Z651 Gag-Pol polyprotein | 2.3e-14 | 36.84
Query:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL
        W VDF    P  +G  Y+L+ VD  S W+EA  T    AKVV K  L++IF R+G P+ + S  G  F +++  S+     I  ++  AY PQ++GQ E 
Subjt:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL

Query:  STREIKAILEKI-MDPTFKDWSLRPDEALWAHR
          R IK  L K+ +    +DW L    AL+  R
Subjt:  STREIKAILEKI-MDPTFKDWSLRPDEALWAHR

P08361 Gag-Pol polyprotein | 2.3e-14 | 36.84
Query:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL
        W +DF    P  +G  Y+L+ VD  S WIEA  T    AKVV K  L+ IF R+G P+ + +  G  F++K+  ++     I  ++  AY PQ++GQ E 
Subjt:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL

Query:  STREIKAILEKIMDPT-FKDWSLRPDEALWAHR
          R IK  L K+   T  +DW L    AL+  R
Subjt:  STREIKAILEKIMDPT-FKDWSLRPDEALWAHR

P10273 Gag-Pol polyprotein | 1.7e-14 | 37.4
Query:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL
        W VDF    P  +G  Y+L+ +D  S W EA       AKVV K  L+ IF RYG P+ + S  G  F++++  S+ +   I  ++  AY PQ++GQ E 
Subjt:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL

Query:  STREIKAILEKI-MDPTFKDWSL
          R IK  L K+ ++   KDW L
Subjt:  STREIKAILEKI-MDPTFKDWSL

Q2F7J0 Gag-Pol polyprotein | 2.3e-14 | 36.84
Query:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL
        W VDF    P  +G  Y+L+ VD  S W+EA  T    AKVV K  L++IF R+G P+ + S  G  F +++  S+     I  ++  AY PQ++GQ E 
Subjt:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL

Query:  STREIKAILEKI-MDPTFKDWSLRPDEALWAHR
          R IK  L K+ +    +DW L    AL+  R
Subjt:  STREIKAILEKI-MDPTFKDWSLRPDEALWAHR

Q2F7J3 Gag-Pol polyprotein | 3.0e-14 | 36.84
Query:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL
        W VDF    P  +G  Y+L+ VD  S W+EA  T    AKVV K  L++IF R+G P+ + S  G  F +++  S+     I  ++  AY PQ++GQ E 
Subjt:  WGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNGQTEL

Query:  STREIKAILEKI-MDPTFKDWSLRPDEALWAHR
          R IK  L K+ +    +DW L    AL+  R
Subjt:  STREIKAILEKI-MDPTFKDWSLRPDEALWAHR

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGCCTCTACACTTCATACTCGAAGTAGAAATCTTTGACGTATGGGGGGTAGACTTTATGGGACCATTCCCACCATCCTTTGGGAACATCTACATCTTGCTCGCAGTGGA
TTATGTATCAACATGGATTGAGGCAGTAGCAACTGCAACAAATGATGCCAAGGTGGTCAAGAAGTTCTTTCTGAAGAATATCTTCACCAGATATGGAACCCCTCGCTTCA
TCATAAGCGGCGAAGGCTCTCACTTCCTGAACAAAATAATAGCCAGTTTGTTTTCCAAATACAACATTAGACACAGGGTAGCTACTGCTTACCATCCCCAAACTAATGGT
CAAACAGAGCTCTCTACTAGGGAAATCAAGGCTATTTTGGAAAAAATAATGGATCCCACTTTCAAGGATTGGTCACTGAGGCCCGATGAAGCATTATGGGCCCATAGAAC
TGCATACAAGGCACTGATAACTGCCCAAAAGATTATGCTGCTGAGCGACTGGACGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAACAGAACTGCCACATCACAGCTCT
TAGCCAACTTCATGAACCGACTTCTGTTGAGTGTGATTTTGGTGCATGAGCGATCCGCCTGGGGTAGGTTTGAGCTTGATCCAGAAATTGAAAGGACATTTAGGAACAGA
AGGAGAGAGCAGCGCAAAACCAGATGGAGAACGTGTCGCGTCTTCCGCAGGGTCCTGAAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGC
GTATGATGTCCCGATGTTTAATGTGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCAAATTTTGAAATGAAACCGGGTGATCTTGCTATGATTGCTAACGCTC
TTAAGAATGTGACAGTGATTAGTCATCAACAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTACTGTGGTGAAGACCATAAC
TACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGTGGGTCAGCTAGCTAATGAGCTGAA
GGCGAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGAGTTGGAGTCTGGTCAGGGTG
CTGGAGGTAGCAATAAAATGCTGGAGCATCTGGTTCTGTGCCAGATTAAAGCTATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAG
AGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCAAGGCTAAGGATCCAGGGTATTGGTGAAGCTAGACCTAC
CACAGTTACACTCCAATTAGCCGATAGGTCTATCACATATCTAGAGGACGAAATGGAGGATTGCTCCTTCATCAGGATTCTAGAGAGCACAGTTATTGAGACATCAATAC
AGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGCCCTTGTCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGAGACGTTGCCCATTATTGTTGCATCAGAT
TTGACTATAGTCCATATTAAGGCAGTGAAAACACCTTGGTATGATGACTTTTTCGATTACCTTGATTTTGGAAATTTTCCTCATGGTTTATCAAAAGAACAAATGAAAGA
ATTTTTCCATGGGGTGAAGTTTTATTTATCGAATGATGCATCTGTGGTTAAACAATGTGTTGAAAACGATGGGAGAGTGTTCAAAGTGAATGGTAAATTGCAGGGATGTT
GCCACGTATTGCGTAGAGTCCTATTTATTCACAACCCTTCGGTAAGTGCTTCTCACTCCACTTTCTTGCTTCCAATCTTTGTCGTTTGCATTCTTCTTTCTTTGTTGTTC
TTTTTTGCACACCTCCTTGAGTCTTCAATGCCTAAGACAATAGCAAGAAAAGAAAGAGATAATGAGGAAGATGAGGTACCTGTTGCCCCTGAAGTACCACGGACAAAGGC
GAAGAAGATGAAGACGCCAAAGAAAAAGAAGCTAAGAGAAGACGACGTCAACAACGGATGGAGGACCAAGAAGATGGTCAAAAGGAATGAAGATGTGCGAGAAGAACAGG
CAGAGGTTGCACCTGAAGGAGATAATGAGCCAGTACAGGAGGCTCGAGTGGAGGTGATCATGCCAGAGGCACCCAAACGTCGCCGCATTAAGCAAAAACAGGTCGCATCA
AGCGGTGATCTTCCGTATTTTCTAAAGACCGGTATTGCAGACCACGGCTGGAAGTTGTTTTGTGCGAAGTCTGAGTCTGTAAACGAGCAGGTGGAGCACGAATTTTATGC
TAACATTGACAAAGAGGATGGTTTCCAGGTGATTGTTCGAAGAGTCGAGGTAGACTGGAGTCCTAGTGCTATTAACGCACTGTTTAACCTTCAAGATTTCCCCCATGCAG
CATATAATGAATGGATTGACGTAGGGAAGATCATTGTTAATGAAATATTTGGATGTTGGAAGAAAAAGGTGGGGAAACTGTTTTTTTTCGAACACGATCACTATGTTATG
CAGCAGGCAGGAGTGCCCACGGTTCTAGAGGATATTATTCTGTTTCACAAGGGGATCATCGACACGCCTAACTTGGCACGGCTCCAGCCCCTTCCTGCATTCCCTGTGGA
TCTGTTGAACCCCTGGATTCCGCCCCCACCTGTTGAAAGAGAAGAAGAGAATGATGATGAAGAGCAGGGTTGGGAAGATTGA
mRNA sequence
ATGCCTCTACACTTCATACTCGAAGTAGAAATCTTTGACGTATGGGGGGTAGACTTTATGGGACCATTCCCACCATCCTTTGGGAACATCTACATCTTGCTCGCAGTGGA
TTATGTATCAACATGGATTGAGGCAGTAGCAACTGCAACAAATGATGCCAAGGTGGTCAAGAAGTTCTTTCTGAAGAATATCTTCACCAGATATGGAACCCCTCGCTTCA
TCATAAGCGGCGAAGGCTCTCACTTCCTGAACAAAATAATAGCCAGTTTGTTTTCCAAATACAACATTAGACACAGGGTAGCTACTGCTTACCATCCCCAAACTAATGGT
CAAACAGAGCTCTCTACTAGGGAAATCAAGGCTATTTTGGAAAAAATAATGGATCCCACTTTCAAGGATTGGTCACTGAGGCCCGATGAAGCATTATGGGCCCATAGAAC
TGCATACAAGGCACTGATAACTGCCCAAAAGATTATGCTGCTGAGCGACTGGACGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAACAGAACTGCCACATCACAGCTCT
TAGCCAACTTCATGAACCGACTTCTGTTGAGTGTGATTTTGGTGCATGAGCGATCCGCCTGGGGTAGGTTTGAGCTTGATCCAGAAATTGAAAGGACATTTAGGAACAGA
AGGAGAGAGCAGCGCAAAACCAGATGGAGAACGTGTCGCGTCTTCCGCAGGGTCCTGAAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGC
GTATGATGTCCCGATGTTTAATGTGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCAAATTTTGAAATGAAACCGGGTGATCTTGCTATGATTGCTAACGCTC
TTAAGAATGTGACAGTGATTAGTCATCAACAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTACTGTGGTGAAGACCATAAC
TACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGTGGGTCAGCTAGCTAATGAGCTGAA
GGCGAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGAGTTGGAGTCTGGTCAGGGTG
CTGGAGGTAGCAATAAAATGCTGGAGCATCTGGTTCTGTGCCAGATTAAAGCTATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAG
AGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCAAGGCTAAGGATCCAGGGTATTGGTGAAGCTAGACCTAC
CACAGTTACACTCCAATTAGCCGATAGGTCTATCACATATCTAGAGGACGAAATGGAGGATTGCTCCTTCATCAGGATTCTAGAGAGCACAGTTATTGAGACATCAATAC
AGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGCCCTTGTCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGAGACGTTGCCCATTATTGTTGCATCAGAT
TTGACTATAGTCCATATTAAGGCAGTGAAAACACCTTGGTATGATGACTTTTTCGATTACCTTGATTTTGGAAATTTTCCTCATGGTTTATCAAAAGAACAAATGAAAGA
ATTTTTCCATGGGGTGAAGTTTTATTTATCGAATGATGCATCTGTGGTTAAACAATGTGTTGAAAACGATGGGAGAGTGTTCAAAGTGAATGGTAAATTGCAGGGATGTT
GCCACGTATTGCGTAGAGTCCTATTTATTCACAACCCTTCGGTAAGTGCTTCTCACTCCACTTTCTTGCTTCCAATCTTTGTCGTTTGCATTCTTCTTTCTTTGTTGTTC
TTTTTTGCACACCTCCTTGAGTCTTCAATGCCTAAGACAATAGCAAGAAAAGAAAGAGATAATGAGGAAGATGAGGTACCTGTTGCCCCTGAAGTACCACGGACAAAGGC
GAAGAAGATGAAGACGCCAAAGAAAAAGAAGCTAAGAGAAGACGACGTCAACAACGGATGGAGGACCAAGAAGATGGTCAAAAGGAATGAAGATGTGCGAGAAGAACAGG
CAGAGGTTGCACCTGAAGGAGATAATGAGCCAGTACAGGAGGCTCGAGTGGAGGTGATCATGCCAGAGGCACCCAAACGTCGCCGCATTAAGCAAAAACAGGTCGCATCA
AGCGGTGATCTTCCGTATTTTCTAAAGACCGGTATTGCAGACCACGGCTGGAAGTTGTTTTGTGCGAAGTCTGAGTCTGTAAACGAGCAGGTGGAGCACGAATTTTATGC
TAACATTGACAAAGAGGATGGTTTCCAGGTGATTGTTCGAAGAGTCGAGGTAGACTGGAGTCCTAGTGCTATTAACGCACTGTTTAACCTTCAAGATTTCCCCCATGCAG
CATATAATGAATGGATTGACGTAGGGAAGATCATTGTTAATGAAATATTTGGATGTTGGAAGAAAAAGGTGGGGAAACTGTTTTTTTTCGAACACGATCACTATGTTATG
CAGCAGGCAGGAGTGCCCACGGTTCTAGAGGATATTATTCTGTTTCACAAGGGGATCATCGACACGCCTAACTTGGCACGGCTCCAGCCCCTTCCTGCATTCCCTGTGGA
TCTGTTGAACCCCTGGATTCCGCCCCCACCTGTTGAAAGAGAAGAAGAGAATGATGATGAAGAGCAGGGTTGGGAAGATTGA
Protein sequence
MPLHFILEVEIFDVWGVDFMGPFPPSFGNIYILLAVDYVSTWIEAVATATNDAKVVKKFFLKNIFTRYGTPRFIISGEGSHFLNKIIASLFSKYNIRHRVATAYHPQTNG
QTELSTREIKAILEKIMDPTFKDWSLRPDEALWAHRTAYKALITAQKIMLLSDWTEQILCCSKTGNRTATSQLLANFMNRLLLSVILVHERSAWGRFELDPEIERTFRNR
RREQRKTRWRTCRVFRRVLKAENPILIANDRTRAIRAYDVPMFNVLNPGIARPQIQAANFEMKPGDLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHN
YEFCPSNPASVFFVGNQRNNPYSNFYNPVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGSNKMLEHLVLCQIKAIEQMPNYAKFLKDILTKKK
RLGEFETVSLTEECSAILKNGLPPRLRIQGIGEARPTTVTLQLADRSITYLEDEMEDCSFIRILESTVIETSIQDSADKHSEKHGEPLSDHLKYVYLGEGETLPIIVASD
LTIVHIKAVKTPWYDDFFDYLDFGNFPHGLSKEQMKEFFHGVKFYLSNDASVVKQCVENDGRVFKVNGKLQGCCHVLRRVLFIHNPSVSASHSTFLLPIFVVCILLSLLF
FFAHLLESSMPKTIARKERDNEEDEVPVAPEVPRTKAKKMKTPKKKKLREDDVNNGWRTKKMVKRNEDVREEQAEVAPEGDNEPVQEARVEVIMPEAPKRRRIKQKQVAS
SGDLPYFLKTGIADHGWKLFCAKSESVNEQVEHEFYANIDKEDGFQVIVRRVEVDWSPSAINALFNLQDFPHAAYNEWIDVGKIIVNEIFGCWKKKVGKLFFFEHDHYVM
QQAGVPTVLEDIILFHKGIIDTPNLARLQPLPAFPVDLLNPWIPPPPVEREEENDDEEQGWED
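
As a quick consistency check between the CDS and protein listings above, the sketch below translates the first and last few codons of the CDS with the standard genetic code. It is an illustration only: the `translate` helper is hypothetical and not a CuGenDB tool, and it assumes the standard nuclear code (NCBI translation table 1) applies to this gene.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 15 nt of the CDS listed above.
cds_start = "ATGCCTCTACACTTCATACTCGAAGTAGAA"
cds_end = "GGTTGGGAAGATTGA"

print(translate(cds_start))  # MPLHFILEVE
print(translate(cds_end))    # GWED*
```

Both results agree with the start (`MPLHFILEVE`) and end (`GWED`) of the protein sequence, with the final `TGA` read as the stop codon.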