CuGenDBv2

Lag0032493 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032493
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr11:33490768..33494903
RNA-Seq Expression: Lag0032493
Synteny: Lag0032493
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily
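
The genome location above follows a chromosome:start..end pattern. As a minimal sketch, it can be split into its parts and the genomic span computed as follows. The helper name parse_location is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr11:33490768..33494903")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

The span is the length of the gene region on the chromosome, not the length of the CDS listed under Sequences.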


Homology
GenBank top hits (hit | e value | % identity | alignment)
XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis] | e value: 5.6e-58 | % identity: 46.12
Query:  KEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRP
        ++EE++WKQRSR  WLKEGD+NT++FH +AS RR+ NRIGG+LD+  +  +D   V ++  ++F  LF ++ P+ +  D + +D    V+ EMN  L  P
Subjt:  KEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRP

Query:  FTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKH
        F EEEI+ AL Q  P KAPGPDGL  +F++ HW  V   VI +CL +LN   +   +N T I L+PK   P+ VS+FRPISLCN  Y++I+K++ N +KH
Subjt:  FTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKH

Query:  ILPKLISSNHSAFIPGRCVVDNAILGFECIHELRDGLGENLNGLL
        IL K++S N SAFI  R + DN I+G+E ++++R G G+  NG++
Subjt:  ILPKLISSNHSAFIPGRCVVDNAILGFECIHELRDGLGENLNGLL

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | e value: 3.5e-60 | % identity: 49.18
Query:  EEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPF
        +EE+YWKQRSR  WLKEGD+NT++FH +AS RRR N+I G+ DD      D   +      +FQQLF SS PS      +L+ L   V  EMN  L  PF
Subjt:  EEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPF

Query:  TEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHI
        T E+I RAL +  P KAPGPDGL  +F++ HW IVG  + ++CL +LN   +  S+N T I L+PK++ PR+V +FRPISLCN  Y++++KA+ NR+K I
Subjt:  TEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHI

Query:  LPKLISSNHSAFIPGRCVVDNAILGFECIHELRDGLGENLNGLL
        L  +IS N SAFIP R + DN I+G+EC+H++R   G   NGL+
Subjt:  LPKLISSNHSAFIPGRCVVDNAILGFECIHELRDGLGENLNGLL

XP_015388020.1 uncharacterized protein LOC107177951 [Citrus sinensis] | e value: 9.6e-58 | % identity: 45.08
Query:  EEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPF
        +EE+YW+QRSR VWL+EGD+NT++FH +AS R+R N I G++D++    +    + ++  +Y+  LF S+ PS Q  +++LR++   V  EMN  L +PF
Subjt:  EEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPF

Query:  TEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHI
        TE +I+ AL Q HP KAPGPDGL  +F++ HW+ V   VI +CL +LN G +   +N T I L+ KI  PR V ++RPISLCN  Y++I+K + NR+K I
Subjt:  TEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHI

Query:  LPKLISSNHSAFIPGRCVVDNAILGFECIHELRDGLGENLNGLL
        L  +I    SAF+P R + DN I+G+EC+H++R    +  NGL+
Subjt:  LPKLISSNHSAFIPGRCVVDNAILGFECIHELRDGLGENLNGLL

XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber] | e value: 1.8e-59 | % identity: 50.87
Query:  EEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPF
        +EE+ WKQRSR +WL+ GD+NT++FH  AS RRR NRIGGLL+D     +D+  + +++ DYF  +F S  PS   FD SL  +   V  EMN DLL  F
Subjt:  EEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPF

Query:  TEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHI
          EE+  AL+Q HP KAPGPDG+S  FY+ +W IV P VI+  LAVLN G  P  INET I L+PK+ +P+++++FRPISLCN  YK+ISK + NR+K +
Subjt:  TEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHI

Query:  LPKLISSNHSAFIPGRCVVDNAILGFECIH
        L ++I  + SAF+PGR ++DN ++ FE +H
Subjt:  LPKLISSNHSAFIPGRCVVDNAILGFECIH

XP_030942013.1 uncharacterized protein LOC115967068 [Quercus lobata] | e value: 2.8e-57 | % identity: 47.6
Query:  EELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPFT
        EE+ W QRSR VW+K GD+NT++FH  AS RRR NRI GL D+    ++D+  V  ++ +YFQ++F +S P   +F  SL  ++R V ++MN DLL+ F 
Subjt:  EELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPFT

Query:  EEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHIL
        EEE+ RALKQ HP K+PGP+ +S  F++++W +VGP V+   L  L  G  P  +N+T I L+PK+  P+++S+FRPISLCN  YK++SK + NR+K +L
Subjt:  EEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHIL

Query:  PKLISSNHSAFIPGRCVVDNAILGFECIH
        P +IS   SAF+PGR + DN ++ FE +H
Subjt:  PKLISSNHSAFIPGRCVVDNAILGFECIH

TrEMBL top hits (hit | e value | % identity | alignment)
A0A2N9GPZ7 Reverse transcriptase domain-containing protein | e value: 5.1e-57 | % identity: 45.92
Query:  KEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRP
        ++EE++W+QRSR  W+ EGD+NT++FH Q + RRR N I GL D     + ++  + ++  DYFQ +F SS PS +     L+ ++  V N MN  L   
Subjt:  KEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRP

Query:  FTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKH
        FT++E+  ALKQ +P KAPGPDG+S  FY+ +W IVGP V Q+ L++L+ G     IN T I L+PK+K P  ++DFRPISLCN  YK++SK + NR+K 
Subjt:  FTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKH

Query:  ILPKLISSNHSAFIPGRCVVDNAILGFECIHEL
        +LP +IS   SAF+PGR + DN ++ FE +H +
Subjt:  ILPKLISSNHSAFIPGRCVVDNAILGFECIHEL

A0A2N9I335 Reverse transcriptase domain-containing protein | e value: 3.8e-60 | % identity: 47.64
Query:  KEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRP
        + EE+YW+QRSR  W++EGD+NT++FH   ++RR +N I GL D+   ++ D+  +  +  DYFQ +F SS P D+  +  L  L+R V  EMN  LL  
Subjt:  KEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRP

Query:  FTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKH
        F  EE+ +ALKQ +P KAPGPDG+S  FY+ +W IVGP V Q+ L++L+ G     IN T I L+PK+K P R++DFRPISLCN  YK++SK + NR+K 
Subjt:  FTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKH

Query:  ILPKLISSNHSAFIPGRCVVDNAILGFECIHEL
        +LP +IS + SAF+PGR + DN ++ FE +H +
Subjt:  ILPKLISSNHSAFIPGRCVVDNAILGFECIHEL

A0A2N9IPS8 Reverse transcriptase domain-containing protein | e value: 5.1e-57 | % identity: 45.92
Query:  KEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRP
        ++EE++W+QRSR  W+ EGD+NT++FH Q + RRR N I GL D     + ++  + ++  DYFQ +F SS PS +     L+ ++  V N MN  L   
Subjt:  KEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRP

Query:  FTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKH
        FT++E+  ALKQ +P KAPGPDG+S  FY+ +W IVGP V Q+ L++L+ G     IN T I L+PK+K P  ++DFRPISLCN  YK++SK + NR+K 
Subjt:  FTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKH

Query:  ILPKLISSNHSAFIPGRCVVDNAILGFECIHEL
        +LP +IS   SAF+PGR + DN ++ FE +H +
Subjt:  ILPKLISSNHSAFIPGRCVVDNAILGFECIHEL

A0A7N2R0C3 Reverse transcriptase domain-containing protein | e value: 6.7e-57 | % identity: 45.57
Query:  EELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPFT
        EE+ W QRSR +W+K GD+NTR+FH  A+ RRR N+I G+LD     R++   V +++ +YF++++ S+ P+  +F   L  + R V  +MN DLLR F 
Subjt:  EELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPFT

Query:  EEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHIL
        EEE+ +AL Q HP K+PGPDG+S  F++ +W +VGP V+QS +  L  G  P  +NET I L+PK+K P++++++RPISLCN  YKL+SK + NR+K +L
Subjt:  EEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHIL

Query:  PKLISSNHSAFIPGRCVVDNAILGFE---CIHELRDG
        P ++    SAF+PGR + DN ++ FE   CI++ R G
Subjt:  PKLISSNHSAFIPGRCVVDNAILGFE---CIHELRDG

A0A803Q8X4 Uncharacterized protein | e value: 5.1e-57 | % identity: 48.15
Query:  CIREANQKEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEM
        C+ E N    E+YWKQRSR +WLK GD+NT++FH +AS RR+ N I GL DDH++ +     + ++  +YFQ LF  S    + +D     +   +  E 
Subjt:  CIREANQKEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEM

Query:  NVDLLRPFTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKA
        N  LL PF E ++  AL Q HP KAPG DGL   F++ HW IVGP V ++CL +LN      S+NET+I L+PK+K P ++S+FRPISLCN  YK+++K 
Subjt:  NVDLLRPFTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKA

Query:  VVNRMKHILPKLISSNHSAFIPGRCVVDNAILGFECIHELRDG
        + NRMK  L   ISSN SAFI GR + DNAILGFE +H  R G
Subjt:  VVNRMKHILPKLISSNHSAFIPGRCVVDNAILGFECIHELRDG

SwissProt top hits (hit | e value | % identity | alignment)
P08548 LINE-1 reverse transcriptase homolog | e value: 3.0e-14 | % identity: 25.51
Query:  HSSAWPSWYKRKDPWCIR-EANQKEEELYWKQ--RSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSD
        HS+  PS  +RK+   IR E N+ E +   +Q  +S+  + ++ ++  +        +R  + I  + + + E+  D + + +++ +Y+++L+     + 
Subjt:  HSSAWPSWYKRKDPWCIR-EANQKEEELYWKQ--RSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSD

Query:  QDFDVSLRDLQRSVDNEMNVDLL-RPFTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKI-KAPRR
        ++ D  L        ++  V++L RP +  EI   ++     K+PGPDG +  FY+     + P ++     +   G  P +  E  I L+PK  K P R
Subjt:  QDFDVSLRDLQRSVDNEMNVDLL-RPFTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKI-KAPRR

Query:  VSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNHSAFIPG
          ++RPISL N   K+++K + NR++  + K+I  +   FIPG
Subjt:  VSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNHSAFIPG

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 1.7e-12 | % identity: 27.22
Query:  IGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLR-PFTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVG
        I  + ++  ++  D   +   +  ++++L+ +   +  + D  L   Q    N+  VD L  P + +EI   +      K+PGPDG S  FY+     + 
Subjt:  IGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLR-PFTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVG

Query:  PSVIQSCLAVLNHGCSPGSINETMIVLVPK-IKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNHSAFIPG
        P + +    +   G  P S  E  I L+PK  K P ++ +FRPISL N   K+++K + NR++  +  +I  +   FIPG
Subjt:  PSVIQSCLAVLNHGCSPGSINETMIVLVPK-IKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNHSAFIPG

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 4.5e-18 | % identity: 28
Query:  RSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPFTEEEIIRA
        RSR   L + D+ +R+F+     +    +I  L  +     +D   +      ++Q LF S +P   D    L D    V       L  P T +E+ +A
Subjt:  RSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPFTEEEIIRA

Query:  LKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSN
        L+    +K+PG DGL+  F++  W  +GP   +        G  P S    ++ L+PK    R + ++RP+SL +  YK+++KA+  R+K +L ++I  +
Subjt:  LKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSN

Query:  HSAFIPGRCVVDNAILGFECIHELR
         S  +PGR + DN  L  + +H  R
Subjt:  HSAFIPGRCVVDNAILGFECIHELR

Arabidopsis top hits (hit | e value | % identity | alignment)
AT1G43760.1 DNAse I-like superfamily protein | e value: 1.7e-15 | % identity: 29.47
Query:  ELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLS-SEPSDQDFDVSLRDLQRSVDNEMNVDLLRPF-
        E +++Q+SR  WL++GD NTR+FH+     +  N I  L  D     ++   V +++  Y+  L  S S+    D    ++D+     N+     L    
Subjt:  ELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMVLQLVTDYFQQLFLS-SEPSDQDFDVSLRDLQRSVDNEMNVDLLRPF-

Query:  TEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLIS
        +++EI  A+     +KAPGPD  +  F+   W +V  S I +       G      N T I L+PK+    ++S FRP+S C   YK+I+
Subjt:  TEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVPKIKAPRRVSDFRPISLCNFSYKLIS


Sequences
CDS sequence
ATGCTTCAGACCCCCAAAAGAAACCCCCAGGGTTTTGATTCTGAAACCATGAGCCAAACACCATCTTTTGGATCTGAATGGGATAATCTTTGGTGGTCTCTCCATGTGAG
AGTGGGTTCTAGGGTTTTGGGGGTATCTCGGCCGTTTGGTGGTGTTGGGTTATCTCGACGGCGAGGTCCTCTCGAGTTGCAGGTTCAGCTTGTCTCCGGCGGCTTCAAGG
TTTCGGCGTGTGTAGGAGGCGTTTTCGATCTCCTGCAAGGGCTAGGTTGTTTCCGACGGTTTCATGGTCTCGGGTTAGTTTTTTGTTTCGTCTGTGGTCTGTCTCGTCTG
TTCTGGTTGCTGTTGCACTTGTTTGGTTTCTCGTCTGTAGTTTCTCAAGGTCTAGGTGGGTGTGTCGGTTGTTGTTCGCCTGTATCGGTGTCTGCTAGGTTGATTTCCGT
CGCATTTCTTTCGTCTACGAGTGTCTGGTCTGTTGTCGTAGCGGTGCAGCTCTGTGGTTCTGCTTGGGCTCTTCGATTTGGGCTGGTCGGTGGTTTGTGTAGTCTGTGGC
TTGTGTGGTTTTCGTGGTTTGTGTGGCGTGTAGTGGTTTGGGGTTTTGCCGATAGAGTTCTCTCAGTGCATTTATGGATGATCTTGTGGTTCAATGGGAAAATATGGGGT
TATCAGAGGCAGAGTCGACGCGTTCTCGTGCCAGCAGACATTCCTCTTTTGGATGAATCAACAGTTCAATGTGTGCCGTTGGCAAGGGATCGGATGACCCGTCTCTCCTG
GATTTTTCTCGCTGTGAGTTTTGGGTCATATCACGAAAGTTCCGTTGAACTATCATACACAGCTATGGCTCGTGCTCTTGGTAGTGTGGTGGGTCAGTGGTTGAGGTGCC
AGGGGAAGGTCACAGTGACTGGCTTGGCTCAATGTGGGCGTATTGGCCATTCACATAGGGAGTGCTCTGAAGAGGGGGAAGGCGTGGGTGCTGATAATCAGTTTCTGTTT
GGTGACTGGTTGCGGGCTGTTCCATTCCGGCGTGTTGTTGCTAGTGCTTCAGAGGAGGGTGGTGGCCGCCCAAGAGGCCTGAGGTGTTTCCGGGGTCTGACCGGGTGGCG
ATCTGGTTGCGGATCAGTAGTTGTTCCTGAGGGGCTATGGTCCTCCGGTTCCTTCGGGTCCACCCTTGCAGTAGGTCCTGATGTTGGGGTGGTTTCTGCGTATAAAGGTA
AGGAGGTGCAGATCCGGATGTTGCTCCGAAGCTCTTTGAAGGACATTACCAATGATTTACCTTCCCCAGTTATTAGTGGGCACAAGCGATCGGTCCAAGGGGACCCGCCT
GATGAGGAGGGGTCAGTTCCCAAGCGGTTGAGGGAGGAGGGGTCTGGTGTGGATCTGTGTGAGGCTGACAGGATGGATGTGGCGGGTCTCCCCGTGCATTCCAGCGCTTG
GCCAAGTTGGTATAAGAGAAAAGACCCCTGGTGCATCAGGGAGGCCAACCAGAAGGAGGAGGAACTTTACTGGAAGCAACGGTCCAGAGAGGTGTGGTTGAAGGAAGGGG
ATCAGAATACTCGGTGGTTTCATCGTCAAGCCTCGTATAGGCGAAGGCTCAATCGTATTGGGGGCCTCTTAGATGATCATAGGGAGATGCGCCAGGACAGAGCTATGGTT
CTTCAGTTGGTGACTGATTATTTCCAGCAGCTTTTCTTGTCATCAGAGCCGAGTGATCAGGACTTCGATGTATCTCTCAGGGACCTTCAGCGTTCTGTGGATAATGAAAT
GAATGTGGATCTGTTACGACCTTTTACTGAGGAGGAGATTATTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCAGGCCCAGATGGGTTATCTGGCAGTTTCTATA
AGAATCACTGGTCAATAGTGGGGCCTTCAGTGATCCAGAGTTGCTTGGCCGTTTTAAATCACGGATGTTCCCCGGGTTCGATTAATGAGACTATGATTGTTCTCGTTCCG
AAGATCAAGGCCCCTCGTCGAGTTTCTGATTTTCGTCCCATCTCCTTATGTAATTTTAGCTATAAGCTGATTTCGAAGGCGGTGGTTAATAGGATGAAGCATATCCTTCC
TAAACTTATTTCATCCAACCATAGTGCCTTTATCCCTGGGAGGTGTGTGGTGGACAATGCCATCTTGGGGTTTGAGTGCATCCATGAGTTAAGGGACGGACTGGGGGAAA
ATCTAAATGGGCTGCTCTAA
mRNA sequence
ATGCTTCAGACCCCCAAAAGAAACCCCCAGGGTTTTGATTCTGAAACCATGAGCCAAACACCATCTTTTGGATCTGAATGGGATAATCTTTGGTGGTCTCTCCATGTGAG
AGTGGGTTCTAGGGTTTTGGGGGTATCTCGGCCGTTTGGTGGTGTTGGGTTATCTCGACGGCGAGGTCCTCTCGAGTTGCAGGTTCAGCTTGTCTCCGGCGGCTTCAAGG
TTTCGGCGTGTGTAGGAGGCGTTTTCGATCTCCTGCAAGGGCTAGGTTGTTTCCGACGGTTTCATGGTCTCGGGTTAGTTTTTTGTTTCGTCTGTGGTCTGTCTCGTCTG
TTCTGGTTGCTGTTGCACTTGTTTGGTTTCTCGTCTGTAGTTTCTCAAGGTCTAGGTGGGTGTGTCGGTTGTTGTTCGCCTGTATCGGTGTCTGCTAGGTTGATTTCCGT
CGCATTTCTTTCGTCTACGAGTGTCTGGTCTGTTGTCGTAGCGGTGCAGCTCTGTGGTTCTGCTTGGGCTCTTCGATTTGGGCTGGTCGGTGGTTTGTGTAGTCTGTGGC
TTGTGTGGTTTTCGTGGTTTGTGTGGCGTGTAGTGGTTTGGGGTTTTGCCGATAGAGTTCTCTCAGTGCATTTATGGATGATCTTGTGGTTCAATGGGAAAATATGGGGT
TATCAGAGGCAGAGTCGACGCGTTCTCGTGCCAGCAGACATTCCTCTTTTGGATGAATCAACAGTTCAATGTGTGCCGTTGGCAAGGGATCGGATGACCCGTCTCTCCTG
GATTTTTCTCGCTGTGAGTTTTGGGTCATATCACGAAAGTTCCGTTGAACTATCATACACAGCTATGGCTCGTGCTCTTGGTAGTGTGGTGGGTCAGTGGTTGAGGTGCC
AGGGGAAGGTCACAGTGACTGGCTTGGCTCAATGTGGGCGTATTGGCCATTCACATAGGGAGTGCTCTGAAGAGGGGGAAGGCGTGGGTGCTGATAATCAGTTTCTGTTT
GGTGACTGGTTGCGGGCTGTTCCATTCCGGCGTGTTGTTGCTAGTGCTTCAGAGGAGGGTGGTGGCCGCCCAAGAGGCCTGAGGTGTTTCCGGGGTCTGACCGGGTGGCG
ATCTGGTTGCGGATCAGTAGTTGTTCCTGAGGGGCTATGGTCCTCCGGTTCCTTCGGGTCCACCCTTGCAGTAGGTCCTGATGTTGGGGTGGTTTCTGCGTATAAAGGTA
AGGAGGTGCAGATCCGGATGTTGCTCCGAAGCTCTTTGAAGGACATTACCAATGATTTACCTTCCCCAGTTATTAGTGGGCACAAGCGATCGGTCCAAGGGGACCCGCCT
GATGAGGAGGGGTCAGTTCCCAAGCGGTTGAGGGAGGAGGGGTCTGGTGTGGATCTGTGTGAGGCTGACAGGATGGATGTGGCGGGTCTCCCCGTGCATTCCAGCGCTTG
GCCAAGTTGGTATAAGAGAAAAGACCCCTGGTGCATCAGGGAGGCCAACCAGAAGGAGGAGGAACTTTACTGGAAGCAACGGTCCAGAGAGGTGTGGTTGAAGGAAGGGG
ATCAGAATACTCGGTGGTTTCATCGTCAAGCCTCGTATAGGCGAAGGCTCAATCGTATTGGGGGCCTCTTAGATGATCATAGGGAGATGCGCCAGGACAGAGCTATGGTT
CTTCAGTTGGTGACTGATTATTTCCAGCAGCTTTTCTTGTCATCAGAGCCGAGTGATCAGGACTTCGATGTATCTCTCAGGGACCTTCAGCGTTCTGTGGATAATGAAAT
GAATGTGGATCTGTTACGACCTTTTACTGAGGAGGAGATTATTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCAGGCCCAGATGGGTTATCTGGCAGTTTCTATA
AGAATCACTGGTCAATAGTGGGGCCTTCAGTGATCCAGAGTTGCTTGGCCGTTTTAAATCACGGATGTTCCCCGGGTTCGATTAATGAGACTATGATTGTTCTCGTTCCG
AAGATCAAGGCCCCTCGTCGAGTTTCTGATTTTCGTCCCATCTCCTTATGTAATTTTAGCTATAAGCTGATTTCGAAGGCGGTGGTTAATAGGATGAAGCATATCCTTCC
TAAACTTATTTCATCCAACCATAGTGCCTTTATCCCTGGGAGGTGTGTGGTGGACAATGCCATCTTGGGGTTTGAGTGCATCCATGAGTTAAGGGACGGACTGGGGGAAA
ATCTAAATGGGCTGCTCTAA
Protein sequence
MLQTPKRNPQGFDSETMSQTPSFGSEWDNLWWSLHVRVGSRVLGVSRPFGGVGLSRRRGPLELQVQLVSGGFKVSACVGGVFDLLQGLGCFRRFHGLGLVFCFVCGLSRL
FWLLLHLFGFSSVVSQGLGGCVGCCSPVSVSARLISVAFLSSTSVWSVVVAVQLCGSAWALRFGLVGGLCSLWLVWFSWFVWRVVVWGFADRVLSVHLWMILWFNGKIWG
YQRQSRRVLVPADIPLLDESTVQCVPLARDRMTRLSWIFLAVSFGSYHESSVELSYTAMARALGSVVGQWLRCQGKVTVTGLAQCGRIGHSHRECSEEGEGVGADNQFLF
GDWLRAVPFRRVVASASEEGGGRPRGLRCFRGLTGWRSGCGSVVVPEGLWSSGSFGSTLAVGPDVGVVSAYKGKEVQIRMLLRSSLKDITNDLPSPVISGHKRSVQGDPP
DEEGSVPKRLREEGSGVDLCEADRMDVAGLPVHSSAWPSWYKRKDPWCIREANQKEEELYWKQRSREVWLKEGDQNTRWFHRQASYRRRLNRIGGLLDDHREMRQDRAMV
LQLVTDYFQQLFLSSEPSDQDFDVSLRDLQRSVDNEMNVDLLRPFTEEEIIRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNHGCSPGSINETMIVLVP
KIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNHSAFIPGRCVVDNAILGFECIHELRDGLGENLNGLL