CuGenDBv2

Lag0025283 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025283
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: zf-RVT domain-containing protein
Genome location: chr10:10831518..10837848
RNA-Seq Expression: Lag0025283
Synteny: Lag0025283
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
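
The genome location field uses a `chromosome:start..end` layout. A minimal sketch of reading it and computing the locus span follows; the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts.

    Hypothetical helper for illustration; not part of any CuGenDB tool.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end


chrom, start, end = parse_location("chr10:10831518..10837848")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 6331 bp on chr10.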


Homology
GenBank top hits
BBN69746.1 VIRB2-interacting protein 2 [Prunus dulcis] | e value: 7.1e-25 | % identity: 32.75
Query:  WEESYKEWRDVKEF------KGRKQEVLIHS-----GNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDP
        WE+ +     +KE         RKQ   I S     G  LS+ FGF R L++ E T +  LL L+  +   T+R D R+W  DPS  F+CHSF  C    
Subjt:  WEESYKEWRDVKEF------KGRKQEVLIHS-----GNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDP

Query:  VSSEPSIF---SLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMI
           E  IF   + +WK K P KV+ F+WQ + G++NT D L R+ P L   P  C LC MA + ++H+L +C F+  +W+         +    G  ++ 
Subjt:  VSSEPSIF---SLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMI

Query:  EEFLPHPPFREQGKFLWQAGICAIIWGLW
           +      ++ K LW + + A++W LW
Subjt:  EEFLPHPPFREQGKFLWQAGICAIIWGLW

TYK09969.1 calpain-type cysteine protease DEK1 [Cucumis melo var. makuwa] | e value: 2.4e-36 | % identity: 41.45
Query:  EVLIHSGNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHG
        + L+ +GNS S+ FGF R LSDRET+N+V L+SL+   +F   R+D+ VWSP P   F C SFF+CL++   +  S+ SL+W++KVP+K   F WQV   
Subjt:  EVLIHSGNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHG

Query:  RVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMIEEFLPHPPFREQGKFLWQAGICAIIWG
                                   EEDLDH+LW+C     VWD+F  +FGL +AR R +R  +EEFL + P  E+G FLW A + A + G
Subjt:  RVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMIEEFLPHPPFREQGKFLWQAGICAIIWG

TYK11201.1 SIL1 [Cucumis melo var. makuwa] | e value: 6.9e-28 | % identity: 47.86
Query:  PSLRFSCHSFFRCLLDPV--SSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSS
        P   F   S F CL D V    E    S   ++KVPKK +FFI QV+ G +NT+DRL + +  L GPF C LC+ A+EDLD++ W+C   R VW+ F   
Subjt:  PSLRFSCHSFFRCLLDPV--SSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSS

Query:  FGLQFARQRGLRKMIEEFLPHPPFREQGKFLWQAGICAII
        F   F  QR +R  IEEFL HPPFRE+   LW AG+CA+I
Subjt:  FGLQFARQRGLRKMIEEFLPHPPFREQGKFLWQAGICAII

TYK29954.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 6.9e-28 | % identity: 41.21
Query:  LSDRETTNLVTLLSLIGEITFSTTRKD--------------IRVWSPDPSLRFSCHSFFRCLLDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTL
        L D  +++ ++ +SL+    F+T+ K+              + VWS +PS  FS  S F  LLDP  +   +F  +W++KV KKV+FF WQV+  R N +
Subjt:  LSDRETTNLVTLLSLIGEITFSTTRKD--------------IRVWSPDPSLRFSCHSFFRCLLDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTL

Query:  DRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMIEEFL
        DRL R+    +   CCILCR A+EDLDH+LW+C +AR VW  F   F ++ A  R +RK IEEFL
Subjt:  DRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMIEEFL

VVA39726.1 Hypothetical predicted protein, partial [Prunus dulcis] | e value: 7.1e-25 | % identity: 32.31
Query:  WEESYKEWRDVKEF------KGRKQEVLIHS-----GNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDP
        WE+ + +   +KE         RKQ   I S     G  LS+ FGF R L++ E T +  LL L+  +   T+R D R+W  DPS  F+CHS   C    
Subjt:  WEESYKEWRDVKEF------KGRKQEVLIHS-----GNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDP

Query:  VSSEPSIF---SLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMI
           E  IF   + +WK K P KV+ F+WQ + G++NT D L R+ P L   P  C LC  A + +DH+L +C F+  +W+         +    G  ++ 
Subjt:  VSSEPSIF---SLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMI

Query:  EEFLPHPPFREQGKFLWQAGICAIIWGLW
           +      ++ K LW + + A++W LW
Subjt:  EEFLPHPPFREQGKFLWQAGICAIIWGLW

TrEMBL top hits
A0A4Y1R3V4 VIRB2-interacting protein 2 | e value: 3.4e-25 | % identity: 32.75
Query:  WEESYKEWRDVKEF------KGRKQEVLIHS-----GNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDP
        WE+ +     +KE         RKQ   I S     G  LS+ FGF R L++ E T +  LL L+  +   T+R D R+W  DPS  F+CHSF  C    
Subjt:  WEESYKEWRDVKEF------KGRKQEVLIHS-----GNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDP

Query:  VSSEPSIF---SLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMI
           E  IF   + +WK K P KV+ F+WQ + G++NT D L R+ P L   P  C LC MA + ++H+L +C F+  +W+         +    G  ++ 
Subjt:  VSSEPSIF---SLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMI

Query:  EEFLPHPPFREQGKFLWQAGICAIIWGLW
           +      ++ K LW + + A++W LW
Subjt:  EEFLPHPPFREQGKFLWQAGICAIIWGLW

A0A5D3CHC7 SIL1 | e value: 3.3e-28 | % identity: 47.86
Query:  PSLRFSCHSFFRCLLDPV--SSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSS
        P   F   S F CL D V    E    S   ++KVPKK +FFI QV+ G +NT+DRL + +  L GPF C LC+ A+EDLD++ W+C   R VW+ F   
Subjt:  PSLRFSCHSFFRCLLDPV--SSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSS

Query:  FGLQFARQRGLRKMIEEFLPHPPFREQGKFLWQAGICAII
        F   F  QR +R  IEEFL HPPFRE+   LW AG+CA+I
Subjt:  FGLQFARQRGLRKMIEEFLPHPPFREQGKFLWQAGICAII

A0A5D3CI74 Calpain-type cysteine protease DEK1 | e value: 1.1e-36 | % identity: 41.45
Query:  EVLIHSGNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHG
        + L+ +GNS S+ FGF R LSDRET+N+V L+SL+   +F   R+D+ VWSP P   F C SFF+CL++   +  S+ SL+W++KVP+K   F WQV   
Subjt:  EVLIHSGNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHG

Query:  RVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMIEEFLPHPPFREQGKFLWQAGICAIIWG
                                   EEDLDH+LW+C     VWD+F  +FGL +AR R +R  +EEFL + P  E+G FLW A + A + G
Subjt:  RVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMIEEFLPHPPFREQGKFLWQAGICAIIWG

A0A5D3E255 Reverse transcriptase | e value: 3.3e-28 | % identity: 41.21
Query:  LSDRETTNLVTLLSLIGEITFSTTRKD--------------IRVWSPDPSLRFSCHSFFRCLLDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTL
        L D  +++ ++ +SL+    F+T+ K+              + VWS +PS  FS  S F  LLDP  +   +F  +W++KV KKV+FF WQV+  R N +
Subjt:  LSDRETTNLVTLLSLIGEITFSTTRKD--------------IRVWSPDPSLRFSCHSFFRCLLDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTL

Query:  DRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMIEEFL
        DRL R+    +   CCILCR A+EDLDH+LW+C +AR VW  F   F ++ A  R +RK IEEFL
Subjt:  DRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMIEEFL

M5XUF8 Reverse transcriptase domain-containing protein (Fragment) | e value: 2.4e-26 | % identity: 34.06
Query:  WEESYKEWRDVKEF------KGRKQEVLIHS-----GNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDP
        WE+ +     +KE         RKQ   I S     G SLS+ FGF R L++ E T    LL L+  +   T+R D R W  DPS  F+CHSF  C    
Subjt:  WEESYKEWRDVKEF------KGRKQEVLIHS-----GNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDP

Query:  VSSEPSIFS---LLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMI
           E  IFS    +WK K P KV+ F+WQ + G++NT D L R+ P L   P  C LC  A E +DH+L +C F+  +W+         +    G  ++ 
Subjt:  VSSEPSIFS---LLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSL-FGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMI

Query:  EEFLPHPPFREQGKFLWQAGICAIIWGLW
           +      ++ K LW + + A++W LW
Subjt:  EEFLPHPPFREQGKFLWQAGICAIIWGLW

SwissProt top hits
B5X582 Twinkle homolog protein, chloroplastic/mitochondrial | e value: 3.1e-07 | % identity: 60.87
Query:  KRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQL
        +RKITVE ++LEPLCDE+  YFA R IS+ TL RN VMQKR  +++
Subjt:  KRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQL

F4I6E6 Primase homolog protein | e value: 9.1e-07 | % identity: 60.87
Query:  KRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQL
        +RKITVES++LEPLCDE+  +FA R IS  TL RN VMQKR ++++
Subjt:  KRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQL

P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 5.9e-06 | % identity: 26.52
Query:  DRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLL---DPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGP
        D  TTN   L      +   T  +D   W      +FS  S +  L     P  +  S F+ LWKV+VP++V+ F+W V +  V T +   R+   L   
Subjt:  DRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLL---DPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGP

Query:  FCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRG-LRKMIEEFLPHPPFREQG--KFLWQAGICAIIWGLWE
          C +C+   E + H+L +C     +W            RQ+G   K + E+L        G     W      IIW  W+
Subjt:  FCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRG-LRKMIEEFLPHPPFREQG--KFLWQAGICAIIWGLWE

Arabidopsis top hits
AT1G30660.1 nucleic acid binding;nucleic acid binding | e value: 6.5e-08 | % identity: 60.87
Query:  KRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQL
        +RKITVES++LEPLCDE+  +FA R IS  TL RN VMQKR ++++
Subjt:  KRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQL

AT1G30680.1 toprim domain-containing protein | e value: 2.2e-08 | % identity: 60.87
Query:  KRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQL
        +RKITVE ++LEPLCDE+  YFA R IS+ TL RN VMQKR  +++
Subjt:  KRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQL

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 7.9e-06 | % identity: 32.14
Query:  LDPVSSEPSIFSLLW-KVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFS
        L+P+      F  +W K K+PK   F  W  +  R++T DR+         P  C+ C   +E   H+ ++C FAR VW  F S
Subjt:  LDPVSSEPSIFSLLW-KVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFS

AT2G02650.1 Ribonuclease H-like superfamily protein | e value: 2.7e-06 | % identity: 30.38
Query:  LDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRL-SRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVW
        + P      +   +WK+ V  K++ F+W+ + G + T  RL SR I +   P C   C + EE + H+++NC + ++VW
Subjt:  LDPVSSEPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRL-SRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVW

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 4.6e-06 | % identity: 31.58
Query:  SEPS---IFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVW
        SEPS   I+  +WK +   K+Q F+W+ +   +     L+ +   L     CI C   +E ++H+L+ C FAR  W
Subjt:  SEPS---IFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVW


Sequences
CDS sequence
ATGATAACACCAACTCCATTTGGCCAAGATAGCCTTGTTTTTGTCTTCAATACCGTTGAGGCCGAGACCACCTTGATCAATGGGGTGAAAGCTCTTTTCCCATTTGACAA
GTTGAAGGCTAGCTTTGTCTCCATTTCCTCTCCAAACGAAGTTCCTGCCTTTGCTGATGGTCGGTCATCATATCGAAGTTTAGGACAAGTTACACTTAACCAGAAGATCC
AGAACAAACGGAAAATTACAGTGGAGAGTCTACAACTTGAACCACTGTGCGATGAGCTGGTTGCTTATTTTGCTGAGCGATTGATCTCCAAGAGCACATTGTTAAGAAAT
TCAGTTATGCAGAAAAGGTCCAATAATCAGTTAAACCTTCCAAAAGCCAATTATCCACCATTTCTGGGTAAAGAAAATGAGACTCCCAGCAACTTGGCCACAAAACAACA
CGAACGGGTTCTGGAGGACTTTCCAGAGGGGAGGGTGGGGTTTTTTGGGCTATCAACGGAGGATTCGTGGAGGGTTACGCTTAAGGCTGATGACGATGGAAGAGTGGTTA
GAATCCAAGAAGAACACTTGAGAAAGAAGTACGCCCTGTCTGTCGAGGACGTTGTTCTGGTTTGGATTGTCGACTCTATAGACGATTTTTTTCACGCGTCGGCCACCCAC
AAGTTTTTTCGAAAGGCTGACTACAATAATGGGTTCATCTGGATTCAAAAAAATTCGAACAAACGTGGAAGCTTCCTCGAAATTATGAAAGTAATTAACTCAAGCGGAAA
ACACAACTTGGTTGTTCCGGCAGGATCTGAATTCAAAGGTTGGAAGGATTTCTCAAACCTTCTGAAAGATTTCCTTAATGGAAAAGATGAGCGGAAAGAAGCAAAAAACC
TGGACCCAAACAGAAGGAGAGATAAGGGGAGATCTTTTGCAGATATAGTCAAAAGCAAACCACATTCTAAAACCTCAATCTCTGGAGTGCCAAGTAATAATCAGTGGGAA
GAAAGTTACAAAGAGTGGAGAGACGTAAAAGAGTTCAAGGGGAGAAAACAAGAAGTTCTGATTCATTCAGGGAACTCGCTTTCTTATTGCTTTGGCTTTGCGCGTCCGTT
GTCTGATCGTGAAACAACGAACCTCGTGACTCTCCTTTCTCTGATTGGGGAAATCACTTTTAGTACCACTAGGAAAGACATTCGAGTCTGGAGTCCCGACCCTTCCCTCA
GGTTTTCTTGTCATTCCTTCTTCCGATGCCTTTTGGACCCTGTTTCCTCTGAGCCGTCCATTTTCTCTTTATTGTGGAAGGTGAAAGTTCCAAAGAAGGTGCAGTTTTTT
ATTTGGCAGGTGATTCATGGAAGAGTTAATACTCTTGATCGGTTGTCCAGAAAGATTCCTAGTTTGTTTGGGCCTTTTTGTTGCATTCTTTGTCGGATGGCGGAGGAAGA
CCTCGATCATATGTTATGGAACTGCGCTTTTGCTAGGACAGTGTGGGATGAGTTCTTTAGTTCGTTCGGGTTGCAGTTTGCCAGACAAAGAGGCCTCAGAAAGATGATCG
AAGAGTTCCTTCCCCATCCTCCCTTTAGGGAGCAAGGAAAGTTTTTGTGGCAAGCAGGGATTTGTGCTATTATTTGGGGGTTGTGGGAGCCAATTCCCCCCCCCGTTTTC
TGTCCAACGATGTCTACTCACATGGAGTCTCAGATGGAAGCTTTGGAGGCGAATCTCGCGGCTCAAGGGAAAGAAATTGCGGCGATTCACAATTCGATGGAGAATTTTTC
GGATATGGTGCCTCAGATGAAAGAAAGTTTCTCAGCCATGGCGGCACATTTCGAAAGTTTAAAGGTTGAGAAAGTCATTCCGACTAAAAATCTGACGAAGTGGCCAGTGC
AACGATACCTGTTAATCCTGTGGAGGTTGGAACTGTTGATCCACAGGCGTTAA
mRNA sequence
ATGATAACACCAACTCCATTTGGCCAAGATAGCCTTGTTTTTGTCTTCAATACCGTTGAGGCCGAGACCACCTTGATCAATGGGGTGAAAGCTCTTTTCCCATTTGACAA
GTTGAAGGCTAGCTTTGTCTCCATTTCCTCTCCAAACGAAGTTCCTGCCTTTGCTGATGGTCGGTCATCATATCGAAGTTTAGGACAAGTTACACTTAACCAGAAGATCC
AGAACAAACGGAAAATTACAGTGGAGAGTCTACAACTTGAACCACTGTGCGATGAGCTGGTTGCTTATTTTGCTGAGCGATTGATCTCCAAGAGCACATTGTTAAGAAAT
TCAGTTATGCAGAAAAGGTCCAATAATCAGTTAAACCTTCCAAAAGCCAATTATCCACCATTTCTGGGTAAAGAAAATGAGACTCCCAGCAACTTGGCCACAAAACAACA
CGAACGGGTTCTGGAGGACTTTCCAGAGGGGAGGGTGGGGTTTTTTGGGCTATCAACGGAGGATTCGTGGAGGGTTACGCTTAAGGCTGATGACGATGGAAGAGTGGTTA
GAATCCAAGAAGAACACTTGAGAAAGAAGTACGCCCTGTCTGTCGAGGACGTTGTTCTGGTTTGGATTGTCGACTCTATAGACGATTTTTTTCACGCGTCGGCCACCCAC
AAGTTTTTTCGAAAGGCTGACTACAATAATGGGTTCATCTGGATTCAAAAAAATTCGAACAAACGTGGAAGCTTCCTCGAAATTATGAAAGTAATTAACTCAAGCGGAAA
ACACAACTTGGTTGTTCCGGCAGGATCTGAATTCAAAGGTTGGAAGGATTTCTCAAACCTTCTGAAAGATTTCCTTAATGGAAAAGATGAGCGGAAAGAAGCAAAAAACC
TGGACCCAAACAGAAGGAGAGATAAGGGGAGATCTTTTGCAGATATAGTCAAAAGCAAACCACATTCTAAAACCTCAATCTCTGGAGTGCCAAGTAATAATCAGTGGGAA
GAAAGTTACAAAGAGTGGAGAGACGTAAAAGAGTTCAAGGGGAGAAAACAAGAAGTTCTGATTCATTCAGGGAACTCGCTTTCTTATTGCTTTGGCTTTGCGCGTCCGTT
GTCTGATCGTGAAACAACGAACCTCGTGACTCTCCTTTCTCTGATTGGGGAAATCACTTTTAGTACCACTAGGAAAGACATTCGAGTCTGGAGTCCCGACCCTTCCCTCA
GGTTTTCTTGTCATTCCTTCTTCCGATGCCTTTTGGACCCTGTTTCCTCTGAGCCGTCCATTTTCTCTTTATTGTGGAAGGTGAAAGTTCCAAAGAAGGTGCAGTTTTTT
ATTTGGCAGGTGATTCATGGAAGAGTTAATACTCTTGATCGGTTGTCCAGAAAGATTCCTAGTTTGTTTGGGCCTTTTTGTTGCATTCTTTGTCGGATGGCGGAGGAAGA
CCTCGATCATATGTTATGGAACTGCGCTTTTGCTAGGACAGTGTGGGATGAGTTCTTTAGTTCGTTCGGGTTGCAGTTTGCCAGACAAAGAGGCCTCAGAAAGATGATCG
AAGAGTTCCTTCCCCATCCTCCCTTTAGGGAGCAAGGAAAGTTTTTGTGGCAAGCAGGGATTTGTGCTATTATTTGGGGGTTGTGGGAGCCAATTCCCCCCCCCGTTTTC
TGTCCAACGATGTCTACTCACATGGAGTCTCAGATGGAAGCTTTGGAGGCGAATCTCGCGGCTCAAGGGAAAGAAATTGCGGCGATTCACAATTCGATGGAGAATTTTTC
GGATATGGTGCCTCAGATGAAAGAAAGTTTCTCAGCCATGGCGGCACATTTCGAAAGTTTAAAGGTTGAGAAAGTCATTCCGACTAAAAATCTGACGAAGTGGCCAGTGC
AACGATACCTGTTAATCCTGTGGAGGTTGGAACTGTTGATCCACAGGCGTTAA
Protein sequence
MITPTPFGQDSLVFVFNTVEAETTLINGVKALFPFDKLKASFVSISSPNEVPAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRN
SVMQKRSNNQLNLPKANYPPFLGKENETPSNLATKQHERVLEDFPEGRVGFFGLSTEDSWRVTLKADDDGRVVRIQEEHLRKKYALSVEDVVLVWIVDSIDDFFHASATH
KFFRKADYNNGFIWIQKNSNKRGSFLEIMKVINSSGKHNLVVPAGSEFKGWKDFSNLLKDFLNGKDERKEAKNLDPNRRRDKGRSFADIVKSKPHSKTSISGVPSNNQWE
ESYKEWRDVKEFKGRKQEVLIHSGNSLSYCFGFARPLSDRETTNLVTLLSLIGEITFSTTRKDIRVWSPDPSLRFSCHSFFRCLLDPVSSEPSIFSLLWKVKVPKKVQFF
IWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFSSFGLQFARQRGLRKMIEEFLPHPPFREQGKFLWQAGICAIIWGLWEPIPPPVF
CPTMSTHMESQMEALEANLAAQGKEIAAIHNSMENFSDMVPQMKESFSAMAAHFESLKVEKVIPTKNLTKWPVQRYLLILWRLELLIHRR
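
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (the record does not state which table was used); the `translate` helper and `CODON_TABLE` are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the CDS listed in this record.
print(translate("ATGATAACACCAACTCCATTTGGCCAAGATAGCCTTGTT"))  # MITPTPFGQDSLV
```

Pasting the full CDS block into `translate` should reproduce the listed protein if the assumed code table is the one used for this annotation.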