CuGenDBv2

Lag0000907 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000907
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr4:19380100..19387673
RNA-Seq Expression: Lag0000907
Synteny: Lag0000907
Gene Ontology terms: NA
InterPro domains: NA
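
The genome location field can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, not part of the database itself: `parse_location` and `Location` are hypothetical names, and the span calculation assumes the `chr:start..end` notation is 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the 'Genome location' field."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr4:19380100..19387673'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr4:19380100..19387673")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the printed span is the length of the genomic interval in base pairs.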


Homology
GenBank top hits (e value, % identity, alignment)
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus] | e value: 9.4e-21 | % identity: 28.78
Query:  ITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA-----------------------------LGVEGVQWRLSKTEKRTFQ
        +T H W+ F      +NA +V EFY+NI E     V+ RG+ + ++P AIN                              L + G +W   + +++T  
Subjt:  ITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA-----------------------------LGVEGVQWRLSKTEKRTFQ

Query:  SAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCKRAGVLENEGDVILFDKGIIDTPN
           L      W  F+K +L+PT+H+ TVS +R+LL  +IL    ID+GKIIV     C +++   L FP  IT  C++  V E   D IL     ++   
Subjt:  SAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCKRAGVLENEGDVILFDKGIIDTPN

Query:  LARLQRTQEAR------------QGGLVFGIHNILEQLALSASRQEFAERQSK--TFWNYVKRRDANLKKALQESFSK
        +  L   +EA+                V      LEQ A+  + Q   +   K   ++ Y KRRDA L  AL ES  +
Subjt:  LARLQRTQEAR------------QGGLVFGIHNILEQLALSASRQEFAERQSK--TFWNYVKRRDANLKKALQESFSK

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | e value: 1.2e-20 | % identity: 34.39
Query:  TEQVQEKQTENVQEEQAKVAPEKG------NEPGQEAQVEAGITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA------
        TE  + +   N+Q        EKG         GQ   +   IT H W+ FC+  E     +V EFYAN+ +     V  RGV+V WS  AINA      
Subjt:  TEQVQEKQTENVQEEQAKVAPEKG------NEPGQEAQVEAGITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA------

Query:  -----------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGC
                               + V G +W +S     T   + L   A  W  F+K  LLPTTH  TVS++R+LL  ++L   +I+VG++I SEI  C
Subjt:  -----------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGC

Query:  WRKKVGKLFFPITITMFCK--RAGVLENEGDVILFDKGIIDTPNLARLQRTQE
          +K G LFFP  IT  C+  RA  L NE    L + G ID   +AR+  TQE
Subjt:  WRKKVGKLFFPITITMFCK--RAGVLENEGDVILFDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 6.1e-28 | % identity: 31.52
Query:  TEQVQEKQTENVQEEQAKVAPEKG------NEPGQEAQVEAGITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA------
        TE    +   N+Q        EKG         GQ   +   IT H W+ FC+  E     +V EFYAN+ + E   V  RGV+V WS  AINA      
Subjt:  TEQVQEKQTENVQEEQAKVAPEKG------NEPGQEAQVEAGITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA------

Query:  -----------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGC
                               +   G +W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   +I+VG++I SEI  C
Subjt:  -----------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGC

Query:  WRKKVGKLFFPITITMFCK--RAGVLENEGDVILFDKGIIDTPNLARL------QRTQEA-----------RQGGLVFGIHNILEQLALSASRQEF----
          +K G LFFP  IT  C+  RA  L NE    L + G ID   +AR+      + TQ+            R  G +      LEQ       Q++    
Subjt:  WRKKVGKLFFPITITMFCK--RAGVLENEGDVILFDKGIIDTPNLARL------QRTQEA-----------RQGGLVFGIHNILEQLALSASRQEF----

Query:  ----AERQSKTFWNYVKRRDANLKKALQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ
              +Q + FW Y K RD  LKKALQ +F++P P    FP  +L         E E E D++  ++
Subjt:  ----AERQSKTFWNYVKRRDANLKKALQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii] | e value: 3.0e-19 | % identity: 34.71
Query:  EGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCK--RAGVL
        E +Q R    EK     A     A  W  F+K RLLPTTH  TVS++R+LL +++L   +I+VG++I SEI  C  +K G LFFP  IT  C+  RA  L
Subjt:  EGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCK--RAGVL

Query:  ENEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVFGIHNILEQLALSASRQEF--------AERQSKTFWNYVKRRDANLKKA
         NE    L   G ID   +AR+      + TQ+           +R  G +      LEQ       Q++          +Q + FW Y K RD  LKKA
Subjt:  ENEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVFGIHNILEQLALSASRQEF--------AERQSKTFWNYVKRRDANLKKA

Query:  LQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ
        LQ +F++P P    FP  LL         E E E D++  ++
Subjt:  LQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e value: 2.1e-28 | % identity: 33.67
Query:  VVHEFYANIDEEEGFQVIARGVEVDWSPGAINA-----------------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRL
        +V EFYAN+ + E   +  RGV+V WS  AINA                             +   G +W +S     T   + L   A  W  F+K RL
Subjt:  VVHEFYANIDEEEGFQVIARGVEVDWSPGAINA-----------------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRL

Query:  LPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCKRAGVLENEGDVILFDKGIIDTPNLARL------QRTQE----
        LPTTH   VS++R+LL  ++L   +I+VG++I SEI  C  +K G LFFP  IT  C+ A  L NE    L + G ID   +AR+      + TQ+    
Subjt:  LPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCKRAGVLENEGDVILFDKGIIDTPNLARL------QRTQE----

Query:  -------ARQGGLVFGIHNILEQLALSASRQEFAERQSKTFWNYVKRRDANLKKALQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ
               +R  G V      LEQ     S+QE   +Q + FW Y K RD  LKKALQ +F++P P    FP  +L         E E E D++  ++
Subjt:  -------ARQGGLVFGIHNILEQLALSASRQEFAERQSKTFWNYVKRRDANLKKALQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) | e value: 5.9e-21 | % identity: 34.39
Query:  TEQVQEKQTENVQEEQAKVAPEKG------NEPGQEAQVEAGITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA------
        TE  + +   N+Q        EKG         GQ   +   IT H W+ FC+  E     +V EFYAN+ +     V  RGV+V WS  AINA      
Subjt:  TEQVQEKQTENVQEEQAKVAPEKG------NEPGQEAQVEAGITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA------

Query:  -----------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGC
                               + V G +W +S     T   + L   A  W  F+K  LLPTTH  TVS++R+LL  ++L   +I+VG++I SEI  C
Subjt:  -----------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGC

Query:  WRKKVGKLFFPITITMFCK--RAGVLENEGDVILFDKGIIDTPNLARLQRTQE
          +K G LFFP  IT  C+  RA  L NE    L + G ID   +AR+  TQE
Subjt:  WRKKVGKLFFPITITMFCK--RAGVLENEGDVILFDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 2.9e-28 | % identity: 31.52
Query:  TEQVQEKQTENVQEEQAKVAPEKG------NEPGQEAQVEAGITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA------
        TE    +   N+Q        EKG         GQ   +   IT H W+ FC+  E     +V EFYAN+ + E   V  RGV+V WS  AINA      
Subjt:  TEQVQEKQTENVQEEQAKVAPEKG------NEPGQEAQVEAGITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA------

Query:  -----------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGC
                               +   G +W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   +I+VG++I SEI  C
Subjt:  -----------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGC

Query:  WRKKVGKLFFPITITMFCK--RAGVLENEGDVILFDKGIIDTPNLARL------QRTQEA-----------RQGGLVFGIHNILEQLALSASRQEF----
          +K G LFFP  IT  C+  RA  L NE    L + G ID   +AR+      + TQ+            R  G +      LEQ       Q++    
Subjt:  WRKKVGKLFFPITITMFCK--RAGVLENEGDVILFDKGIIDTPNLARL------QRTQEA-----------RQGGLVFGIHNILEQLALSASRQEF----

Query:  ----AERQSKTFWNYVKRRDANLKKALQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ
              +Q + FW Y K RD  LKKALQ +F++P P    FP  +L         E E E D++  ++
Subjt:  ----AERQSKTFWNYVKRRDANLKKALQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ

A0A2P5CEY2 Uncharacterized protein | e value: 1.5e-19 | % identity: 34.71
Query:  EGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCK--RAGVL
        E +Q R    EK     A     A  W  F+K RLLPTTH  TVS++R+LL +++L   +I+VG++I SEI  C  +K G LFFP  IT  C+  RA  L
Subjt:  EGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCK--RAGVL

Query:  ENEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVFGIHNILEQLALSASRQEF--------AERQSKTFWNYVKRRDANLKKA
         NE    L   G ID   +AR+      + TQ+           +R  G +      LEQ       Q++          +Q + FW Y K RD  LKKA
Subjt:  ENEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVFGIHNILEQLALSASRQEF--------AERQSKTFWNYVKRRDANLKKA

Query:  LQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ
        LQ +F++P P    FP  LL         E E E D++  ++
Subjt:  LQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ

A0A2P5DXM3 Uncharacterized protein | e value: 1.0e-28 | % identity: 33.67
Query:  VVHEFYANIDEEEGFQVIARGVEVDWSPGAINA-----------------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRL
        +V EFYAN+ + E   +  RGV+V WS  AINA                             +   G +W +S     T   + L   A  W  F+K RL
Subjt:  VVHEFYANIDEEEGFQVIARGVEVDWSPGAINA-----------------------------LGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGFIKQRL

Query:  LPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCKRAGVLENEGDVILFDKGIIDTPNLARL------QRTQE----
        LPTTH   VS++R+LL  ++L   +I+VG++I SEI  C  +K G LFFP  IT  C+ A  L NE    L + G ID   +AR+      + TQ+    
Subjt:  LPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCKRAGVLENEGDVILFDKGIIDTPNLARL------QRTQE----

Query:  -------ARQGGLVFGIHNILEQLALSASRQEFAERQSKTFWNYVKRRDANLKKALQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ
               +R  G V      LEQ     S+QE   +Q + FW Y K RD  LKKALQ +F++P P    FP  +L         E E E D++  ++
Subjt:  -------ARQGGLVFGIHNILEQLALSASRQEFAERQSKTFWNYVKRRDANLKKALQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQ

A0A6A2ZUE4 Uncharacterized protein | e value: 4.5e-21 | % identity: 28.78
Query:  ITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA-----------------------------LGVEGVQWRLSKTEKRTFQ
        +T H W+ F      +NA +V EFY+NI E     V+ RG+ + ++P AIN                              L + G +W   + +++T  
Subjt:  ITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINA-----------------------------LGVEGVQWRLSKTEKRTFQ

Query:  SAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCKRAGVLENEGDVILFDKGIIDTPN
           L      W  F+K +L+PT+H+ TVS +R+LL  +IL    ID+GKIIV     C +++   L FP  IT  C++  V E   D IL     ++   
Subjt:  SAYLKKEANTWMGFIKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCKRAGVLENEGDVILFDKGIIDTPN

Query:  LARLQRTQEAR------------QGGLVFGIHNILEQLALSASRQEFAERQSK--TFWNYVKRRDANLKKALQESFSK
        +  L   +EA+                V      LEQ A+  + Q   +   K   ++ Y KRRDA L  AL ES  +
Subjt:  LARLQRTQEAR------------QGGLVFGIHNILEQLALSASRQEFAERQSK--TFWNYVKRRDANLKKALQESFSK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
ATMG00750.1 GAG/POL/ENV polyprotein | e value: 8.5e-04 | % identity: 66.67
Query:  RRGNLGPRDEMPLTYILEVELFDVWGI
        R+GN   R+EMP  +ILEVE+FDVWGI
Subjt:  RRGNLGPRDEMPLTYILEVELFDVWGI


Sequences
CDS sequence
ATGTTTGATGAGTTAAATCCAGGAATTGCACGTCCTCAAATTGAGGCAGCAAATTTTGAAATGAAACCGATAATGTTTCAGATGTTGCAAACCGTGGGTCAATTCCATGT
GGTTAGTCATCAGCAGCCGCCAGCTGTGGAGCCTGCTGCAGTGGTGAACCAAGTTGAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTATGAGTTTTGCCCCAA
AATCCAGCTTCTGGTTGGGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAGCAAAAGGTGAACCAGTCGGGATTTGCTACAGCGCAGGTA
TTGCCCATCAAAATAAGTAGGCTTTGCCCTAGCAAAATTCAGAGAGTTCTCTTGAGGATGATGAAAGAATATATGGCTCGTACAGATGCCACAATTCAAAGTAATCAAGC
TTCAATGAGAGCCCTTGAATTGCAAGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATCTGGTTCTGTTCCAGATGTAGAACCACCTTATGTGCCCCCCCACCTTATGT
ACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCATGATGGATTCGAATGACAAGCATTTGGAAAATCATGGAGAGACTACTATTGCTCCTGAGGATCAGGAA
AAAACCACTTTCACCTGTCCTTATGGGACGTTTGCTTTTAGGCGAATCCTTTTGGCCTTTGCAATGCTCCAGCAACATTTCAGCGGTGAATTCGACTTGGAGATAAATTA
CAAGAAAGGATCAGAAAATGTCATTGCAGATCATTTATCTCGTCTTGATCCATCATCATCTTTGTTGAAGCAATCTACCATTTTAGATTCCTTTCCAGATGAACACTTTT
TGTTGTTGAGGAGAGGAAACTTGGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTGGAAGTGGAATTATTCGATGTATGGGGTATTGATTTATGGGGCCATTTCCCC
CTTCTAATGGTAATGTTTTTATCTTATTGGCAGTTGTTTACGTGTCCAAGTGGGTGGAGGTCATTGCATGCCATCAGAATGATGCCAAGACTGTATCAAGGTTTCTTCAA
TCGCACATTTTGCGCGGTTTGGACACCTAGAGCTTTAGTAAATTGCATGGATGCTGCCACATGTCGGCAGGGTCGAGGAGTTGCTGTCGAGGCAGTTGAAGAAGGCAATG
CAAAGGAACCTGAAGGACAAAACCCAGAGCAGACTGAGCCGAGAGTTGCGGATACAGAGGAAGTTCAAGAAGAAAATACAGAGCAAGTTCAAGAAAAGCAGACTGAGAAT
GTGCAAGAAGAACAGGCAAAGGTTGCGCCTGAAAAAGGTAATGAGCCAGGACAGGAGGCTCAAGTGGAGGCTGGCATTACGAACCACGGTTGGGAGTTATTCTGTTCAAA
GTTTGAATCTATGAACGCGCAAGTAGTGCACGAATTTTATGCAAATATTGACGAGGAAGAAGGTTTCCAAGTGATCGCTCGAGGTGTAGAAGTCGACTGGAGTCCTGGTG
CTATTAACGCACTTGGCGTTGAAGGGGTGCAGTGGAGACTTTCAAAAACTGAGAAAAGAACATTTCAGTCAGCCTATCTTAAGAAGGAAGCAAATACATGGATGGGATTC
ATCAAGCAAAGGTTGCTTCCAACGACTCATGACTTGACGGTTTCTAGGGAACGTGTTCTACTAGCGTTTGCAATTTTAAGGTCTCTCAATATTGATGTGGGCAAGATTAT
TGTGAGTGAGATATCTGGATGCTGGAGGAAGAAAGTGGGGAAGTTATTTTTCCCGATTACAATTACCATGTTTTGCAAGCGAGCAGGGGTTCTAGAGAATGAAGGAGATG
TCATATTGTTTGACAAGGGAATCATTGATACGCCTAACTTGGCACGGCTTCAGCGTACGCAAGAGGCACGCCAGGGTGGGCTTGTTTTTGGCATTCACAACATTTTAGAA
CAACTTGCACTGTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAATCTAAGACTTTCTGGAACTATGTTAAACGTCGTGATGCCAACCTGAAGAAGGCACTGCAGGAAAG
TTTTTCCAAACCATTTCCAGCGCTTCTAGTGTTCCCTGATAATTTATTGAATCCCGGGATTTCGCCCACACCGATGGAAAGAGAAGAAGAGGATGATGAAAATGAGCAGG
ATCAGGAGGACTGA
mRNA sequence
ATGTTTGATGAGTTAAATCCAGGAATTGCACGTCCTCAAATTGAGGCAGCAAATTTTGAAATGAAACCGATAATGTTTCAGATGTTGCAAACCGTGGGTCAATTCCATGT
GGTTAGTCATCAGCAGCCGCCAGCTGTGGAGCCTGCTGCAGTGGTGAACCAAGTTGAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTATGAGTTTTGCCCCAA
AATCCAGCTTCTGGTTGGGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAGCAAAAGGTGAACCAGTCGGGATTTGCTACAGCGCAGGTA
TTGCCCATCAAAATAAGTAGGCTTTGCCCTAGCAAAATTCAGAGAGTTCTCTTGAGGATGATGAAAGAATATATGGCTCGTACAGATGCCACAATTCAAAGTAATCAAGC
TTCAATGAGAGCCCTTGAATTGCAAGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATCTGGTTCTGTTCCAGATGTAGAACCACCTTATGTGCCCCCCCACCTTATGT
ACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCATGATGGATTCGAATGACAAGCATTTGGAAAATCATGGAGAGACTACTATTGCTCCTGAGGATCAGGAA
AAAACCACTTTCACCTGTCCTTATGGGACGTTTGCTTTTAGGCGAATCCTTTTGGCCTTTGCAATGCTCCAGCAACATTTCAGCGGTGAATTCGACTTGGAGATAAATTA
CAAGAAAGGATCAGAAAATGTCATTGCAGATCATTTATCTCGTCTTGATCCATCATCATCTTTGTTGAAGCAATCTACCATTTTAGATTCCTTTCCAGATGAACACTTTT
TGTTGTTGAGGAGAGGAAACTTGGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTGGAAGTGGAATTATTCGATGTATGGGGTATTGATTTATGGGGCCATTTCCCC
CTTCTAATGGTAATGTTTTTATCTTATTGGCAGTTGTTTACGTGTCCAAGTGGGTGGAGGTCATTGCATGCCATCAGAATGATGCCAAGACTGTATCAAGGTTTCTTCAA
TCGCACATTTTGCGCGGTTTGGACACCTAGAGCTTTAGTAAATTGCATGGATGCTGCCACATGTCGGCAGGGTCGAGGAGTTGCTGTCGAGGCAGTTGAAGAAGGCAATG
CAAAGGAACCTGAAGGACAAAACCCAGAGCAGACTGAGCCGAGAGTTGCGGATACAGAGGAAGTTCAAGAAGAAAATACAGAGCAAGTTCAAGAAAAGCAGACTGAGAAT
GTGCAAGAAGAACAGGCAAAGGTTGCGCCTGAAAAAGGTAATGAGCCAGGACAGGAGGCTCAAGTGGAGGCTGGCATTACGAACCACGGTTGGGAGTTATTCTGTTCAAA
GTTTGAATCTATGAACGCGCAAGTAGTGCACGAATTTTATGCAAATATTGACGAGGAAGAAGGTTTCCAAGTGATCGCTCGAGGTGTAGAAGTCGACTGGAGTCCTGGTG
CTATTAACGCACTTGGCGTTGAAGGGGTGCAGTGGAGACTTTCAAAAACTGAGAAAAGAACATTTCAGTCAGCCTATCTTAAGAAGGAAGCAAATACATGGATGGGATTC
ATCAAGCAAAGGTTGCTTCCAACGACTCATGACTTGACGGTTTCTAGGGAACGTGTTCTACTAGCGTTTGCAATTTTAAGGTCTCTCAATATTGATGTGGGCAAGATTAT
TGTGAGTGAGATATCTGGATGCTGGAGGAAGAAAGTGGGGAAGTTATTTTTCCCGATTACAATTACCATGTTTTGCAAGCGAGCAGGGGTTCTAGAGAATGAAGGAGATG
TCATATTGTTTGACAAGGGAATCATTGATACGCCTAACTTGGCACGGCTTCAGCGTACGCAAGAGGCACGCCAGGGTGGGCTTGTTTTTGGCATTCACAACATTTTAGAA
CAACTTGCACTGTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAATCTAAGACTTTCTGGAACTATGTTAAACGTCGTGATGCCAACCTGAAGAAGGCACTGCAGGAAAG
TTTTTCCAAACCATTTCCAGCGCTTCTAGTGTTCCCTGATAATTTATTGAATCCCGGGATTTCGCCCACACCGATGGAAAGAGAAGAAGAGGATGATGAAAATGAGCAGG
ATCAGGAGGACTGA
Protein sequence
MFDELNPGIARPQIEAANFEMKPIMFQMLQTVGQFHVVSHQQPPAVEPAAVVNQVERKHVSIVVKITTMSFAPKSSFWLGNHPNFSWGGQGSNVQAQQKVNQSGFATAQV
LPIKISRLCPSKIQRVLLRMMKEYMARTDATIQSNQASMRALELQGAGGSNNDAGASGSVPDVEPPYVPPHLMYHLYLFHKGKGLRIMMDSNDKHLENHGETTIAPEDQE
KTTFTCPYGTFAFRRILLAFAMLQQHFSGEFDLEINYKKGSENVIADHLSRLDPSSSLLKQSTILDSFPDEHFLLLRRGNLGPRDEMPLTYILEVELFDVWGIDLWGHFP
LLMVMFLSYWQLFTCPSGWRSLHAIRMMPRLYQGFFNRTFCAVWTPRALVNCMDAATCRQGRGVAVEAVEEGNAKEPEGQNPEQTEPRVADTEEVQEENTEQVQEKQTEN
VQEEQAKVAPEKGNEPGQEAQVEAGITNHGWELFCSKFESMNAQVVHEFYANIDEEEGFQVIARGVEVDWSPGAINALGVEGVQWRLSKTEKRTFQSAYLKKEANTWMGF
IKQRLLPTTHDLTVSRERVLLAFAILRSLNIDVGKIIVSEISGCWRKKVGKLFFPITITMFCKRAGVLENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVFGIHNILE
QLALSASRQEFAERQSKTFWNYVKRRDANLKKALQESFSKPFPALLVFPDNLLNPGISPTPMEREEEDDENEQDQED
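
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling. The short test strings are the first 21 and last 9 nucleotides of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "*") -> str:
    """Translate a CDS in frame 0; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks from the page layout
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last three codons of the CDS shown on this page.
print(translate("ATGTTTGATGAGTTAAATCCA"))
print(translate("GAGGACTGA"))
```

Passing the full CDS, line breaks included, and stripping the trailing stop symbol gives a sequence that can be compared directly against the listed protein sequence.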