; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004412 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004412
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold9:9559821..9571177
RNA-Seq ExpressionSpg004412
SyntenySpg004412
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]8.4e-1824.84Show/hide
Query:  FINHFARAKYIDMLKRDFLFKRGFSGDLPHFLRVG------ITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQ--
        F++  A+  Y  +  R   F+ GF         +G      +T H W+ F   P  VN+ ++ EFY NI E     V+VRG+++ ++P AIN  + LQ  
Subjt:  FINHFARAKYIDMLKRDFLFKRGFSGDLPHFLRVG------ITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQ--

Query:  -----DFLHAGYNKEQRGVV-----------NEQL-----------------NAAIRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCW
              F     ++  +G++            +QL                 N  ++ +L+PT+H++T+S +++LL  +IL   +ID+GK+I      C 
Subjt:  -----DFLHAGYNKEQRGVV-----------NEQL-----------------NAAIRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCW

Query:  RKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRLQRTQEARQGGLDCGIHQILE--QLTLSANRQEFAERQAH-----------TYWNY
        +++   L F N ++ LC++  V E     IL     ++ + +  L   +EA+    +    ++     +  S+   E A ++ H            Y+ Y
Subjt:  RKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRLQRTQEARQGGLDCGIHQILE--QLTLSANRQEFAERQAH-----------TYWNY

Query:  VKRRDATLRKALQENFSK
         KRRDA L  AL E+  +
Subjt:  VKRRDATLRKALQENFSK

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]2.3e-1522.56Show/hide
Query:  LPYERFINHFARAKYIDMLKRDFLFKRGF-------SGDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSL
        + +++F N  A+A++ +   R+  F+ GF        G  P  + + +    W  F   P  VN+ ++ EFY NI +     + VRG  + ++  AIN  
Subjt:  LPYERFINHFARAKYIDMLKRDFLFKRGF-------SGDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSL

Query:  YNLQDFL--HAGYNKE-------------------------QRGVVNEQ--------LNAAIRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAG
        ++LQ+ +  HA + +E                          R  VN +         N  ++ +L+PT+H++T+S  ++LL  +++ S  ID+G++I  
Subjt:  YNLQDFL--HAGYNKE-------------------------QRGVVNEQ--------LNAAIRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAG

Query:  EIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRL-------------QRTQEARQGGLDCGIHQILEQLTLSANRQEFAERQA
        ++  C  KK   L F N ++ LC++  V E+A   IL     I    L  L             +++    +   +  +  + E +T +  +        
Subjt:  EIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRL-------------QRTQEARQGGLDCGIHQILEQLTLSANRQEFAERQA

Query:  HTYWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED
          ++ YVK RD  +    QE      +  P F D++L  +      E E +  D    D
Subjt:  HTYWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED

KAF4375842.1 hypothetical protein G4B88_026421 [Cannabis sativa]8.4e-1825.27Show/hide
Query:  GKGIAGAEVE----AKVAEPEERRLPYERFINHFARAKYIDMLK-RDFLFKRGF------SGDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVN-I
        GK I G++ +      ++ P +     +   +   + KY++ ++ ++F  KRG        G +P +L   I    W   C  P     QV+ EFY N +
Subjt:  GKGIAGAEVE----AKVAEPEERRLPYERFINHFARAKYIDMLK-RDFLFKRGF------SGDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVN-I

Query:  DEEKGFQVIVRGVAVDWSPGAINSLYNLQDFLHAGYNKEQRGVVNEQLNAA--------------------IRERLLPTTHDSTISGEKVLLAFAILRSL
          E   ++ VR V V +S   IN  Y L++     ++K+   +++E ++                      ++  LLPT+HDST+S E++ + + I++  
Subjt:  DEEKGFQVIVRGVAVDWSPGAINSLYNLQDFLHAGYNKEQRGVVNEQLNAA--------------------IRERLLPTTHDSTISGEKVLLAFAILRSL

Query:  SIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRLQRTQEARQGGLDCGIHQILEQLTL-SANRQEFAERQAHT
         I++GKVIA EIF C  +  GKLFF   ++  C+ A VP   +   +  KG++        +RT  +           + E+L+   A +Q+  ER   T
Subjt:  SIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRLQRTQEARQGGLDCGIHQILEQLTL-SANRQEFAERQAHT

Query:  YWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQEDYENYHSCFYFD
        +WNY + RD  + + L+ N+   Y+                  VE E + D   + +   YH   Y D
Subjt:  YWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQEDYENYHSCFYFD

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.4e-1731.96Show/hide
Query:  GDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQD-------FLH------------------AGYN
        G LP   +V IT H W+ FC+ PE     ++ EFY N+ +     V VRGV V WS  AIN+++ L D       F+                   A +N
Subjt:  GDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQD-------FLH------------------AGYN

Query:  KEQRGV---VNEQLNAA-------IRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVIL
           +G    +   L  A       ++  LLPTTH  T+S +++LL  ++L   SI++G++I  EI  C  +K G LFF + ++ LC+ A  P       L
Subjt:  KEQRGV---VNEQLNAA-------IRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVIL

Query:  LDKGIIDTSNLSRLQRTQE
         + G ID   ++R+  TQE
Subjt:  LDKGIIDTSNLSRLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.4e-2529.34Show/hide
Query:  GDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQD-------FLH------------------AGYN
        G LP   +V IT H W+ FC+ PE     ++ EFY N+ + +   V VRGV V WS  AIN+++ L D       F+                   A +N
Subjt:  GDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQD-------FLH------------------AGYN

Query:  KEQRGV---VNEQLNAA-------IRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVIL
           +G    +   L  A       ++ RLLPTTH  T+S +++LL  ++L   SI++G++I  EI  C  +K G LFF + ++ LC+ A  P       L
Subjt:  KEQRGV---VNEQLNAA-------IRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVIL

Query:  LDKGIIDTSNLSRL--------------QRTQEARQGGLDCGIHQILEQLTLSANRQEFAE-----------RQAHTYWNYVKRRDATLRKALQENFSKP
         + G ID   ++R+               R   A     +  I Q L+ L    ++QE  +           +Q   +W Y K RD  L+KALQ NF++P
Subjt:  LDKGIIDTSNLSRL--------------QRTQEARQGGLDCGIHQILEQLTLSANRQEFAE-----------RQAHTYWNYVKRRDATLRKALQENFSKP

Query:  YQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED
            P F  ++L         E E E D  G  +
Subjt:  YQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)6.9e-2629.34Show/hide
Query:  GDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQD-------FLH------------------AGYN
        G LP   +V IT H W+ FC+ PE     ++ EFY N+ + +   V VRGV V WS  AIN+++ L D       F+                   A +N
Subjt:  GDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQD-------FLH------------------AGYN

Query:  KEQRGV---VNEQLNAA-------IRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVIL
           +G    +   L  A       ++ RLLPTTH  T+S +++LL  ++L   SI++G++I  EI  C  +K G LFF + ++ LC+ A  P       L
Subjt:  KEQRGV---VNEQLNAA-------IRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVIL

Query:  LDKGIIDTSNLSRL--------------QRTQEARQGGLDCGIHQILEQLTLSANRQEFAE-----------RQAHTYWNYVKRRDATLRKALQENFSKP
         + G ID   ++R+               R   A     +  I Q L+ L    ++QE  +           +Q   +W Y K RD  L+KALQ NF++P
Subjt:  LDKGIIDTSNLSRL--------------QRTQEARQGGLDCGIHQILEQLTLSANRQEFAE-----------RQAHTYWNYVKRRDATLRKALQENFSKP

Query:  YQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED
            P F  ++L         E E E D  G  +
Subjt:  YQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED

A0A2P5DXM3 Uncharacterized protein2.5e-1530.65Show/hide
Query:  IRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRL----------
        ++ RLLPTTH   +S +++LL  ++L   SI++G++I  EI  C  +K G LFF + ++ LC+ A  P       L + G ID   ++R+          
Subjt:  IRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRL----------

Query:  ----QRTQEARQGGLDCGIHQILEQLTLSANRQEFAERQAHTYWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED
             R   A     +  + Q L+ L    ++QE   +Q   +W Y K RD  L+KALQ NF++P    P F  ++L         E E E D  G  +
Subjt:  ----QRTQEARQGGLDCGIHQILEQLTLSANRQEFAERQAHTYWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED

A0A6A2ZUE4 Uncharacterized protein4.1e-1824.84Show/hide
Query:  FINHFARAKYIDMLKRDFLFKRGFSGDLPHFLRVG------ITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQ--
        F++  A+  Y  +  R   F+ GF         +G      +T H W+ F   P  VN+ ++ EFY NI E     V+VRG+++ ++P AIN  + LQ  
Subjt:  FINHFARAKYIDMLKRDFLFKRGFSGDLPHFLRVG------ITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQ--

Query:  -----DFLHAGYNKEQRGVV-----------NEQL-----------------NAAIRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCW
              F     ++  +G++            +QL                 N  ++ +L+PT+H++T+S +++LL  +IL   +ID+GK+I      C 
Subjt:  -----DFLHAGYNKEQRGVV-----------NEQL-----------------NAAIRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCW

Query:  RKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRLQRTQEARQGGLDCGIHQILE--QLTLSANRQEFAERQAH-----------TYWNY
        +++   L F N ++ LC++  V E     IL     ++ + +  L   +EA+    +    ++     +  S+   E A ++ H            Y+ Y
Subjt:  RKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRLQRTQEARQGGLDCGIHQILE--QLTLSANRQEFAERQAH-----------TYWNY

Query:  VKRRDATLRKALQENFSK
         KRRDA L  AL E+  +
Subjt:  VKRRDATLRKALQENFSK

A0A6A3BU96 Uncharacterized protein1.1e-1522.56Show/hide
Query:  LPYERFINHFARAKYIDMLKRDFLFKRGF-------SGDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSL
        + +++F N  A+A++ +   R+  F+ GF        G  P  + + +    W  F   P  VN+ ++ EFY NI +     + VRG  + ++  AIN  
Subjt:  LPYERFINHFARAKYIDMLKRDFLFKRGF-------SGDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSL

Query:  YNLQDFL--HAGYNKE-------------------------QRGVVNEQ--------LNAAIRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAG
        ++LQ+ +  HA + +E                          R  VN +         N  ++ +L+PT+H++T+S  ++LL  +++ S  ID+G++I  
Subjt:  YNLQDFL--HAGYNKE-------------------------QRGVVNEQ--------LNAAIRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAG

Query:  EIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRL-------------QRTQEARQGGLDCGIHQILEQLTLSANRQEFAERQA
        ++  C  KK   L F N ++ LC++  V E+A   IL     I    L  L             +++    +   +  +  + E +T +  +        
Subjt:  EIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRL-------------QRTQEARQGGLDCGIHQILEQLTLSANRQEFAERQA

Query:  HTYWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED
          ++ YVK RD  +    QE      +  P F D++L  +      E E +  D    D
Subjt:  HTYWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQED

A0A7J6FZ22 Uncharacterized protein4.1e-1825.27Show/hide
Query:  GKGIAGAEVE----AKVAEPEERRLPYERFINHFARAKYIDMLK-RDFLFKRGF------SGDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVN-I
        GK I G++ +      ++ P +     +   +   + KY++ ++ ++F  KRG        G +P +L   I    W   C  P     QV+ EFY N +
Subjt:  GKGIAGAEVE----AKVAEPEERRLPYERFINHFARAKYIDMLK-RDFLFKRGF------SGDLPHFLRVGITNHYWELFCSKPEHVNSQVMHEFYVN-I

Query:  DEEKGFQVIVRGVAVDWSPGAINSLYNLQDFLHAGYNKEQRGVVNEQLNAA--------------------IRERLLPTTHDSTISGEKVLLAFAILRSL
          E   ++ VR V V +S   IN  Y L++     ++K+   +++E ++                      ++  LLPT+HDST+S E++ + + I++  
Subjt:  DEEKGFQVIVRGVAVDWSPGAINSLYNLQDFLHAGYNKEQRGVVNEQLNAA--------------------IRERLLPTTHDSTISGEKVLLAFAILRSL

Query:  SIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRLQRTQEARQGGLDCGIHQILEQLTL-SANRQEFAERQAHT
         I++GKVIA EIF C  +  GKLFF   ++  C+ A VP   +   +  KG++        +RT  +           + E+L+   A +Q+  ER   T
Subjt:  SIDMGKVIAGEIFGCWRKKVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRLQRTQEARQGGLDCGIHQILEQLTL-SANRQEFAERQAHT

Query:  YWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQEDYENYHSCFYFD
        +WNY + RD  + + L+ N+   Y+                  VE E + D   + +   YH   Y D
Subjt:  YWNYVKRRDATLRKALQENFSKPYQALPVFLDDLLNPWIPPPPVEREEEDDDAGQEDYENYHSCFYFD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCTCAGTAAAACGCGTGGAGCATCTGGCGCTTCCACCTGTCGCGCAGGGGAGATCCAACGGCCTAGCCGTTGGACAGCACATCAGCATTTACCACAGAAAGGGAA
ACAGGTCATCGAGGACATTGCAAAAGAGGTAGTCACTGAGGAAGAACCGAAAGATCCCGAAAAAGAAAAAGATCCAAAAAAGACTGTTGAAACAGAGCAAGAGATACAAG
AAAAGGCAGCAAAGGATATGCAAAGACAAGGTAATCATGAAACAGAAGTTCAAGAGGTTCAAGAGGTAGTGCAGAATCCGCCGTGTCGTCGTCGCCGTAAGCAGAAAACG
GGACGGATTAAAAGGATTCGGACCGACACTCCATTTCCACCCACAACTGAATCGGAAAAAGAGGAACCTGAAAATCAAAAAGAGGCCGAGAAGGAAAATAAGGGCAAAGG
TATTGCAGGAGCAGAAGTTGAAGCAAAAGTGGCAGAACCAGAGGAGAGGAGGTTGCCATATGAGCGCTTTATCAATCATTTTGCCAGGGCAAAATATATAGATATGCTGA
AGAGAGACTTTTTGTTCAAGAGAGGGTTTAGTGGTGACCTTCCACACTTTTTGAGGGTCGGCATCACAAACCATTACTGGGAGTTGTTCTGTTCTAAACCAGAACATGTG
AACTCACAAGTGATGCACGAGTTCTACGTAAATATCGACGAGGAAAAAGGTTTTCAAGTGATTGTTCGTGGAGTAGCTGTCGACTGGAGCCCCGGAGCCATTAATTCTCT
GTACAATCTCCAAGACTTCCTGCATGCTGGTTACAACAAGGAGCAACGTGGAGTAGTTAACGAGCAGTTAAATGCTGCTATTCGAGAAAGGCTGCTTCCAACAACTCACG
ACTCCACAATCTCTGGGGAAAAAGTTCTTTTGGCTTTTGCGATTTTAAGGTCTCTTAGCATCGACATGGGTAAGGTGATCGCTGGTGAAATTTTTGGATGTTGGCGTAAG
AAGGTTGGGAAATTGTTCTTTCTGAATACAATGTCAATGCTCTGTAAGAGAGCGGGGGTTCCAGAAAGCGCAGAACATGTGATCCTATTGGACAAGGGAATAATTGACAC
GTCTAACTTGTCACGACTTCAACGGACACAGGAAGCACGTCAAGGAGGGCTAGACTGTGGCATTCACCAAATCTTAGAACAACTCACACTTTCAGCCAACAGGCAAGAGT
TCGCTGAGAGGCAGGCTCATACCTACTGGAATTATGTTAAAAGACGTGATGCTACATTGAGGAAAGCACTGCAAGAAAACTTTTCGAAGCCTTATCAAGCCCTTCCTGTA
TTCCTTGATGATTTACTGAACCCTTGGATCCCACCACCACCCGTTGAGAGAGAAGAGGAGGATGATGATGCTGGTCAGGAGGACTATGAAAATTACCATTCGTGCTTTTA
TTTTGATTGTTCATTTTTGTTGCTCGCTCAATATCCTGACCATAATGCTCAGGGTGATTGCTCCCAGGGCGACTCGCAGGCAGATCATGAGAATCTCACGGTGCAACCTA
ATGGAGGAGACCGTTTCGGGTTTGTTCCCAGCGTCGAGACACTAGCCTTTGGGCGTCTCGACGCTGGCATTCCATATCTGAAAAGGCGGCAACGGGTTACAGCATCGCGA
CGCTCTCGGTTTTCCATTCCAGAATCCGCAATATCACGACAGCGTCGCGACGCTGCCCCTATAGCGTCGCGACGCTGTGCCGAATTTTCAGACTATATATATTATGCGAT
TAGGGATTTTGGGGGACTTCTTTTGGATGATTTTGGGGCCGAAAACTCGACTAGAAGGCTGCTGTGGAGGCTGAAGCAAGTGGAGAAAAGTGGATTTCAAGGCTTTGTTC
GTGGAGATCGTGACGGGGACGTGCGGTCTCGGCCTACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCCTCAGTAAAACGCGTGGAGCATCTGGCGCTTCCACCTGTCGCGCAGGGGAGATCCAACGGCCTAGCCGTTGGACAGCACATCAGCATTTACCACAGAAAGGGAA
ACAGGTCATCGAGGACATTGCAAAAGAGGTAGTCACTGAGGAAGAACCGAAAGATCCCGAAAAAGAAAAAGATCCAAAAAAGACTGTTGAAACAGAGCAAGAGATACAAG
AAAAGGCAGCAAAGGATATGCAAAGACAAGGTAATCATGAAACAGAAGTTCAAGAGGTTCAAGAGGTAGTGCAGAATCCGCCGTGTCGTCGTCGCCGTAAGCAGAAAACG
GGACGGATTAAAAGGATTCGGACCGACACTCCATTTCCACCCACAACTGAATCGGAAAAAGAGGAACCTGAAAATCAAAAAGAGGCCGAGAAGGAAAATAAGGGCAAAGG
TATTGCAGGAGCAGAAGTTGAAGCAAAAGTGGCAGAACCAGAGGAGAGGAGGTTGCCATATGAGCGCTTTATCAATCATTTTGCCAGGGCAAAATATATAGATATGCTGA
AGAGAGACTTTTTGTTCAAGAGAGGGTTTAGTGGTGACCTTCCACACTTTTTGAGGGTCGGCATCACAAACCATTACTGGGAGTTGTTCTGTTCTAAACCAGAACATGTG
AACTCACAAGTGATGCACGAGTTCTACGTAAATATCGACGAGGAAAAAGGTTTTCAAGTGATTGTTCGTGGAGTAGCTGTCGACTGGAGCCCCGGAGCCATTAATTCTCT
GTACAATCTCCAAGACTTCCTGCATGCTGGTTACAACAAGGAGCAACGTGGAGTAGTTAACGAGCAGTTAAATGCTGCTATTCGAGAAAGGCTGCTTCCAACAACTCACG
ACTCCACAATCTCTGGGGAAAAAGTTCTTTTGGCTTTTGCGATTTTAAGGTCTCTTAGCATCGACATGGGTAAGGTGATCGCTGGTGAAATTTTTGGATGTTGGCGTAAG
AAGGTTGGGAAATTGTTCTTTCTGAATACAATGTCAATGCTCTGTAAGAGAGCGGGGGTTCCAGAAAGCGCAGAACATGTGATCCTATTGGACAAGGGAATAATTGACAC
GTCTAACTTGTCACGACTTCAACGGACACAGGAAGCACGTCAAGGAGGGCTAGACTGTGGCATTCACCAAATCTTAGAACAACTCACACTTTCAGCCAACAGGCAAGAGT
TCGCTGAGAGGCAGGCTCATACCTACTGGAATTATGTTAAAAGACGTGATGCTACATTGAGGAAAGCACTGCAAGAAAACTTTTCGAAGCCTTATCAAGCCCTTCCTGTA
TTCCTTGATGATTTACTGAACCCTTGGATCCCACCACCACCCGTTGAGAGAGAAGAGGAGGATGATGATGCTGGTCAGGAGGACTATGAAAATTACCATTCGTGCTTTTA
TTTTGATTGTTCATTTTTGTTGCTCGCTCAATATCCTGACCATAATGCTCAGGGTGATTGCTCCCAGGGCGACTCGCAGGCAGATCATGAGAATCTCACGGTGCAACCTA
ATGGAGGAGACCGTTTCGGGTTTGTTCCCAGCGTCGAGACACTAGCCTTTGGGCGTCTCGACGCTGGCATTCCATATCTGAAAAGGCGGCAACGGGTTACAGCATCGCGA
CGCTCTCGGTTTTCCATTCCAGAATCCGCAATATCACGACAGCGTCGCGACGCTGCCCCTATAGCGTCGCGACGCTGTGCCGAATTTTCAGACTATATATATTATGCGAT
TAGGGATTTTGGGGGACTTCTTTTGGATGATTTTGGGGCCGAAAACTCGACTAGAAGGCTGCTGTGGAGGCTGAAGCAAGTGGAGAAAAGTGGATTTCAAGGCTTTGTTC
GTGGAGATCGTGACGGGGACGTGCGGTCTCGGCCTACTTAG
Protein sequenceShow/hide protein sequence
MLLSKTRGASGASTCRAGEIQRPSRWTAHQHLPQKGKQVIEDIAKEVVTEEEPKDPEKEKDPKKTVETEQEIQEKAAKDMQRQGNHETEVQEVQEVVQNPPCRRRRKQKT
GRIKRIRTDTPFPPTTESEKEEPENQKEAEKENKGKGIAGAEVEAKVAEPEERRLPYERFINHFARAKYIDMLKRDFLFKRGFSGDLPHFLRVGITNHYWELFCSKPEHV
NSQVMHEFYVNIDEEKGFQVIVRGVAVDWSPGAINSLYNLQDFLHAGYNKEQRGVVNEQLNAAIRERLLPTTHDSTISGEKVLLAFAILRSLSIDMGKVIAGEIFGCWRK
KVGKLFFLNTMSMLCKRAGVPESAEHVILLDKGIIDTSNLSRLQRTQEARQGGLDCGIHQILEQLTLSANRQEFAERQAHTYWNYVKRRDATLRKALQENFSKPYQALPV
FLDDLLNPWIPPPPVEREEEDDDAGQEDYENYHSCFYFDCSFLLLAQYPDHNAQGDCSQGDSQADHENLTVQPNGGDRFGFVPSVETLAFGRLDAGIPYLKRRQRVTASR
RSRFSIPESAISRQRRDAAPIASRRCAEFSDYIYYAIRDFGGLLLDDFGAENSTRRLLWRLKQVEKSGFQGFVRGDRDGDVRSRPT