; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021563 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021563
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold9:8154855..8167003
RNA-Seq ExpressionSpg021563
SyntenySpg021563
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4375842.1 hypothetical protein G4B88_026421 [Cannabis sativa]1.3e-1828.73Show/hide
Query:  RDFLFERGF--GDD----LPRFLRTGIANHGWSQFCAKLEPVNSNIVREFYANIDDQEGFQLNAAVREVGI-------------EGAQWRLSKTEKR---
        ++F  +RG   GD+    +P +L   I    W Q C         +V+EFYAN    E       VREV +             +   +  SK   +   
Subjt:  RDFLFERGF--GDD----LPRFLRTGIANHGWSQFCAKLEPVNSNIVREFYANIDDQEGFQLNAAVREVGI-------------EGAQWRLSKTEKR---

Query:  ---------TFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDV
                  F+   LK +      F++  LLP +HDSTVSR+ + +++ I++   INVGK+I+ EI++C  +  GKLF    IT  C+   VPM  D+ 
Subjt:  ---------TFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDV

Query:  TLMDKGIID-TPNLARLQRTQEAHQGSLVCDIHQMEEQLQMHSSRMEFAERKFQTFWNYVKRRDAALRGALQSNF
         +  KG++   P+ A  + T      S   D   M+E+L  H +  +    + QT WNY + RD  +   L+ N+
Subjt:  TLMDKGIID-TPNLARLQRTQEAHQGSLVCDIHQMEEQLQMHSSRMEFAERKFQTFWNYVKRRDAALRGALQSNF

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.5e-1933.18Show/hide
Query:  FLRTGIANHGWSQFCAKLEPVNSNIVREFYANIDDQ-------EGFQLN--------------------------------AAVREVGIEGAQWRLSKTE
        F+   I  H W QFCA  E     +VREFYAN+ D         G Q++                                  +  V + GA+W +S   
Subjt:  FLRTGIANHGWSQFCAKLEPVNSNIVREFYANIDDQ-------EGFQLN--------------------------------AAVREVGIEGAQWRLSKTE

Query:  KRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGI
          T   + L   A  W  F+K  LLP TH  TVS+D +LL+  +L   SINVG++I +EI  C  +K G LF P+ IT LC+  R P   ++  L + G 
Subjt:  KRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGI

Query:  IDTPNLARLQRTQE
        ID   +AR+  TQE
Subjt:  IDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.0e-2831.28Show/hide
Query:  EKQNTEREEQEKKEAEDKAREEIENKMEEEILPKQREDKGKEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQFCAKLEPVNSNIVREFYANI
        E+ +T R     K    KA + +  K E E    + E+    +  R    E+GF  D       LP F+   I  H W QFCA  E     +VREFYAN+
Subjt:  EKQNTEREEQEKKEAEDKAREEIENKMEEEILPKQREDKGKEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQFCAKLEPVNSNIVREFYANI

Query:  DDQE-------GFQLN--------------------------------AAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTV
         D E       G Q++                                  +  V   GA+W +S     T   + L   A  W  F+K RLLP TH  TV
Subjt:  DDQE-------GFQLN--------------------------------AAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTV

Query:  SRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLARLQR---TQEAHQ-------------
        S+D +LL+  +L   SINVG++I +EI  C  +K G LF P+ IT LC+  R P   ++  L + G ID   +AR+ +   T+   Q             
Subjt:  SRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLARLQR---TQEAHQ-------------

Query:  --GSLVCDIHQMEEQL------QMH-SSRMEFAERKFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDLL
          G ++  +  +E++L      Q H  S ++   ++ Q FW Y K RD AL+ ALQ+NF++P PTFP FP ++L
Subjt:  --GSLVCDIHQMEEQL------QMH-SSRMEFAERKFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]5.8e-2437.81Show/hide
Query:  LKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLAR
        L   A  W  F+K RLLP TH  TVS+D +LL++ +L   SINVG++I +EI  C  +K G LF P+ IT LC+  R P   ++  L   G ID   +AR
Subjt:  LKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLAR

Query:  LQRTQEAH--------------------QGSLVCDIHQMEEQL------QMH-SSRMEFAERKFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDL
        +  TQE                       G ++  +  +E++L      Q H  S ++   ++ Q FW Y K RD AL+ ALQ+NF++P PTFP FP +L
Subjt:  LQRTQEAH--------------------QGSLVCDIHQMEEQL------QMH-SSRMEFAERKFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDL

Query:  L
        L
Subjt:  L

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.3e-2635.56Show/hide
Query:  EPVNSNIVREFYANIDDQEGFQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIIS
        +PV+ +   EF  NI + E   L   +  V   GA+W +S     T   + L   A  W  F+K RLLP TH   VS+D +LL+  +L   SINVG++I 
Subjt:  EPVNSNIVREFYANIDDQEGFQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIIS

Query:  NEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLARL--------------QRTQEAHQGSLVCDIHQMEEQLQMHSSRMEFAER
        +EI  C  +K G LF P+ IT LC+    P   ++  L + G ID   +AR+               R   A       D+ Q  + L+   S+ E   +
Subjt:  NEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLARL--------------QRTQEAHQGSLVCDIHQMEEQLQMHSSRMEFAER

Query:  KFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDLL
        + Q FW Y K RD AL+ ALQ+NF++P PTFP FP ++L
Subjt:  KFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDLL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.2e-1933.18Show/hide
Query:  FLRTGIANHGWSQFCAKLEPVNSNIVREFYANIDDQ-------EGFQLN--------------------------------AAVREVGIEGAQWRLSKTE
        F+   I  H W QFCA  E     +VREFYAN+ D         G Q++                                  +  V + GA+W +S   
Subjt:  FLRTGIANHGWSQFCAKLEPVNSNIVREFYANIDDQ-------EGFQLN--------------------------------AAVREVGIEGAQWRLSKTE

Query:  KRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGI
          T   + L   A  W  F+K  LLP TH  TVS+D +LL+  +L   SINVG++I +EI  C  +K G LF P+ IT LC+  R P   ++  L + G 
Subjt:  KRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGI

Query:  IDTPNLARLQRTQE
        ID   +AR+  TQE
Subjt:  IDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)5.0e-2931.28Show/hide
Query:  EKQNTEREEQEKKEAEDKAREEIENKMEEEILPKQREDKGKEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQFCAKLEPVNSNIVREFYANI
        E+ +T R     K    KA + +  K E E    + E+    +  R    E+GF  D       LP F+   I  H W QFCA  E     +VREFYAN+
Subjt:  EKQNTEREEQEKKEAEDKAREEIENKMEEEILPKQREDKGKEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQFCAKLEPVNSNIVREFYANI

Query:  DDQE-------GFQLN--------------------------------AAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTV
         D E       G Q++                                  +  V   GA+W +S     T   + L   A  W  F+K RLLP TH  TV
Subjt:  DDQE-------GFQLN--------------------------------AAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTV

Query:  SRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLARLQR---TQEAHQ-------------
        S+D +LL+  +L   SINVG++I +EI  C  +K G LF P+ IT LC+  R P   ++  L + G ID   +AR+ +   T+   Q             
Subjt:  SRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLARLQR---TQEAHQ-------------

Query:  --GSLVCDIHQMEEQL------QMH-SSRMEFAERKFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDLL
          G ++  +  +E++L      Q H  S ++   ++ Q FW Y K RD AL+ ALQ+NF++P PTFP FP ++L
Subjt:  --GSLVCDIHQMEEQL------QMH-SSRMEFAERKFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDLL

A0A2P5CEY2 Uncharacterized protein2.8e-2437.81Show/hide
Query:  LKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLAR
        L   A  W  F+K RLLP TH  TVS+D +LL++ +L   SINVG++I +EI  C  +K G LF P+ IT LC+  R P   ++  L   G ID   +AR
Subjt:  LKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLAR

Query:  LQRTQEAH--------------------QGSLVCDIHQMEEQL------QMH-SSRMEFAERKFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDL
        +  TQE                       G ++  +  +E++L      Q H  S ++   ++ Q FW Y K RD AL+ ALQ+NF++P PTFP FP +L
Subjt:  LQRTQEAH--------------------QGSLVCDIHQMEEQL------QMH-SSRMEFAERKFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDL

Query:  L
        L
Subjt:  L

A0A2P5DXM3 Uncharacterized protein6.1e-2735.56Show/hide
Query:  EPVNSNIVREFYANIDDQEGFQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIIS
        +PV+ +   EF  NI + E   L   +  V   GA+W +S     T   + L   A  W  F+K RLLP TH   VS+D +LL+  +L   SINVG++I 
Subjt:  EPVNSNIVREFYANIDDQEGFQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIIS

Query:  NEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLARL--------------QRTQEAHQGSLVCDIHQMEEQLQMHSSRMEFAER
        +EI  C  +K G LF P+ IT LC+    P   ++  L + G ID   +AR+               R   A       D+ Q  + L+   S+ E   +
Subjt:  NEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDVTLMDKGIIDTPNLARL--------------QRTQEAHQGSLVCDIHQMEEQLQMHSSRMEFAER

Query:  KFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDLL
        + Q FW Y K RD AL+ ALQ+NF++P PTFP FP ++L
Subjt:  KFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDLL

A0A7J6FZ22 Uncharacterized protein6.1e-1928.73Show/hide
Query:  RDFLFERGF--GDD----LPRFLRTGIANHGWSQFCAKLEPVNSNIVREFYANIDDQEGFQLNAAVREVGI-------------EGAQWRLSKTEKR---
        ++F  +RG   GD+    +P +L   I    W Q C         +V+EFYAN    E       VREV +             +   +  SK   +   
Subjt:  RDFLFERGF--GDD----LPRFLRTGIANHGWSQFCAKLEPVNSNIVREFYANIDDQEGFQLNAAVREVGI-------------EGAQWRLSKTEKR---

Query:  ---------TFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDV
                  F+   LK +      F++  LLP +HDSTVSR+ + +++ I++   INVGK+I+ EI++C  +  GKLF    IT  C+   VPM  D+ 
Subjt:  ---------TFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNADDV

Query:  TLMDKGIID-TPNLARLQRTQEAHQGSLVCDIHQMEEQLQMHSSRMEFAERKFQTFWNYVKRRDAALRGALQSNF
         +  KG++   P+ A  + T      S   D   M+E+L  H +  +    + QT WNY + RD  +   L+ N+
Subjt:  TLMDKGIID-TPNLARLQRTQEAHQGSLVCDIHQMEEQLQMHSSRMEFAERKFQTFWNYVKRRDAALRGALQSNF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCAAACAAGAGGACGAAAGGAAAGGGATGTTGAGGAAGAAGAGGTGCAGGTTACACCTGAAGCACCAAAAACTAAGGCAAAGAAGAAGAAGACGCCAGAGGAAAA
AGAAGCTAAAAGAAGGAGGCGACAACAGAGGGCTGAGGTTGAAGAGGTAGTGCAAAAAGTGGTGGAAGATGTTGTTGCTGCGATGGTTGAGGAGGAAAATCCGAAGGAAC
CAGAGGAACAGAATCCTGGACAGAATGTTCCGATAGTCGAAAATCCTCATTATGCCGGGCGTCGCCGTCGGAAGCAGAAAGTCGGCCACATTAAGGTAATCCGGACTGAT
ACTCCTTCGCCGCCAACAACTGATTCTGAGAAACAGAATACGGAAAGAGAAGAGCAGGAGAAAAAGGAGGCAGAGGACAAAGCGAGAGAAGAAATAGAAAATAAAATGGA
GGAGGAAATTTTGCCCAAACAGAGGGAAGACAAGGGCAAAGAGATGCTGAAAAGAGATTTTCTGTTTGAAAGGGGATTTGGTGATGATTTGCCACGATTCTTGAGGACTG
GAATAGCAAATCATGGATGGAGTCAGTTCTGCGCAAAGCTGGAGCCGGTCAATTCCAATATTGTCCGTGAATTTTATGCAAATATAGATGATCAAGAAGGATTTCAGTTA
AATGCGGCTGTCAGGGAGGTTGGCATCGAAGGGGCTCAATGGAGGTTGTCGAAGACAGAAAAGCGCACGTTTCAAGCGGCTTATTTGAAGAGCGAAGCCAATACATGGAT
GGGCTTCATCAAGTTGCGCTTGCTTCCGAAAACTCACGATTCAACGGTGTCTCGAGACGGGGTGCTGCTGGTATTTGTTATTCTTCGTTCTATGAGTATTAACGTGGGCA
AAATCATTTCTAATGAGATTTATGATTGTTGGCGGAAGAAGGTAGGGAAGCTATTTTCCCCAAACACCATAACGATGCTATGTCAAAGGGTAAGGGTTCCTATGAATGCA
GACGATGTCACTCTAATGGACAAGGGAATAATCGACACGCCAAACCTAGCTAGGCTTCAAAGAACACAAGAGGCACACCAAGGCAGTTTGGTGTGCGACATCCATCAAAT
GGAAGAGCAATTACAAATGCATTCCAGCAGGATGGAGTTTGCCGAAAGGAAATTCCAAACCTTCTGGAATTATGTGAAGAGAAGGGATGCCGCGTTGAGGGGGGCCTTGC
AGTCTAACTTTTCTAAACCATATCCGACCTTCCCAATATTCCCTGATGACCTACTGAACCCTTGGATTCCGCCACCGCCAGTCGAAAGAGAAGGAGATGAAGAAGAAGAT
CCTGGACTTAGAATTATTTTGCTACAGCAGAGCTTGGTTTTGCAGAATGCTCAAGTAAAGGTTGAAGGTAGTGTTGGATTATCTGTTTTGATTGAGTTTCGCTGCTCGAC
CCTCCCGCTGCCACCGCCTATCCAGTCGCGTCGCACCGCCTATCTAGTCGCTGGAGTCGCTCCGCCGCCACCACTCATCAAGCCTCGCAGGAAAAGCTACTTGTGTAGTC
GCCCATTCCTTTGCAAATCTCTCCCTCTCTCAGTTTTTCGCGTGGAAGTCGCCAAGCCGGCGAAGTTACAGCCTATCGCACCGTCGCCACCTGCTCGTGTCATCGCTGCT
CAACATCTCGCACGCGAGGTGAAGAAACCAGTGTGCCGTCAAAGGGATCCATTGGGGTCGAATAACTGTAAGCTCGAATACCCACTGCCCAAAGATCGTTCTAGCACGTT
GTTCGAGGTTAATGTAACCCGTCTAGCAATAGGTCGATATCTTGTAAAGCTAAGGTTACAAGAGAACTCACCCAGCAAGATACTGTCTAAGGCTGGAGGTCCTCACCGAA
GCTTTGGAAGCAATTCGGGGCTTAAACGAGCGAAACCAGAGCGTAAAATGACCATTTTGCCCCTGGAGCCTCATCAACGTCGAGACACCCATCATCAAAGTCGAGACTTT
CAGGAAAGTACTCGAGCTGAGTTGAAAGACATGTCTGATAATATTGATAAGATAGAAACATATGTGACAAAACTGGAGAAGCATGTAAAGCAACAAGCACAACTGACCCA
AGAGACAATCCTTACAACAGAAGTTCCCTTCGCTGCAAAGGTTGATGATGAAGAAATTGAGTGTCCATTGCCATCTAACGATCGATCATGTTATATTAAAGAAGAGTCTA
AACCAGATTGCGAGGAAATTCAGGGGGAAAATGAGTTGAGGTTCTCATATGGTCCTCACCAGAGCTTTGGAAGCAATTCGGAGCTTAAACGAGCGAAACCAGAGCGTAAA
ATGACCATTTTGCCCCTGGAGCCTCATCAGCATCTAGATGCCCATAATTTGAGTCGAGACGCTAATCAAATAAGGAAGGAATGTGGGAGTCAAGCTGAAGTAACAGCGTC
GCGACGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGCAAACAAGAGGACGAAAGGAAAGGGATGTTGAGGAAGAAGAGGTGCAGGTTACACCTGAAGCACCAAAAACTAAGGCAAAGAAGAAGAAGACGCCAGAGGAAAA
AGAAGCTAAAAGAAGGAGGCGACAACAGAGGGCTGAGGTTGAAGAGGTAGTGCAAAAAGTGGTGGAAGATGTTGTTGCTGCGATGGTTGAGGAGGAAAATCCGAAGGAAC
CAGAGGAACAGAATCCTGGACAGAATGTTCCGATAGTCGAAAATCCTCATTATGCCGGGCGTCGCCGTCGGAAGCAGAAAGTCGGCCACATTAAGGTAATCCGGACTGAT
ACTCCTTCGCCGCCAACAACTGATTCTGAGAAACAGAATACGGAAAGAGAAGAGCAGGAGAAAAAGGAGGCAGAGGACAAAGCGAGAGAAGAAATAGAAAATAAAATGGA
GGAGGAAATTTTGCCCAAACAGAGGGAAGACAAGGGCAAAGAGATGCTGAAAAGAGATTTTCTGTTTGAAAGGGGATTTGGTGATGATTTGCCACGATTCTTGAGGACTG
GAATAGCAAATCATGGATGGAGTCAGTTCTGCGCAAAGCTGGAGCCGGTCAATTCCAATATTGTCCGTGAATTTTATGCAAATATAGATGATCAAGAAGGATTTCAGTTA
AATGCGGCTGTCAGGGAGGTTGGCATCGAAGGGGCTCAATGGAGGTTGTCGAAGACAGAAAAGCGCACGTTTCAAGCGGCTTATTTGAAGAGCGAAGCCAATACATGGAT
GGGCTTCATCAAGTTGCGCTTGCTTCCGAAAACTCACGATTCAACGGTGTCTCGAGACGGGGTGCTGCTGGTATTTGTTATTCTTCGTTCTATGAGTATTAACGTGGGCA
AAATCATTTCTAATGAGATTTATGATTGTTGGCGGAAGAAGGTAGGGAAGCTATTTTCCCCAAACACCATAACGATGCTATGTCAAAGGGTAAGGGTTCCTATGAATGCA
GACGATGTCACTCTAATGGACAAGGGAATAATCGACACGCCAAACCTAGCTAGGCTTCAAAGAACACAAGAGGCACACCAAGGCAGTTTGGTGTGCGACATCCATCAAAT
GGAAGAGCAATTACAAATGCATTCCAGCAGGATGGAGTTTGCCGAAAGGAAATTCCAAACCTTCTGGAATTATGTGAAGAGAAGGGATGCCGCGTTGAGGGGGGCCTTGC
AGTCTAACTTTTCTAAACCATATCCGACCTTCCCAATATTCCCTGATGACCTACTGAACCCTTGGATTCCGCCACCGCCAGTCGAAAGAGAAGGAGATGAAGAAGAAGAT
CCTGGACTTAGAATTATTTTGCTACAGCAGAGCTTGGTTTTGCAGAATGCTCAAGTAAAGGTTGAAGGTAGTGTTGGATTATCTGTTTTGATTGAGTTTCGCTGCTCGAC
CCTCCCGCTGCCACCGCCTATCCAGTCGCGTCGCACCGCCTATCTAGTCGCTGGAGTCGCTCCGCCGCCACCACTCATCAAGCCTCGCAGGAAAAGCTACTTGTGTAGTC
GCCCATTCCTTTGCAAATCTCTCCCTCTCTCAGTTTTTCGCGTGGAAGTCGCCAAGCCGGCGAAGTTACAGCCTATCGCACCGTCGCCACCTGCTCGTGTCATCGCTGCT
CAACATCTCGCACGCGAGGTGAAGAAACCAGTGTGCCGTCAAAGGGATCCATTGGGGTCGAATAACTGTAAGCTCGAATACCCACTGCCCAAAGATCGTTCTAGCACGTT
GTTCGAGGTTAATGTAACCCGTCTAGCAATAGGTCGATATCTTGTAAAGCTAAGGTTACAAGAGAACTCACCCAGCAAGATACTGTCTAAGGCTGGAGGTCCTCACCGAA
GCTTTGGAAGCAATTCGGGGCTTAAACGAGCGAAACCAGAGCGTAAAATGACCATTTTGCCCCTGGAGCCTCATCAACGTCGAGACACCCATCATCAAAGTCGAGACTTT
CAGGAAAGTACTCGAGCTGAGTTGAAAGACATGTCTGATAATATTGATAAGATAGAAACATATGTGACAAAACTGGAGAAGCATGTAAAGCAACAAGCACAACTGACCCA
AGAGACAATCCTTACAACAGAAGTTCCCTTCGCTGCAAAGGTTGATGATGAAGAAATTGAGTGTCCATTGCCATCTAACGATCGATCATGTTATATTAAAGAAGAGTCTA
AACCAGATTGCGAGGAAATTCAGGGGGAAAATGAGTTGAGGTTCTCATATGGTCCTCACCAGAGCTTTGGAAGCAATTCGGAGCTTAAACGAGCGAAACCAGAGCGTAAA
ATGACCATTTTGCCCCTGGAGCCTCATCAGCATCTAGATGCCCATAATTTGAGTCGAGACGCTAATCAAATAAGGAAGGAATGTGGGAGTCAAGCTGAAGTAACAGCGTC
GCGACGCTAG
Protein sequenceShow/hide protein sequence
MAQTRGRKERDVEEEEVQVTPEAPKTKAKKKKTPEEKEAKRRRRQQRAEVEEVVQKVVEDVVAAMVEEENPKEPEEQNPGQNVPIVENPHYAGRRRRKQKVGHIKVIRTD
TPSPPTTDSEKQNTEREEQEKKEAEDKAREEIENKMEEEILPKQREDKGKEMLKRDFLFERGFGDDLPRFLRTGIANHGWSQFCAKLEPVNSNIVREFYANIDDQEGFQL
NAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPKTHDSTVSRDGVLLVFVILRSMSINVGKIISNEIYDCWRKKVGKLFSPNTITMLCQRVRVPMNA
DDVTLMDKGIIDTPNLARLQRTQEAHQGSLVCDIHQMEEQLQMHSSRMEFAERKFQTFWNYVKRRDAALRGALQSNFSKPYPTFPIFPDDLLNPWIPPPPVEREGDEEED
PGLRIILLQQSLVLQNAQVKVEGSVGLSVLIEFRCSTLPLPPPIQSRRTAYLVAGVAPPPPLIKPRRKSYLCSRPFLCKSLPLSVFRVEVAKPAKLQPIAPSPPARVIAA
QHLAREVKKPVCRQRDPLGSNNCKLEYPLPKDRSSTLFEVNVTRLAIGRYLVKLRLQENSPSKILSKAGGPHRSFGSNSGLKRAKPERKMTILPLEPHQRRDTHHQSRDF
QESTRAELKDMSDNIDKIETYVTKLEKHVKQQAQLTQETILTTEVPFAAKVDDEEIECPLPSNDRSCYIKEESKPDCEEIQGENELRFSYGPHQSFGSNSELKRAKPERK
MTILPLEPHQHLDAHNLSRDANQIRKECGSQAEVTASRR