; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027203 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027203
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationscaffold8:6909239..6911707
RNA-Seq ExpressionSpg027203
SyntenySpg027203
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]1.1e-2129.63Show/hide
Query:  FVSRVISQYKWQEFCAHPQEAVVPLV---------------------------HINRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQT
        F+S VI    WQ FC HP + +VPLV                           +IN V  I    +    ++I +   +Q+KE LK +A  G QW  S  
Subjt:  FVSRVISQYKWQEFCAHPQEAVVPLV---------------------------HINRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQT

Query:  KVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKP
           T    +L+P + VW HFL  RL+ +TH  TIS +R +LLY ++ G  IN+G +I D+I AC  K  G L+F SLI++LC +  +     + R     
Subjt:  KVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKP

Query:  TIDLSLIRKLQQNNILRKDK-PSTSQATPPSGPSIASPSQHTPFTGPSPSSEALA-----------IAYRQLDHIRENLKTYWAYAKERDEAIREFY
         +DL  I ++      + +K     +   PS PS    + HT     + S E L              +  L   +E L  +W Y+++RD A+++ +
Subjt:  TIDLSLIRKLQQNNILRKDK-PSTSQATPPSGPSIASPSQHTPFTGPSPSSEALA-----------IAYRQLDHIRENLKTYWAYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.1e-2532.23Show/hide
Query:  LEFVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKES
        L F+++VI+Q+ W++FCAHP++ +VPLV                            IN V+ +  P++   ++ I+N +   +   L+ +A  G +W  S
Subjt:  LEFVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKES

Query:  QTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFF
             T + S L P + VW HFLK  L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     ++E+   
Subjt:  QTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFF

Query:  KPTIDLSLIRKLQQNNILRKDKPSTSQATPPSG-PSIASPSQ
           ID   + ++ Q      + P+ S   P S  P+ AS S+
Subjt:  KPTIDLSLIRKLQQNNILRKDKPSTSQATPPSG-PSIASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]7.4e-3129.91Show/hide
Query:  LEFVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKES
        L F+++VI+Q+ W++FCAHP++ +VPLV                            IN V+ +  P++   ++ I+N + + +   L+ +A  G +W  S
Subjt:  LEFVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKES

Query:  QTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFF
             T + S L P + VW HFLK RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     ++E+   
Subjt:  QTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFF

Query:  KPTID-LSLIRKLQQNNILRKDKPSTSQ-ATPPSGPSIASPSQHTPFTGPSPSSEALAIAYRQ--LDHIRENLKTYWAYAKERDEAIREFYLSIVPSIAL
           ID +++ R  Q+       +PS+S+ AT  S  +     Q         S + +   +    L H  +  + +WAY+KERD A+++   +       
Subjt:  KPTID-LSLIRKLQQNNILRKDKPSTSQ-ATPPSGPSIASPSQHTPFTGPSPSSEALAIAYRQ--LDHIRENLKTYWAYAKERDEAIREFYLSIVPSIAL

Query:  VFPDFPQSLLPQEDKDFDDEEVEENESSSDE
         FP FPQ +L   D +++ E  ++  + + E
Subjt:  VFPDFPQSLLPQEDKDFDDEEVEENESSSDE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.1e-2333.87Show/hide
Query:  FVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQT
        F++ VI Q+ WQ FCAHP++ +VPLV                            IN ++ +  P++   ++ +++ +  ++   L+ +A  G +W  S  
Subjt:  FVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQT

Query:  KVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK
           T + S L P + VW HFLK RL+PTTH  T+S + V LLY ++ G  IN+G +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  KVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.0e-2430.63Show/hide
Query:  INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIG
        IN V+ +  P++   ++ I+N +  ++   L+ +A  G +W  S     T + S L P + VW HFLK RL+PTTH   +S DR++LL+ ++ G  IN+G
Subjt:  INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIG

Query:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIRKLQQNNILRKDKPSTSQATPPSGPSIASPSQHTPFTGPSPSSEALA
         +I  EI AC  ++ G LFF SLIT+LC+    +  ++K  +     ID   + ++ Q      + P+ S   P S    A+ S  T         +AL 
Subjt:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIRKLQQNNILRKDKPSTSQATPPSGPSIASPSQHTPFTGPSPSSEALA

Query:  IAYRQLDHIRENLKTYWAYAKERDEAIREFYLSIVPSIALVFPDFPQSLLPQEDKDFDDEEVEENESSSDE
            Q +H  +  + +WAY+KERD A+++   +        FP FPQ +L   D +++ E  ++  + + E
Subjt:  IAYRQLDHIRENLKTYWAYAKERDEAIREFYLSIVPSIALVFPDFPQSLLPQEDKDFDDEEVEENESSSDE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.0e-2532.23Show/hide
Query:  LEFVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKES
        L F+++VI+Q+ W++FCAHP++ +VPLV                            IN V+ +  P++   ++ I+N +   +   L+ +A  G +W  S
Subjt:  LEFVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKES

Query:  QTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFF
             T + S L P + VW HFLK  L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     ++E+   
Subjt:  QTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFF

Query:  KPTIDLSLIRKLQQNNILRKDKPSTSQATPPSG-PSIASPSQ
           ID   + ++ Q      + P+ S   P S  P+ AS S+
Subjt:  KPTIDLSLIRKLQQNNILRKDKPSTSQATPPSG-PSIASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment)3.6e-3129.91Show/hide
Query:  LEFVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKES
        L F+++VI+Q+ W++FCAHP++ +VPLV                            IN V+ +  P++   ++ I+N + + +   L+ +A  G +W  S
Subjt:  LEFVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKES

Query:  QTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFF
             T + S L P + VW HFLK RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     ++E+   
Subjt:  QTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFF

Query:  KPTID-LSLIRKLQQNNILRKDKPSTSQ-ATPPSGPSIASPSQHTPFTGPSPSSEALAIAYRQ--LDHIRENLKTYWAYAKERDEAIREFYLSIVPSIAL
           ID +++ R  Q+       +PS+S+ AT  S  +     Q         S + +   +    L H  +  + +WAY+KERD A+++   +       
Subjt:  KPTID-LSLIRKLQQNNILRKDKPSTSQ-ATPPSGPSIASPSQHTPFTGPSPSSEALAIAYRQ--LDHIRENLKTYWAYAKERDEAIREFYLSIVPSIAL

Query:  VFPDFPQSLLPQEDKDFDDEEVEENESSSDE
         FP FPQ +L   D +++ E  ++  + + E
Subjt:  VFPDFPQSLLPQEDKDFDDEEVEENESSSDE

A0A2P5DAQ2 Uncharacterized protein5.6e-2433.87Show/hide
Query:  FVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQT
        F++ VI Q+ WQ FCAHP++ +VPLV                            IN ++ +  P++   ++ +++ +  ++   L+ +A  G +W  S  
Subjt:  FVSRVISQYKWQEFCAHPQEAVVPLVH---------------------------INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQT

Query:  KVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK
           T + S L P + VW HFLK RL+PTTH  T+S + V LLY ++ G  IN+G +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  KVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK

A0A2P5DXM3 Uncharacterized protein1.9e-2430.63Show/hide
Query:  INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIG
        IN V+ +  P++   ++ I+N +  ++   L+ +A  G +W  S     T + S L P + VW HFLK RL+PTTH   +S DR++LL+ ++ G  IN+G
Subjt:  INRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIG

Query:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIRKLQQNNILRKDKPSTSQATPPSGPSIASPSQHTPFTGPSPSSEALA
         +I  EI AC  ++ G LFF SLIT+LC+    +  ++K  +     ID   + ++ Q      + P+ S   P S    A+ S  T         +AL 
Subjt:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIRKLQQNNILRKDKPSTSQATPPSGPSIASPSQHTPFTGPSPSSEALA

Query:  IAYRQLDHIRENLKTYWAYAKERDEAIREFYLSIVPSIALVFPDFPQSLLPQEDKDFDDEEVEENESSSDE
            Q +H  +  + +WAY+KERD A+++   +        FP FPQ +L   D +++ E  ++  + + E
Subjt:  IAYRQLDHIRENLKTYWAYAKERDEAIREFYLSIVPSIALVFPDFPQSLLPQEDKDFDDEEVEENESSSDE

W9RBS1 Uncharacterized protein5.2e-2229.63Show/hide
Query:  FVSRVISQYKWQEFCAHPQEAVVPLV---------------------------HINRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQT
        F+S VI    WQ FC HP + +VPLV                           +IN V  I    +    ++I +   +Q+KE LK +A  G QW  S  
Subjt:  FVSRVISQYKWQEFCAHPQEAVVPLV---------------------------HINRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQT

Query:  KVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKP
           T    +L+P + VW HFL  RL+ +TH  TIS +R +LLY ++ G  IN+G +I D+I AC  K  G L+F SLI++LC +  +     + R     
Subjt:  KVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKP

Query:  TIDLSLIRKLQQNNILRKDK-PSTSQATPPSGPSIASPSQHTPFTGPSPSSEALA-----------IAYRQLDHIRENLKTYWAYAKERDEAIREFY
         +DL  I ++      + +K     +   PS PS    + HT     + S E L              +  L   +E L  +W Y+++RD A+++ +
Subjt:  TIDLSLIRKLQQNNILRKDK-PSTSQATPPSGPSIASPSQHTPFTGPSPSSEALA-----------IAYRQLDHIRENLKTYWAYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCGCGCCGACGATCGTGAACCAAGGTGCTTCCAACACTCAAACTCCTTCTTCTTCGACAATGCCGGCCAGTTCGAGGGAGAATCCGAGTTCGTCTGCACGTAG
GAGGTCCACGCGCGCCACTGCTGTTTGCCAAACCCAAAAACCCGCAACTCAAAAGTTCAAAAAACGCTCGCAGGAATGGTTTTCAATGATCCGGGCAATGGGAGCTCAGA
GACGGGCTGCTCTTGAAGAAGAAGGAAATAGGAGAGATGAAGAAGAAGCCGCCAAAGCAGCAGGAAGCTCCCGACAAGGAGAGGCTTCAACGGGTAAAAATTCTGAACCT
TCAACTAACCCCTCTTCGTCTTGCAGGAACAAACCATTCGTTACCTACAGCGCAAGAAAGAGGAGTCCCAAGAAAGTTGCTCTCGTGATTGCTGAAGGGTATAAGGCTGA
ACAAGAAGCCTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAACACATGAGGGAAGAAGATGAAGGTGCAAGAGAAAGAGATCTTGAAGAAGAGAGGAAAAAGG
AAGAGGAAAGACTGGAGGCCGAGATGGCCAAGTTAGCTGAAGAAGAAGAGAGGAAGTTAGATGAAGACCTCAGGAGGGCAGCTGCTGATTTGCAACTCCTTGAGGAAGAA
AAACGAAGAAGGGAAGAATTAAAAGAAGAAGAGAAAAGAAGAAAGGAAGCTGAAGACTTCCTTGCAGCTTTTGAACCACTCCACAAGGCTCAAAGTGAGGCTGAGATGCT
GCAAGGAAGAAAAAAAAGGGCCCAGCAGGGGCCAAGTGAAGGAAGCCTAGCAGAGACCACTGAAGTTCAGCCTGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACC
CACCAGCAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAAGAAGAGAAAGAAAGTAAGGAGGTCGAGACCTCCAGTGATTCAGAGACAGAATCCGACTTAGAGATT
AAGGAGCTAGATGATGACCAAGTTCCTATCTCTGCGGCATTGAGGAGAAAGAGAAGAAGAGAGATCAAAGCTGAAAGAAGGACCAAGAATAAGAATGAACCTATTTTTGC
CAAGAGGCTGAGGACTAAGTCTATGGACGCCTCTCCAGCAGCTCCTCCTACCATCTCACCCGACAAGCCAAAGGCCAAATCACCCAAGGCTGCATCTCCTAAAAATCCAT
TCCCCGAGGGATTCTCTAACAGAGCAGGAGCACTGCTAGAGTTCGTAAGCAGAGTTATCTCCCAGTATAAATGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTTGTG
CCTTTAGTTCACATTAACCGGGTGTATAGAATCAAAACACCCTTGAATCCAAGAGGGAACGATGTTATCAAGAACCCCTCGGCCAAGCAGATGAAAGAAGCACTTAAACT
CATGGCCAACAAGGGAGTTCAGTGGAAAGAGTCCCAAACGAAGGTGAAGACTCTAGTGCCAAGCGATCTAAAGCCAGAATCGGCAGTTTGGCTTCACTTTCTGAAGAAAC
GTTTGATGCCAACCACCCACGATAGCACCATTTCAGTAGATAGAGTGATGCTACTCTACTGTATTATGAAGGGGTTGGAGATCAATATTGGGAGCATAATCAGGGATGAG
ATTTTAGCCTGTGGAAGGAAACGAGCAGGTAAACTTTTCTTTGGATCACTCATCACCCAGCTTTGTCAGAGGGTGAAGATAGTTCCAGGCAAGGACAAGGAGCGTCATTT
CTTCAAGCCGACCATCGACCTGTCCTTGATCAGGAAGCTTCAACAGAATAACATCCTAAGGAAAGATAAACCCTCCACATCTCAGGCCACTCCACCATCAGGGCCGAGCA
TTGCTTCTCCATCCCAGCACACTCCTTTTACAGGGCCCTCACCGTCATCGGAAGCCCTAGCTATTGCCTACCGCCAGCTTGATCACATCAGGGAAAACTTGAAGACATAT
TGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCGTCCCGAGTATTGCTCTGGTTTTTCCCGATTTCCCTCAGTCGCTGCTGCCTCAAGA
AGACAAGGATTTTGATGATGAAGAAGTTGAAGAGAACGAGAGTTCCTCGGACGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTCGCGCCGACGATCGTGAACCAAGGTGCTTCCAACACTCAAACTCCTTCTTCTTCGACAATGCCGGCCAGTTCGAGGGAGAATCCGAGTTCGTCTGCACGTAG
GAGGTCCACGCGCGCCACTGCTGTTTGCCAAACCCAAAAACCCGCAACTCAAAAGTTCAAAAAACGCTCGCAGGAATGGTTTTCAATGATCCGGGCAATGGGAGCTCAGA
GACGGGCTGCTCTTGAAGAAGAAGGAAATAGGAGAGATGAAGAAGAAGCCGCCAAAGCAGCAGGAAGCTCCCGACAAGGAGAGGCTTCAACGGGTAAAAATTCTGAACCT
TCAACTAACCCCTCTTCGTCTTGCAGGAACAAACCATTCGTTACCTACAGCGCAAGAAAGAGGAGTCCCAAGAAAGTTGCTCTCGTGATTGCTGAAGGGTATAAGGCTGA
ACAAGAAGCCTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAACACATGAGGGAAGAAGATGAAGGTGCAAGAGAAAGAGATCTTGAAGAAGAGAGGAAAAAGG
AAGAGGAAAGACTGGAGGCCGAGATGGCCAAGTTAGCTGAAGAAGAAGAGAGGAAGTTAGATGAAGACCTCAGGAGGGCAGCTGCTGATTTGCAACTCCTTGAGGAAGAA
AAACGAAGAAGGGAAGAATTAAAAGAAGAAGAGAAAAGAAGAAAGGAAGCTGAAGACTTCCTTGCAGCTTTTGAACCACTCCACAAGGCTCAAAGTGAGGCTGAGATGCT
GCAAGGAAGAAAAAAAAGGGCCCAGCAGGGGCCAAGTGAAGGAAGCCTAGCAGAGACCACTGAAGTTCAGCCTGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACC
CACCAGCAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAAGAAGAGAAAGAAAGTAAGGAGGTCGAGACCTCCAGTGATTCAGAGACAGAATCCGACTTAGAGATT
AAGGAGCTAGATGATGACCAAGTTCCTATCTCTGCGGCATTGAGGAGAAAGAGAAGAAGAGAGATCAAAGCTGAAAGAAGGACCAAGAATAAGAATGAACCTATTTTTGC
CAAGAGGCTGAGGACTAAGTCTATGGACGCCTCTCCAGCAGCTCCTCCTACCATCTCACCCGACAAGCCAAAGGCCAAATCACCCAAGGCTGCATCTCCTAAAAATCCAT
TCCCCGAGGGATTCTCTAACAGAGCAGGAGCACTGCTAGAGTTCGTAAGCAGAGTTATCTCCCAGTATAAATGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTTGTG
CCTTTAGTTCACATTAACCGGGTGTATAGAATCAAAACACCCTTGAATCCAAGAGGGAACGATGTTATCAAGAACCCCTCGGCCAAGCAGATGAAAGAAGCACTTAAACT
CATGGCCAACAAGGGAGTTCAGTGGAAAGAGTCCCAAACGAAGGTGAAGACTCTAGTGCCAAGCGATCTAAAGCCAGAATCGGCAGTTTGGCTTCACTTTCTGAAGAAAC
GTTTGATGCCAACCACCCACGATAGCACCATTTCAGTAGATAGAGTGATGCTACTCTACTGTATTATGAAGGGGTTGGAGATCAATATTGGGAGCATAATCAGGGATGAG
ATTTTAGCCTGTGGAAGGAAACGAGCAGGTAAACTTTTCTTTGGATCACTCATCACCCAGCTTTGTCAGAGGGTGAAGATAGTTCCAGGCAAGGACAAGGAGCGTCATTT
CTTCAAGCCGACCATCGACCTGTCCTTGATCAGGAAGCTTCAACAGAATAACATCCTAAGGAAAGATAAACCCTCCACATCTCAGGCCACTCCACCATCAGGGCCGAGCA
TTGCTTCTCCATCCCAGCACACTCCTTTTACAGGGCCCTCACCGTCATCGGAAGCCCTAGCTATTGCCTACCGCCAGCTTGATCACATCAGGGAAAACTTGAAGACATAT
TGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCGTCCCGAGTATTGCTCTGGTTTTTCCCGATTTCCCTCAGTCGCTGCTGCCTCAAGA
AGACAAGGATTTTGATGATGAAGAAGTTGAAGAGAACGAGAGTTCCTCGGACGAGGACTAG
Protein sequenceShow/hide protein sequence
MGFAPTIVNQGASNTQTPSSSTMPASSRENPSSSARRRSTRATAVCQTQKPATQKFKKRSQEWFSMIRAMGAQRRAALEEEGNRRDEEEAAKAAGSSRQGEASTGKNSEP
STNPSSSCRNKPFVTYSARKRSPKKVALVIAEGYKAEQEALKDIEAEREMENQHMREEDEGARERDLEEERKKEEERLEAEMAKLAEEEERKLDEDLRRAAADLQLLEEE
KRRREELKEEEKRRKEAEDFLAAFEPLHKAQSEAEMLQGRKKRAQQGPSEGSLAETTEVQPADEVFEPLFKDDPPAADSTSSGEKRDEEEKESKEVETSSDSETESDLEI
KELDDDQVPISAALRRKRRREIKAERRTKNKNEPIFAKRLRTKSMDASPAAPPTISPDKPKAKSPKAASPKNPFPEGFSNRAGALLEFVSRVISQYKWQEFCAHPQEAVV
PLVHINRVYRIKTPLNPRGNDVIKNPSAKQMKEALKLMANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKKRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDE
ILACGRKRAGKLFFGSLITQLCQRVKIVPGKDKERHFFKPTIDLSLIRKLQQNNILRKDKPSTSQATPPSGPSIASPSQHTPFTGPSPSSEALAIAYRQLDHIRENLKTY
WAYAKERDEAIREFYLSIVPSIALVFPDFPQSLLPQEDKDFDDEEVEENESSSDED