; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006714 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006714
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold1:55861043..55869098
RNA-Seq ExpressionSpg006714
SyntenySpg006714
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]4.0e-1326.3Show/hide
Query:  IKSLKAPSPKNPFLEVFKDVNFQERMEIMRKKDFLNEKGF---SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMA
        ++S  AP P  P    F D   +E  + ++ +    E GF         L   V  V+T++KWQ+   HP  V   +V+EFY+ + E +    +VRG   
Subjt:  IKSLKAPSPKNPFLEVFKDVNFQERMEIMRKKDFLNEKGF---SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMA

Query:  SFSFVDINRVYRIKAP-------LHPRGNDAIKNPLVQTDERSTK-NGGQL-----------------------GLMPTTHDSTISVERVMHFYSIMKGL
         F+   INR ++++              ++  +  L       T+ NG QL                        LMPT+H++T+S +R++  +SI+ G 
Subjt:  SFSFVDINRVYRIKAP-------LHPRGNDAIKNPLVQTDERSTK-NGGQL-----------------------GLMPTTHDSTISVERVMHFYSIMKGL

Query:  EINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQRVTIVPSKDEERHFFRSTIDLPLIGKL--QQNNAQRKDKASTSQVTPSPGLNLAS
         I+IG II +   LC +++A  L F +LIT LC++  +     +E     + ++   I  L   +    +K +A+TS+V  SP +  +S
Subjt:  EINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQRVTIVPSKDEERHFFRSTIDLPLIGKL--QQNNAQRKDKASTSQVTPSPGLNLAS

KAF4375842.1 hypothetical protein G4B88_026421 [Cannabis sativa]2.2e-1631.22Show/hide
Query:  KNPFLEVFKDVNFQERMEIMRKKDFLNEKGF---SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTG-LREKSMSMTVVRGKMASFSFVDIN
        KN F E+ K    ++ +  +R K+F  ++G        G++P ++ + I +  W +L   P   V  +V+EFY   L  +  +   VR     FS  DIN
Subjt:  KNPFLEVFKDVNFQERMEIMRKKDFLNEKGF---SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTG-LREKSMSMTVVRGKMASFSFVDIN

Query:  RVYRIK---------------APLHPRGNDAIKNPLVQTDERSTKNGGQLGLMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLF
          Y +K                 +  RG    K   ++ D +   N  Q  L+PT+HDST+S ER+   Y I+KG +IN+G +I KEI  C  +  GKLF
Subjt:  RVYRIK---------------APLHPRGNDAIKNPLVQTDERSTKNGGQLGLMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLF

Query:  FGSLITQLCQRVTIVPSKDEE
        F  LIT+ C+   +    DE+
Subjt:  FGSLITQLCQRVTIVPSKDEE

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.2e-1930Show/hide
Query:  EKGF----SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQT
        EKGF    S   G LP F+ +VITQ+ W++  AHP++ +VPLVREFY  L +   +   VRG   S+S   IN V+ +  P+        +  ++ L+  
Subjt:  EKGF----SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQT

Query:  DERSTKNGGQLG----------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQR
         E     G +                              L+PTTH  T+S +R++  +S++ G  IN+G +I  EI  C  +K G LFF SLIT+LC+ 
Subjt:  DERSTKNGGQLG----------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQR

Query:  VTIVPSKDEERHFFRSTIDLPLIGKLQQNNAQRKDKASTS
               +EE+      ID   + ++ Q       +  +S
Subjt:  VTIVPSKDEERHFFRSTIDLPLIGKLQQNNAQRKDKASTS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.3e-1930Show/hide
Query:  EKGF----SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQT
        EKGF    S   G LP F+ +VITQ+ W++  AHP++ +VPLVREFY  L +   +   VRG   S+S   IN V+ +  P+        +  +  L+  
Subjt:  EKGF----SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQT

Query:  DERSTKNGGQLG----------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQR
         E     G +                              L+PTTH  T+S +R++  +S++ G  IN+G +I  EI  C  +K G LFF SLIT+LC+ 
Subjt:  DERSTKNGGQLG----------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQR

Query:  VTIVPSKDEERHFFRSTIDLPLIGKLQQNNAQRKDKASTS
               +EE+      ID   + ++ Q       +  +S
Subjt:  VTIVPSKDEERHFFRSTIDLPLIGKLQQNNAQRKDKASTS

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]3.5e-1731.63Show/hide
Query:  PKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQTDERSTKNGGQLG---
        P F+  VI Q+ WQ   AHP++ +VPLVREFYT +         +RG     S   IN ++ +  P+        D  K  LV   E     G +     
Subjt:  PKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQTDERSTKNGGQLG---

Query:  -------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQRVTIVPSKDEER
                                 L+PTTH  T+S E V   YS++ G  IN+G +I +EI  C  +K+G LFF SLIT +C+        +EE+
Subjt:  -------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQRVTIVPSKDEER

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.0e-1930Show/hide
Query:  EKGF----SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQT
        EKGF    S   G LP F+ +VITQ+ W++  AHP++ +VPLVREFY  L +   +   VRG   S+S   IN V+ +  P+        +  ++ L+  
Subjt:  EKGF----SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQT

Query:  DERSTKNGGQLG----------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQR
         E     G +                              L+PTTH  T+S +R++  +S++ G  IN+G +I  EI  C  +K G LFF SLIT+LC+ 
Subjt:  DERSTKNGGQLG----------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQR

Query:  VTIVPSKDEERHFFRSTIDLPLIGKLQQNNAQRKDKASTS
               +EE+      ID   + ++ Q       +  +S
Subjt:  VTIVPSKDEERHFFRSTIDLPLIGKLQQNNAQRKDKASTS

A0A2P5BCG4 Uncharacterized protein (Fragment)6.2e-2030Show/hide
Query:  EKGF----SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQT
        EKGF    S   G LP F+ +VITQ+ W++  AHP++ +VPLVREFY  L +   +   VRG   S+S   IN V+ +  P+        +  +  L+  
Subjt:  EKGF----SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQT

Query:  DERSTKNGGQLG----------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQR
         E     G +                              L+PTTH  T+S +R++  +S++ G  IN+G +I  EI  C  +K G LFF SLIT+LC+ 
Subjt:  DERSTKNGGQLG----------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQR

Query:  VTIVPSKDEERHFFRSTIDLPLIGKLQQNNAQRKDKASTS
               +EE+      ID   + ++ Q       +  +S
Subjt:  VTIVPSKDEERHFFRSTIDLPLIGKLQQNNAQRKDKASTS

A0A2P5DAQ2 Uncharacterized protein1.7e-1731.63Show/hide
Query:  PKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQTDERSTKNGGQLG---
        P F+  VI Q+ WQ   AHP++ +VPLVREFYT +         +RG     S   IN ++ +  P+        D  K  LV   E     G +     
Subjt:  PKFVTKVITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRG---NDAIKNPLVQTDERSTKNGGQLG---

Query:  -------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQRVTIVPSKDEER
                                 L+PTTH  T+S E V   YS++ G  IN+G +I +EI  C  +K+G LFF SLIT +C+        +EE+
Subjt:  -------------------------LMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQRVTIVPSKDEER

A0A7J6FZ22 Uncharacterized protein1.1e-1631.22Show/hide
Query:  KNPFLEVFKDVNFQERMEIMRKKDFLNEKGF---SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTG-LREKSMSMTVVRGKMASFSFVDIN
        KN F E+ K    ++ +  +R K+F  ++G        G++P ++ + I +  W +L   P   V  +V+EFY   L  +  +   VR     FS  DIN
Subjt:  KNPFLEVFKDVNFQERMEIMRKKDFLNEKGF---SNRAGTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTG-LREKSMSMTVVRGKMASFSFVDIN

Query:  RVYRIK---------------APLHPRGNDAIKNPLVQTDERSTKNGGQLGLMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLF
          Y +K                 +  RG    K   ++ D +   N  Q  L+PT+HDST+S ER+   Y I+KG +IN+G +I KEI  C  +  GKLF
Subjt:  RVYRIK---------------APLHPRGNDAIKNPLVQTDERSTKNGGQLGLMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLF

Query:  FGSLITQLCQRVTIVPSKDEE
        F  LIT+ C+   +    DE+
Subjt:  FGSLITQLCQRVTIVPSKDEE

A0A803Q715 Uncharacterized protein1.6e-1532.79Show/hide
Query:  GTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTG-LREKSMSMTVVRGKMASFSFVDINRVYRIK---------------APLHPRGNDAIKNPLVQ
        G++P ++ + I +  W +L   P   V  +V+EFY   L  +  +   VR     FS  DIN  Y +K                 +  RG    K   ++
Subjt:  GTLPKFVTKVITQYKWQELYAHPQEVVVPLVREFYTG-LREKSMSMTVVRGKMASFSFVDINRVYRIK---------------APLHPRGNDAIKNPLVQ

Query:  TDERSTKNGGQLGLMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQRVTIVPSKDEE
         D +   N  Q  L+PT+HDST+S ER+   Y I+KG +IN+G +I KEI  C  +  GKLFF  LIT+ C+   +    DE+
Subjt:  TDERSTKNGGQLGLMPTTHDSTISVERVMHFYSIMKGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQRVTIVPSKDEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGGTACGAGAGGAGCGAGACCTACAGGTTTCTCGCCAGCGATTGTGGCCCAAGGAACCAACGTTCAAACTCCTTCTTCCTCAACAATGTTGGCCACTTCAAGGAA
GAATCAGAGTAGTTCTCGTCCAAGAATGTCCACACGCACTGATTCCATCCACAAAACCCAAAAACCCGCGGGCCAACAGTTGGAGAAACGCTCGAGGGAATGGTATTCAA
TGATTAGAGGGACGAGAGCCCAAAGGCGTGCGACTCTTGAAGAAGAAGCCAGACTCCGTGATGCTAAAGAAATAGCCAAAGTTGGAGAGAGCTCTCGGCAAGGAGAGACT
CTAATGGGTAACGTCTCCCAACCTTCTTCTAATCCATCTTCTTCTTGTAGGGACAAGACTTTCATGACCTACAAGGCAAAGAAGAAGAATGTGTTCGAGGACATGATCCG
CCAAGTCGTGGCACAAGCCATTGTTATTTCTGAAGGTTACAGGGTTGAGCAAGATGCACTCAGGGAAATTCAGTCTGAAAGGGAGATGGAAAACCAGAGTATGAGGGAAG
AGGACGATTTTGCAAGAAAAAGGGACTTGGAAGAAGAAAGGGAAGCTGAGAGAAGGCAAGAAGAAAAAGACAAGAGGTCGAGAGGGCCAGCCTTTGAGCCATTGCATAAG
GCTCAAAGTGAGGCTGATCTATTGCAAGAAAGAGAAGAAGATGCCCTTGAGGGGCCAAGAGAAGAAAATCCAGAAAAAAGAAGAAGAAAAAATGAAGAAAATGAAGGTCA
GGATGCGACCGCATATGGGCCGCATTCTAAAGAAGGACAAAAGGCCACTGAAGAACAGCCAGCTGATGAGGTTTTCAATCCTCTGTTTAAATATGATCCACCAGCTGCTG
AGAGCATCTATTCAGGAGAGAAGAAGGATGAAGAGGAAATTGAAAGTGAAGAGGCCAAGACCTCCAGTGATTCAGAAACCGATTTAGATTCTAAGATCAAGGAATTGAAT
GACAACCAAATTCCTATCTTTGCAGCATTGAGAAATAAGAGAAGAAGAGAGATTAGGGCTGAGAGGAAAACAAAGAATAAAAATGATCCTATTTTTTTCAAGAGGTTGAG
GACAAGGTCCATGGACGCTTCCCCGATACCTCCTCCAACCATCTCACCTACCAAGCCAAAAATCAAATCACTTAAGGCTCCATCTCCCAAAAATCCATTCCTAGAAGTCT
TCAAAGATGTCAATTTTCAGGAACGGATGGAGATCATGAGAAAAAAAGACTTCCTGAACGAGAAAGGATTCTCAAACAGAGCTGGAACACTGCCAAAGTTCGTTACCAAA
GTTATTACACAATACAAATGGCAGGAACTCTATGCTCATCCCCAGGAGGTCGTGGTGCCTCTAGTTCGAGAATTCTACACTGGTTTGAGGGAGAAGAGCATGAGCATGAC
AGTGGTGAGAGGTAAGATGGCCAGTTTCTCTTTTGTTGACATCAACAGGGTGTACAGAATCAAGGCACCCTTGCATCCAAGAGGGAACGATGCCATCAAGAACCCCCTCG
TCCAAACAGATGAAAGAAGCACTAAAAATGGTGGCCAACTAGGGTTGATGCCAACAACCCATGATAGCACTATTTCAGTAGAGAGAGTTATGCATTTCTACAGTATCATG
AAGGGGTTGGAGATAAACATCGGGAGCATAATAAGGAAGGAGATCCTTTTGTGTGGAAGGAAGAAAGCAGGGAAGCTATTCTTTGGGTCACTTATCACCCAATTATGTCA
GAGGGTAACAATAGTCCCTAGTAAGGATGAGGAGCGCCACTTCTTCAGGTCTACCATTGATCTACCTCTAATTGGGAAGCTCCAACAGAACAATGCCCAAAGGAAGGACA
AAGCTTCCACATCTCAAGTCACTCCATCACCGGGGCTGAATCTGGCTTCTCCACCTCAACTAGGGGTGAGCGTAAATTTTCGAAAAACCGATCCGACCGATCGAAACCGG
CCAAACCGACGTCGGTCAGTCAGTTTCAGTCAAGATTCGATCGGTTTCGGTTTGCCATTATGCCAAACCGAAATGTCAGTCGACAGTAGACTCCCCTCACTTCAAGTTCA
CGAATCACGTCAGTCGCCGCCCCTCGCCGACCCTCGCCGTCGATTTCTTTTTCTTCTTCTTCGTCTGCAACTCACAGTCACAGTCACCGCGCCGCCGTCCGTTCTTGTTC
TTGAGATTAAGATCACCATTCGACATTCGTCACTTGAGATTATTGTCCATTTCTCTCATCACATTGTCGATTTGTCACGCTGGAAAATTCTTACGAGTGATACTAGAGTT
GCATCTGTTTGTTTTAGTAGTGGGAGTGGTGGTGGTAGTAGTAGTGCTAGTGCTAGTACTAGTGCTAATGTTGATTTGAGATTTGATGATGAAACAATGGACCTAGATGA
GGATGAAAACTACAACTATGATATAATACCTCAGTTCAAAAGGCCCACGCCATCATCTGAGGCCCTAGCATTTGCCTACCGACAGTTGGACCAAATCAGGGATAACCTGA
GGAGTTATTGGGCTTATGCCAAGGAGAGAGATGAAGCTAGGAGAGAGTTTTACCTCTCTGTCGCCCCGAGTATTGCTCCTGTCTTTCCTGATTTCCCTCAATCGTTCTTG
CCTCAAGAAGAAAAGGAAACTGAAGATGAAGATGAAGATGAAGAGAAAGAGATGCCCTCGGATGAGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGCACGGTACGAGAGGAGCGAGACCTACAGGTTTCTCGCCAGCGATTGTGGCCCAAGGAACCAACGTTCAAACTCCTTCTTCCTCAACAATGTTGGCCACTTCAAGGAA
GAATCAGAGTAGTTCTCGTCCAAGAATGTCCACACGCACTGATTCCATCCACAAAACCCAAAAACCCGCGGGCCAACAGTTGGAGAAACGCTCGAGGGAATGGTATTCAA
TGATTAGAGGGACGAGAGCCCAAAGGCGTGCGACTCTTGAAGAAGAAGCCAGACTCCGTGATGCTAAAGAAATAGCCAAAGTTGGAGAGAGCTCTCGGCAAGGAGAGACT
CTAATGGGTAACGTCTCCCAACCTTCTTCTAATCCATCTTCTTCTTGTAGGGACAAGACTTTCATGACCTACAAGGCAAAGAAGAAGAATGTGTTCGAGGACATGATCCG
CCAAGTCGTGGCACAAGCCATTGTTATTTCTGAAGGTTACAGGGTTGAGCAAGATGCACTCAGGGAAATTCAGTCTGAAAGGGAGATGGAAAACCAGAGTATGAGGGAAG
AGGACGATTTTGCAAGAAAAAGGGACTTGGAAGAAGAAAGGGAAGCTGAGAGAAGGCAAGAAGAAAAAGACAAGAGGTCGAGAGGGCCAGCCTTTGAGCCATTGCATAAG
GCTCAAAGTGAGGCTGATCTATTGCAAGAAAGAGAAGAAGATGCCCTTGAGGGGCCAAGAGAAGAAAATCCAGAAAAAAGAAGAAGAAAAAATGAAGAAAATGAAGGTCA
GGATGCGACCGCATATGGGCCGCATTCTAAAGAAGGACAAAAGGCCACTGAAGAACAGCCAGCTGATGAGGTTTTCAATCCTCTGTTTAAATATGATCCACCAGCTGCTG
AGAGCATCTATTCAGGAGAGAAGAAGGATGAAGAGGAAATTGAAAGTGAAGAGGCCAAGACCTCCAGTGATTCAGAAACCGATTTAGATTCTAAGATCAAGGAATTGAAT
GACAACCAAATTCCTATCTTTGCAGCATTGAGAAATAAGAGAAGAAGAGAGATTAGGGCTGAGAGGAAAACAAAGAATAAAAATGATCCTATTTTTTTCAAGAGGTTGAG
GACAAGGTCCATGGACGCTTCCCCGATACCTCCTCCAACCATCTCACCTACCAAGCCAAAAATCAAATCACTTAAGGCTCCATCTCCCAAAAATCCATTCCTAGAAGTCT
TCAAAGATGTCAATTTTCAGGAACGGATGGAGATCATGAGAAAAAAAGACTTCCTGAACGAGAAAGGATTCTCAAACAGAGCTGGAACACTGCCAAAGTTCGTTACCAAA
GTTATTACACAATACAAATGGCAGGAACTCTATGCTCATCCCCAGGAGGTCGTGGTGCCTCTAGTTCGAGAATTCTACACTGGTTTGAGGGAGAAGAGCATGAGCATGAC
AGTGGTGAGAGGTAAGATGGCCAGTTTCTCTTTTGTTGACATCAACAGGGTGTACAGAATCAAGGCACCCTTGCATCCAAGAGGGAACGATGCCATCAAGAACCCCCTCG
TCCAAACAGATGAAAGAAGCACTAAAAATGGTGGCCAACTAGGGTTGATGCCAACAACCCATGATAGCACTATTTCAGTAGAGAGAGTTATGCATTTCTACAGTATCATG
AAGGGGTTGGAGATAAACATCGGGAGCATAATAAGGAAGGAGATCCTTTTGTGTGGAAGGAAGAAAGCAGGGAAGCTATTCTTTGGGTCACTTATCACCCAATTATGTCA
GAGGGTAACAATAGTCCCTAGTAAGGATGAGGAGCGCCACTTCTTCAGGTCTACCATTGATCTACCTCTAATTGGGAAGCTCCAACAGAACAATGCCCAAAGGAAGGACA
AAGCTTCCACATCTCAAGTCACTCCATCACCGGGGCTGAATCTGGCTTCTCCACCTCAACTAGGGGTGAGCGTAAATTTTCGAAAAACCGATCCGACCGATCGAAACCGG
CCAAACCGACGTCGGTCAGTCAGTTTCAGTCAAGATTCGATCGGTTTCGGTTTGCCATTATGCCAAACCGAAATGTCAGTCGACAGTAGACTCCCCTCACTTCAAGTTCA
CGAATCACGTCAGTCGCCGCCCCTCGCCGACCCTCGCCGTCGATTTCTTTTTCTTCTTCTTCGTCTGCAACTCACAGTCACAGTCACCGCGCCGCCGTCCGTTCTTGTTC
TTGAGATTAAGATCACCATTCGACATTCGTCACTTGAGATTATTGTCCATTTCTCTCATCACATTGTCGATTTGTCACGCTGGAAAATTCTTACGAGTGATACTAGAGTT
GCATCTGTTTGTTTTAGTAGTGGGAGTGGTGGTGGTAGTAGTAGTGCTAGTGCTAGTACTAGTGCTAATGTTGATTTGAGATTTGATGATGAAACAATGGACCTAGATGA
GGATGAAAACTACAACTATGATATAATACCTCAGTTCAAAAGGCCCACGCCATCATCTGAGGCCCTAGCATTTGCCTACCGACAGTTGGACCAAATCAGGGATAACCTGA
GGAGTTATTGGGCTTATGCCAAGGAGAGAGATGAAGCTAGGAGAGAGTTTTACCTCTCTGTCGCCCCGAGTATTGCTCCTGTCTTTCCTGATTTCCCTCAATCGTTCTTG
CCTCAAGAAGAAAAGGAAACTGAAGATGAAGATGAAGATGAAGAGAAAGAGATGCCCTCGGATGAGGATTAG
Protein sequenceShow/hide protein sequence
MHGTRGARPTGFSPAIVAQGTNVQTPSSSTMLATSRKNQSSSRPRMSTRTDSIHKTQKPAGQQLEKRSREWYSMIRGTRAQRRATLEEEARLRDAKEIAKVGESSRQGET
LMGNVSQPSSNPSSSCRDKTFMTYKAKKKNVFEDMIRQVVAQAIVISEGYRVEQDALREIQSEREMENQSMREEDDFARKRDLEEEREAERRQEEKDKRSRGPAFEPLHK
AQSEADLLQEREEDALEGPREENPEKRRRKNEENEGQDATAYGPHSKEGQKATEEQPADEVFNPLFKYDPPAAESIYSGEKKDEEEIESEEAKTSSDSETDLDSKIKELN
DNQIPIFAALRNKRRREIRAERKTKNKNDPIFFKRLRTRSMDASPIPPPTISPTKPKIKSLKAPSPKNPFLEVFKDVNFQERMEIMRKKDFLNEKGFSNRAGTLPKFVTK
VITQYKWQELYAHPQEVVVPLVREFYTGLREKSMSMTVVRGKMASFSFVDINRVYRIKAPLHPRGNDAIKNPLVQTDERSTKNGGQLGLMPTTHDSTISVERVMHFYSIM
KGLEINIGSIIRKEILLCGRKKAGKLFFGSLITQLCQRVTIVPSKDEERHFFRSTIDLPLIGKLQQNNAQRKDKASTSQVTPSPGLNLASPPQLGVSVNFRKTDPTDRNR
PNRRRSVSFSQDSIGFGLPLCQTEMSVDSRLPSLQVHESRQSPPLADPRRRFLFLLLRLQLTVTVTAPPSVLVLEIKITIRHSSLEIIVHFSHHIVDLSRWKILTSDTRV
ASVCFSSGSGGGSSSASASTSANVDLRFDDETMDLDEDENYNYDIIPQFKRPTPSSEALAFAYRQLDQIRDNLRSYWAYAKERDEARREFYLSVAPSIAPVFPDFPQSFL
PQEEKETEDEDEDEEKEMPSDED