CuGenDBv2

Spg032954 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg032954
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold11:15770041..15777318
RNA-Seq Expression: Spg032954
Synteny: Spg032954
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
    GO:0016791 - phosphatase activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
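
The genome location in the record above is written as scaffold:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its convention), the span of the gene on the scaffold could be computed as follows. The helper name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold11:15770041..15777318")
# Assumption: both endpoints are included in the feature (1-based, inclusive).
span = end - start + 1
print(scaffold, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be confirmed before relying on the number.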


Homology
GenBank top hits (e-value, % identity, alignment)
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis] (e-value: 9.2e-24, identity: 30.99%)
Query:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPND
        FC HP + +VPLV+EFYA L+ +  +   V    ++F+S  IN V  I     P  +D    +I + + +Q+K+ LK +A  G QW  S     +   ++
Subjt:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPND

Query:  LKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGK
        L+P + VW HF+ +RL+ +TH  TIS +R +LLY ++ G  INVG  I D+I  C  K  G L+F SLI++L  +        E R      +DL  I +
Subjt:  LKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGK

Query:  LQQNSIQRKDKASTSQ--------ATPQSRSNVASPSQSTPFTGPSPSSEALAIAYRQLDQIRENLRTYWVYAKERDEAIREFY
        +     ++ +K    +        +T  + S  A+ SQ       S         +  L Q +E L  +WVY+++RD A+++ +
Subjt:  LQQNSIQRKDKASTSQ--------ATPQSRSNVASPSQSTPFTGPSPSSEALAIAYRQLDQIRENLRTYWVYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e-value: 3.0e-27, identity: 34.93%)
Query:  EDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLK
        + FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N     +   L+ VA  G +W  S     + + + L 
Subjt:  EDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLK

Query:  PESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQ
        P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG  I  EI  C  ++ G LFF SLIT+L +  +     +EE+      ID   + ++ 
Subjt:  PESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQ

Query:  QNSIQRKDKASTSQATPQSRSNVASPSQS
        Q     +    ++Q    SR   AS S++
Subjt:  QNSIQRKDKASTSQATPQSRSNVASPSQS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e-value: 1.3e-33, identity: 31.94%)
Query:  EDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLK
        + FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N   + +   L+ VA  G +W  S     + + + L 
Subjt:  EDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLK

Query:  PESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQ
        P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG  I  EI  C  ++ G LFF SLIT+L +  +     +EE+      ID   + ++ 
Subjt:  PESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQ

Query:  QN--SIQRKDKASTSQATPQSRSNVASPSQSTPFTGPSPSSEALAIAYRQ--LDQIRENLRTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPRE
        Q   +   +  +S+  AT  S        Q         S + +   +    L    +  + +W Y+KERD A+++   +      P FP FPQ +L   
Subjt:  QN--SIQRKDKASTSQATPQSRSNVASPSQSTPFTGPSPSSEALAIAYRQ--LDQIRENLRTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPRE

Query:  DKDSDEENDE
        D + + E+D+
Subjt:  DKDSDEENDE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e-value: 1.1e-24, identity: 35.84%)
Query:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLKPE
        FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +  P++   ++ + +    ++   L+ VA  G +W  S     + + + L P 
Subjt:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLKPE

Query:  SAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVK
        + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG  I  EI  C  +++G LFF SLIT + +  +
Subjt:  SAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVK

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] (e-value: 3.8e-30, identity: 31.76%)
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N    ++   L+ VA  G +W  S     + + + L P + VW HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIK

Query:  NRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS
        +RL+PTTH   +S DR++LL+ ++ G  INVG  I  EI  C  ++ G LFF SLIT+L +   F+  +++          L   G++   ++ R  +  
Subjt:  NRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPQ-SRSNVASPSQSTPFTGPSPSSEALAIAYRQLDQIRENLRTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPREDKDSDEENDE
         +++T Q S S  A+ S S          +AL     Q +   +  + +W Y+KERD A+++   +      P FP FPQ +L   D + + E+D+
Subjt:  TSQATPQ-SRSNVASPSQSTPFTGPSPSSEALAIAYRQLDQIRENLRTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPREDKDSDEENDE

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9HFT1 Uncharacterized protein (e-value: 1.4e-25, identity: 34.25%)
Query:  FRTAIDNSQLLDPGFTGGNYTWVKNRQSETLGWIIWD-LWRHLIIVPFLQAGTEEEVVH---QGRDVLNLL---DLRRVGLVSKSVRI--------SLSE
        FR A+D+   +D G+ G  +TW  NR S   G  +W+ L R +    +L    +  V H    G D   L     +    LV+K  R           +E
Subjt:  FRTAIDNSQLLDPGFTGGNYTWVKNRQSETLGWIIWD-LWRHLIIVPFLQAGTEEEVVH---QGRDVLNLL---DLRRVGLVSKSVRI--------SLSE

Query:  AIKC--------KEEEVRILESGKPENWEANWVEAE-KEFESLLVENEAYWQQRAKEEWLVWGDRNSKWFHARASQRRRRNRIEGLRDSEGQWQSDPSVV
         I          K EE++I E    +   +  + +   E   LL + E  W+QR++ +WL  GDRN+ +FH+RA+ R+RRN I GLRDS+G+W+ DP  V
Subjt:  AIKC--------KEEEVRILESGKPENWEANWVEAE-KEFESLLVENEAYWQQRAKEEWLVWGDRNSKWFHARASQRRRRNRIEGLRDSEGQWQSDPSVV

Query:  ETIVVDYFTNLFFSSSSQLAVDTVLRCVTPTVNDVQNQALLKEFTRAEVEDELR
        +T+++ YF N+F SS+   ++DTVL+C+   + D  N+AL + +T  EVE  LR
Subjt:  ETIVVDYFTNLFFSSSSQLAVDTVLRCVTPTVNDVQNQALLKEFTRAEVEDELR

A0A2P5AGA5 Uncharacterized protein (Fragment) (e-value: 1.5e-27, identity: 34.93%)
Query:  EDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLK
        + FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N     +   L+ VA  G +W  S     + + + L 
Subjt:  EDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLK

Query:  PESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQ
        P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG  I  EI  C  ++ G LFF SLIT+L +  +     +EE+      ID   + ++ 
Subjt:  PESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQ

Query:  QNSIQRKDKASTSQATPQSRSNVASPSQS
        Q     +    ++Q    SR   AS S++
Subjt:  QNSIQRKDKASTSQATPQSRSNVASPSQS

A0A2P5BCG4 Uncharacterized protein (Fragment) (e-value: 6.2e-34, identity: 31.94%)
Query:  EDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLK
        + FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N   + +   L+ VA  G +W  S     + + + L 
Subjt:  EDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLK

Query:  PESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQ
        P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG  I  EI  C  ++ G LFF SLIT+L +  +     +EE+      ID   + ++ 
Subjt:  PESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQ

Query:  QN--SIQRKDKASTSQATPQSRSNVASPSQSTPFTGPSPSSEALAIAYRQ--LDQIRENLRTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPRE
        Q   +   +  +S+  AT  S        Q         S + +   +    L    +  + +W Y+KERD A+++   +      P FP FPQ +L   
Subjt:  QN--SIQRKDKASTSQATPQSRSNVASPSQSTPFTGPSPSSEALAIAYRQ--LDQIRENLRTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPRE

Query:  DKDSDEENDE
        D + + E+D+
Subjt:  DKDSDEENDE

A0A2P5DAQ2 Uncharacterized protein (e-value: 5.2e-25, identity: 35.84%)
Query:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLKPE
        FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +  P++   ++ + +    ++   L+ VA  G +W  S     + + + L P 
Subjt:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLKPE

Query:  SAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVK
        + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG  I  EI  C  +++G LFF SLIT + +  +
Subjt:  SAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVK

A0A2P5DXM3 Uncharacterized protein (e-value: 1.9e-30, identity: 31.76%)
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N    ++   L+ VA  G +W  S     + + + L P + VW HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIK

Query:  NRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS
        +RL+PTTH   +S DR++LL+ ++ G  INVG  I  EI  C  ++ G LFF SLIT+L +   F+  +++          L   G++   ++ R  +  
Subjt:  NRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPQ-SRSNVASPSQSTPFTGPSPSSEALAIAYRQLDQIRENLRTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPREDKDSDEENDE
         +++T Q S S  A+ S S          +AL     Q +   +  + +W Y+KERD A+++   +      P FP FPQ +L   D + + E+D+
Subjt:  TSQATPQ-SRSNVASPSQSTPFTGPSPSSEALAIAYRQLDQIRENLRTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPREDKDSDEENDE

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAGGTCGAGTGGTGAATCAGTGGCTCGATTGCGTGATAGTGCTGGTCAGAAACACAGGCTTGATAGTCCATCTGATGAGGTCCAATCAGGGGTGATTGAGGATTGCTT
ACGATCGGAAAAGAAAGTTCGAGCTAATTCATCTGTGGCTTTTAGGACGGCAATTGATAACTCTCAACTTCTTGACCCAGGATTTACTGGAGGCAATTATACTTGGGTGA
AGAACCGACAGTCAGAGACTTTAGGGTGGATCATTTGGGATTTATGGAGGCATCTGATCATCGTCCCATTCTTGCAAGCTGGGACAGAGGAGGAGGTGGTGCACCAGGGG
AGAGATGTCCTTAACTTGTTAGATTTGAGAAGGGTTGGACTCGTTTCCAAGAGTGTAAGGATATCATTAAGCGAGGCCATCAAGTGTAAAGAGGAGGAGGTTCGTATTTT
GGAGTCTGGCAAGCCAGAAAATTGGGAGGCGAATTGGGTGGAGGCTGAAAAGGAATTTGAGTCTTTGCTAGTTGAGAATGAGGCATATTGGCAACAAAGGGCGAAGGAGG
AATGGTTGGTGTGGGGGGACAGGAATTCGAAGTGGTTCCATGCCCGAGCATCCCAACGGCGTCGGCGTAACAGAATTGAGGGCCTCAGGGATAGTGAGGGCCAATGGCAG
TCAGATCCGAGTGTGGTAGAGACCATTGTTGTTGATTATTTTACTAATTTGTTCTTTTCGTCCTCTTCGCAGTTGGCAGTCGATACGGTACTACGATGTGTCACTCCTAC
AGTTAATGATGTACAGAATCAGGCTTTACTCAAGGAGTTCACTCGGGCTGAGGTAGAGGATGAGCTTCGTTTAGAGACGGTGTCCTCGCTGATTATGGAGGATGGGTCCT
GGGATGTGGAGAAGGTGAGAGGGAAATTTGTGCAGGACGATGATGAACATATTTTGGCTATACCATTAAGTGGCCAGAGAGAGGACGATCAGATATATTGGGCGCCAGAT
GGTAAGGGGCATTTTTCTGTGAAAAGTGCTTATACGCTTGGGGTGTGTTTGGTCGAAAATGCGTCTGGTGCATTGTCATCCACAGATATGGCTCCTGAAACCACTTTACA
TTTGTATGATTGGGATCCTCTGGATCACTTTGCATGGGTTAAAGACCATGGTGTGGAACAAGATGGGACATACTTCGTCCTACTTCTATGGCATATTTGGGAGTTCAGAA
ATTGCAAGATCTTTCGCAAGGGAGAGATCTCCATTGGAATGGTGAAGGAGGCTATTCAGGCATCTCTTTGGGAATTTTTCCGCTCCATTACTCGTGGGACTTGTGGGGAT
GACTATCTAGACTCGCATGGGACAGCAATTCGGGCTGGATTATTGGCTATTGGTGAGATGGGGCCTCCTCGATTGATGGTGGAGTCAGATTGCTTGGTTGCTGTAAATCT
CTTGAATGGTGTGGATGAGAACTTCACCTTAGTTCATTCTGTGACTATGGAGGTGCAGAAGCTTTCTTCTTGTTTGGTGGGAGTACAATTTAAGCATATTCGACGAGGGC
AAAATTTGGTGGCTGATATGTTGGCACAACAAGCAATGAGTCATGGGATTTATGGCACCTGGTTTTCGGGGTTTCCAAGTTGGCTTCTTGAAGCTATAGAGGGTGAACAG
CGAAAGTTGGAAAACAGAGGAAAAGCTGGAATTTCCCATAAATGCGACCGCATTTCTGGAAAGGCAAAAATGAAATGCGACCGCATTTCTGAAAAAATTGAAGTCGTTCC
AAGTCGTCTGCGGGTCGTTGTTGACGAGTCTTCTTCGCACCTAACCGGACCATTGAGGAGAAAGAGAAAGAGAGAGATTAAGGCTGAAAGGAGGACAAAGAACAAGAATG
ATCCAATCTTTGCCAAGAGGCCGAGGACGAGGTCCATGGACGCCTCTCTTGTAGTTCCTCCCACCATCTCACCCGCCAAGCCAAAAGGAAAATCACCTAAGGCTGCATCT
CCCAAAAATCCATTTCTTGAGGACTTCTGTGCTCACCCTCAGGAGGCTGTAGTGCCTTTAGTTCGTGAATTTTATGCCGGCCTGAGGGAGGAGAGCATTAGCATGGCGGT
TGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAATAGGGTGTACAGGATCAAGGCACCCCTGAATCCGAGAGGGAATGATGTGATAAGGAACCCTTTGGCCA
AGCAGATGAAAGATGCATTGAAACTTGTGGCCAATAAGGGGGTTCAATGGAAAGAGTCTCAGACAAAAGTGAAGTCTCTGGTGCCAAACGACCTAAAGCCAGAATCGGCA
GTTTGGCTTCACTTTATCAAGAACCGTTTGATGCCAACCACCCACGATAGCACCATCTCAGTGGATAGAGTTATGTTACTCTATTGCATTATGAAGGGGTTGGAGATCAA
CGTAGGGAGCAATATCAGGGATGAGATTTTAGTCTGTGGGAGAAAAAGAGCGGGCAAGCTTTTCTTTGGATCACTCATCACCCAACTCTATCAGAGGGTGAAGTTCGTTC
CAGGCAAGGACGAGGAGCGTCACTTCTTCAAGCCGACTATTGACCTGTCTTTGATTGGAAAGCTCCAACAGAACAGCATCCAGAGGAAAGATAAAGCCTCCACATCTCAG
GCTACTCCTCAATCAAGGTCGAATGTAGCTTCTCCATCACAGAGCACTCCTTTTACAGGGCCCTCACCATCATCAGAGGCCCTAGCCATTGCCTACCGCCAGCTTGATCA
AATCAGGGAGAACTTGAGAACATATTGGGTATATGCAAAGGAGAGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATTGCCCCGAGTATCGCTCCAGTCTTTCCAAATT
TCCCTCAATCGCTGCTGCCTCGGGAGGACAAGGATTCTGATGAAGAGAATGATGAATAA
mRNA sequence
ATGAGGTCGAGTGGTGAATCAGTGGCTCGATTGCGTGATAGTGCTGGTCAGAAACACAGGCTTGATAGTCCATCTGATGAGGTCCAATCAGGGGTGATTGAGGATTGCTT
ACGATCGGAAAAGAAAGTTCGAGCTAATTCATCTGTGGCTTTTAGGACGGCAATTGATAACTCTCAACTTCTTGACCCAGGATTTACTGGAGGCAATTATACTTGGGTGA
AGAACCGACAGTCAGAGACTTTAGGGTGGATCATTTGGGATTTATGGAGGCATCTGATCATCGTCCCATTCTTGCAAGCTGGGACAGAGGAGGAGGTGGTGCACCAGGGG
AGAGATGTCCTTAACTTGTTAGATTTGAGAAGGGTTGGACTCGTTTCCAAGAGTGTAAGGATATCATTAAGCGAGGCCATCAAGTGTAAAGAGGAGGAGGTTCGTATTTT
GGAGTCTGGCAAGCCAGAAAATTGGGAGGCGAATTGGGTGGAGGCTGAAAAGGAATTTGAGTCTTTGCTAGTTGAGAATGAGGCATATTGGCAACAAAGGGCGAAGGAGG
AATGGTTGGTGTGGGGGGACAGGAATTCGAAGTGGTTCCATGCCCGAGCATCCCAACGGCGTCGGCGTAACAGAATTGAGGGCCTCAGGGATAGTGAGGGCCAATGGCAG
TCAGATCCGAGTGTGGTAGAGACCATTGTTGTTGATTATTTTACTAATTTGTTCTTTTCGTCCTCTTCGCAGTTGGCAGTCGATACGGTACTACGATGTGTCACTCCTAC
AGTTAATGATGTACAGAATCAGGCTTTACTCAAGGAGTTCACTCGGGCTGAGGTAGAGGATGAGCTTCGTTTAGAGACGGTGTCCTCGCTGATTATGGAGGATGGGTCCT
GGGATGTGGAGAAGGTGAGAGGGAAATTTGTGCAGGACGATGATGAACATATTTTGGCTATACCATTAAGTGGCCAGAGAGAGGACGATCAGATATATTGGGCGCCAGAT
GGTAAGGGGCATTTTTCTGTGAAAAGTGCTTATACGCTTGGGGTGTGTTTGGTCGAAAATGCGTCTGGTGCATTGTCATCCACAGATATGGCTCCTGAAACCACTTTACA
TTTGTATGATTGGGATCCTCTGGATCACTTTGCATGGGTTAAAGACCATGGTGTGGAACAAGATGGGACATACTTCGTCCTACTTCTATGGCATATTTGGGAGTTCAGAA
ATTGCAAGATCTTTCGCAAGGGAGAGATCTCCATTGGAATGGTGAAGGAGGCTATTCAGGCATCTCTTTGGGAATTTTTCCGCTCCATTACTCGTGGGACTTGTGGGGAT
GACTATCTAGACTCGCATGGGACAGCAATTCGGGCTGGATTATTGGCTATTGGTGAGATGGGGCCTCCTCGATTGATGGTGGAGTCAGATTGCTTGGTTGCTGTAAATCT
CTTGAATGGTGTGGATGAGAACTTCACCTTAGTTCATTCTGTGACTATGGAGGTGCAGAAGCTTTCTTCTTGTTTGGTGGGAGTACAATTTAAGCATATTCGACGAGGGC
AAAATTTGGTGGCTGATATGTTGGCACAACAAGCAATGAGTCATGGGATTTATGGCACCTGGTTTTCGGGGTTTCCAAGTTGGCTTCTTGAAGCTATAGAGGGTGAACAG
CGAAAGTTGGAAAACAGAGGAAAAGCTGGAATTTCCCATAAATGCGACCGCATTTCTGGAAAGGCAAAAATGAAATGCGACCGCATTTCTGAAAAAATTGAAGTCGTTCC
AAGTCGTCTGCGGGTCGTTGTTGACGAGTCTTCTTCGCACCTAACCGGACCATTGAGGAGAAAGAGAAAGAGAGAGATTAAGGCTGAAAGGAGGACAAAGAACAAGAATG
ATCCAATCTTTGCCAAGAGGCCGAGGACGAGGTCCATGGACGCCTCTCTTGTAGTTCCTCCCACCATCTCACCCGCCAAGCCAAAAGGAAAATCACCTAAGGCTGCATCT
CCCAAAAATCCATTTCTTGAGGACTTCTGTGCTCACCCTCAGGAGGCTGTAGTGCCTTTAGTTCGTGAATTTTATGCCGGCCTGAGGGAGGAGAGCATTAGCATGGCGGT
TGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAATAGGGTGTACAGGATCAAGGCACCCCTGAATCCGAGAGGGAATGATGTGATAAGGAACCCTTTGGCCA
AGCAGATGAAAGATGCATTGAAACTTGTGGCCAATAAGGGGGTTCAATGGAAAGAGTCTCAGACAAAAGTGAAGTCTCTGGTGCCAAACGACCTAAAGCCAGAATCGGCA
GTTTGGCTTCACTTTATCAAGAACCGTTTGATGCCAACCACCCACGATAGCACCATCTCAGTGGATAGAGTTATGTTACTCTATTGCATTATGAAGGGGTTGGAGATCAA
CGTAGGGAGCAATATCAGGGATGAGATTTTAGTCTGTGGGAGAAAAAGAGCGGGCAAGCTTTTCTTTGGATCACTCATCACCCAACTCTATCAGAGGGTGAAGTTCGTTC
CAGGCAAGGACGAGGAGCGTCACTTCTTCAAGCCGACTATTGACCTGTCTTTGATTGGAAAGCTCCAACAGAACAGCATCCAGAGGAAAGATAAAGCCTCCACATCTCAG
GCTACTCCTCAATCAAGGTCGAATGTAGCTTCTCCATCACAGAGCACTCCTTTTACAGGGCCCTCACCATCATCAGAGGCCCTAGCCATTGCCTACCGCCAGCTTGATCA
AATCAGGGAGAACTTGAGAACATATTGGGTATATGCAAAGGAGAGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATTGCCCCGAGTATCGCTCCAGTCTTTCCAAATT
TCCCTCAATCGCTGCTGCCTCGGGAGGACAAGGATTCTGATGAAGAGAATGATGAATAA
Protein sequence
MRSSGESVARLRDSAGQKHRLDSPSDEVQSGVIEDCLRSEKKVRANSSVAFRTAIDNSQLLDPGFTGGNYTWVKNRQSETLGWIIWDLWRHLIIVPFLQAGTEEEVVHQG
RDVLNLLDLRRVGLVSKSVRISLSEAIKCKEEEVRILESGKPENWEANWVEAEKEFESLLVENEAYWQQRAKEEWLVWGDRNSKWFHARASQRRRRNRIEGLRDSEGQWQ
SDPSVVETIVVDYFTNLFFSSSSQLAVDTVLRCVTPTVNDVQNQALLKEFTRAEVEDELRLETVSSLIMEDGSWDVEKVRGKFVQDDDEHILAIPLSGQREDDQIYWAPD
GKGHFSVKSAYTLGVCLVENASGALSSTDMAPETTLHLYDWDPLDHFAWVKDHGVEQDGTYFVLLLWHIWEFRNCKIFRKGEISIGMVKEAIQASLWEFFRSITRGTCGD
DYLDSHGTAIRAGLLAIGEMGPPRLMVESDCLVAVNLLNGVDENFTLVHSVTMEVQKLSSCLVGVQFKHIRRGQNLVADMLAQQAMSHGIYGTWFSGFPSWLLEAIEGEQ
RKLENRGKAGISHKCDRISGKAKMKCDRISEKIEVVPSRLRVVVDESSSHLTGPLRRKRKREIKAERRTKNKNDPIFAKRPRTRSMDASLVVPPTISPAKPKGKSPKAAS
PKNPFLEDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPLAKQMKDALKLVANKGVQWKESQTKVKSLVPNDLKPESA
VWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSNIRDEILVCGRKRAGKLFFGSLITQLYQRVKFVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQ
ATPQSRSNVASPSQSTPFTGPSPSSEALAIAYRQLDQIRENLRTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPREDKDSDEENDE
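
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. The function name `translate` and the two short test strings are illustrative: they are the first 24 and the last 15 nucleotides copied from the CDS sequence on this page, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order for each
# codon position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAGGTCGAGTGGTGAATCAGTG"  # first 24 nt of the CDS above
cds_end = "GAGAATGATGAATAA"             # last 15 nt of the CDS above
print(translate(cds_start))  # MRSSGESV
print(translate(cds_end))    # ENDE
```

Both results agree with the beginning (MRSSGESV...) and end (...ENDE) of the protein sequence shown on this page. Passing the full CDS, joined into a single string, would be the way to check the whole protein; ambiguous bases such as N are not handled by this sketch and would raise a KeyError.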