; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027136 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027136
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold8:4883048..4886107
RNA-Seq ExpressionSpg027136
SyntenySpg027136
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]2.1e-2427.36Show/hide
Query:  FINNLAIAKHAEMLKRDLLFERGF------SEDLPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSPSAIN--------
        F++  A   +  +  R + FE GF      + +L   +   +T H W+ F      VN  +V EFY+NI E     V+VRG+++ ++P+AIN        
Subjt:  FINNLAIAKHAEMLKRDLLFERGF------SEDLPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSPSAIN--------

Query:  --------------SLDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASEIFGCW
                      +   +LE + + G +W   +++++T     L      W  F+K +L+P +H++TVS + +LL   IL   +ID+GKII      C 
Subjt:  --------------SLDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASEIFGCW

Query:  RKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQEARQGGLVCGIHKILEQ---RALSASRQEFVERQSQ----------TFWSY
        +++   L FPN IT L ++  V E   D IL     ++ + +  L   +EA+         ++      RA S   ++ V+R  Q           +++Y
Subjt:  RKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQEARQGGLVCGIHKILEQ---RALSASRQEFVERQSQ----------TFWSY

Query:  VKRRDANLKNALQESFSK
         KRRDA L +AL ES  +
Subjt:  VKRRDANLKNALQESFSK

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]2.6e-2227.98Show/hide
Query:  LPYNRFINNLAIAKHAEMLKRDLLFERG--FSEDLPHFLRAGITN----HGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSPSAI----
        + + +F N+ A A+      R+L FE G  F+E+        + +      W  F     SVN  +V EFYANI +     + VRG  + ++  AI    
Subjt:  LPYNRFINNLAIAKHAEMLKRDLLFERG--FSEDLPHFLRAGITN----HGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSPSAI----

Query:  ------------------NSLDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASE
                          N  D VLE +  E  +W   +  + +     L+     W  F+K +L+P +H++TVS   +LL   ++ S  IDVG+II  +
Subjt:  ------------------NSLDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASE

Query:  IFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQEARQGGLV----CGIHKILEQRALSASRQEFVERQSQ---------
        +  C  KK   L FPN IT L ++  V ENA D IL     I    L  L   +  +    V     G  +   +  L A  +   + Q+Q         
Subjt:  IFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQEARQGGLV----CGIHKILEQRALSASRQEFVERQSQ---------

Query:  TFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLL
         F+ YVK RD  +++  QE        FP FPD++L
Subjt:  TFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.1e-2335.5Show/hide
Query:  ETEKPRLPYNRFINNLAIAKHAEMLKRDLLFERGFSED-------LPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSP
        ETE     Y   I N           R L  E+GF  D       LP F+   IT H W+ FC+  E     +V EFYAN+ +     V VRGV V WS 
Subjt:  ETEKPRLPYNRFINNLAIAKHAEMLKRDLLFERGFSED-------LPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSP

Query:  SAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDV
         AIN+                      L  VLE V V GA+W +S     T   + L      W  F+K  LLP TH  TVS++ +LL   +L   SI+V
Subjt:  SAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDV

Query:  GKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQE
        G++I SEI  C  +K G LFFP+ IT L + A       +  L + G ID   +AR+  TQE
Subjt:  GKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.6e-3533.69Show/hide
Query:  ETEKPRLPYNRFINNLAIAKHAEMLKRDLLFERGFSED-------LPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSP
        ETE     Y   I N           R L  E+GF  D       LP F+   IT H W+ FC+  E     +V EFYAN+ + E   V VRGV V WS 
Subjt:  ETEKPRLPYNRFINNLAIAKHAEMLKRDLLFERGFSED-------LPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSP

Query:  SAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDV
         AIN+                      L  VLE V   GA+W +S     T   + L      W  F+K RLLP TH  TVS++ +LL   +L   SI+V
Subjt:  SAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDV

Query:  GKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARL------QRTQEA-----------RQGGLVCGIHKILEQRALSA
        G++I SEI  C  +K G LFFP+ IT L + A       +  L + G ID   +AR+      + TQ+            R  G +    K LEQR    
Subjt:  GKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARL------QRTQEA-----------RQGGLVCGIHKILEQRALSA

Query:  SRQEF--------VERQSQTFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLLNPWIPPPPMEREEEDDEEQGQE
          Q++          +Q Q FW+Y K RD  LK ALQ +F++P P FP FP ++L         E E E D++   E
Subjt:  SRQEF--------VERQSQTFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLLNPWIPPPPMEREEEDDEEQGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]5.2e-3136.36Show/hide
Query:  VVCEFYANIDEEEGFQVIVRGVAVDWSPSAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRL
        +V EFYAN+ + E   + VRGV V WS  AIN+                      L  VLE V   GA+W +S     T   + L      W  F+K RL
Subjt:  VVCEFYANIDEEEGFQVIVRGVAVDWSPSAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRL

Query:  LPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARL------QRTQE----
        LP TH   VS++ +LL   +L   SI+VG++I SEI  C  +K G LFFP+ IT L + A    N E   L + G ID   +AR+      + TQ+    
Subjt:  LPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARL------QRTQE----

Query:  -------ARQGGLVCGIHKILEQRALSASRQEFVERQSQTFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLLNPWIPPPPMEREEEDDEEQGQE
               +R  G V    K LEQR    S+QE   +Q Q FW+Y K RD  LK ALQ +F++P P FP FP ++L         E E E D++   E
Subjt:  -------ARQGGLVCGIHKILEQRALSASRQEFVERQSQTFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLLNPWIPPPPMEREEEDDEEQGQE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)5.1e-2435.5Show/hide
Query:  ETEKPRLPYNRFINNLAIAKHAEMLKRDLLFERGFSED-------LPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSP
        ETE     Y   I N           R L  E+GF  D       LP F+   IT H W+ FC+  E     +V EFYAN+ +     V VRGV V WS 
Subjt:  ETEKPRLPYNRFINNLAIAKHAEMLKRDLLFERGFSED-------LPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSP

Query:  SAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDV
         AIN+                      L  VLE V V GA+W +S     T   + L      W  F+K  LLP TH  TVS++ +LL   +L   SI+V
Subjt:  SAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDV

Query:  GKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQE
        G++I SEI  C  +K G LFFP+ IT L + A       +  L + G ID   +AR+  TQE
Subjt:  GKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)2.2e-3533.69Show/hide
Query:  ETEKPRLPYNRFINNLAIAKHAEMLKRDLLFERGFSED-------LPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSP
        ETE     Y   I N           R L  E+GF  D       LP F+   IT H W+ FC+  E     +V EFYAN+ + E   V VRGV V WS 
Subjt:  ETEKPRLPYNRFINNLAIAKHAEMLKRDLLFERGFSED-------LPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSP

Query:  SAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDV
         AIN+                      L  VLE V   GA+W +S     T   + L      W  F+K RLLP TH  TVS++ +LL   +L   SI+V
Subjt:  SAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDV

Query:  GKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARL------QRTQEA-----------RQGGLVCGIHKILEQRALSA
        G++I SEI  C  +K G LFFP+ IT L + A       +  L + G ID   +AR+      + TQ+            R  G +    K LEQR    
Subjt:  GKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARL------QRTQEA-----------RQGGLVCGIHKILEQRALSA

Query:  SRQEF--------VERQSQTFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLLNPWIPPPPMEREEEDDEEQGQE
          Q++          +Q Q FW+Y K RD  LK ALQ +F++P P FP FP ++L         E E E D++   E
Subjt:  SRQEF--------VERQSQTFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLLNPWIPPPPMEREEEDDEEQGQE

A0A2P5DXM3 Uncharacterized protein2.5e-3136.36Show/hide
Query:  VVCEFYANIDEEEGFQVIVRGVAVDWSPSAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRL
        +V EFYAN+ + E   + VRGV V WS  AIN+                      L  VLE V   GA+W +S     T   + L      W  F+K RL
Subjt:  VVCEFYANIDEEEGFQVIVRGVAVDWSPSAINS----------------------LDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRL

Query:  LPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARL------QRTQE----
        LP TH   VS++ +LL   +L   SI+VG++I SEI  C  +K G LFFP+ IT L + A    N E   L + G ID   +AR+      + TQ+    
Subjt:  LPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARL------QRTQE----

Query:  -------ARQGGLVCGIHKILEQRALSASRQEFVERQSQTFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLLNPWIPPPPMEREEEDDEEQGQE
               +R  G V    K LEQR    S+QE   +Q Q FW+Y K RD  LK ALQ +F++P P FP FP ++L         E E E D++   E
Subjt:  -------ARQGGLVCGIHKILEQRALSASRQEFVERQSQTFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLLNPWIPPPPMEREEEDDEEQGQE

A0A6A2ZUE4 Uncharacterized protein1.0e-2427.36Show/hide
Query:  FINNLAIAKHAEMLKRDLLFERGF------SEDLPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSPSAIN--------
        F++  A   +  +  R + FE GF      + +L   +   +T H W+ F      VN  +V EFY+NI E     V+VRG+++ ++P+AIN        
Subjt:  FINNLAIAKHAEMLKRDLLFERGF------SEDLPHFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSPSAIN--------

Query:  --------------SLDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASEIFGCW
                      +   +LE + + G +W   +++++T     L      W  F+K +L+P +H++TVS + +LL   IL   +ID+GKII      C 
Subjt:  --------------SLDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASEIFGCW

Query:  RKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQEARQGGLVCGIHKILEQ---RALSASRQEFVERQSQ----------TFWSY
        +++   L FPN IT L ++  V E   D IL     ++ + +  L   +EA+         ++      RA S   ++ V+R  Q           +++Y
Subjt:  RKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQEARQGGLVCGIHKILEQ---RALSASRQEFVERQSQ----------TFWSY

Query:  VKRRDANLKNALQESFSK
         KRRDA L +AL ES  +
Subjt:  VKRRDANLKNALQESFSK

A0A6A3BU96 Uncharacterized protein1.3e-2227.98Show/hide
Query:  LPYNRFINNLAIAKHAEMLKRDLLFERG--FSEDLPHFLRAGITN----HGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSPSAI----
        + + +F N+ A A+      R+L FE G  F+E+        + +      W  F     SVN  +V EFYANI +     + VRG  + ++  AI    
Subjt:  LPYNRFINNLAIAKHAEMLKRDLLFERG--FSEDLPHFLRAGITN----HGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSPSAI----

Query:  ------------------NSLDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASE
                          N  D VLE +  E  +W   +  + +     L+     W  F+K +L+P +H++TVS   +LL   ++ S  IDVG+II  +
Subjt:  ------------------NSLDAVLE-VGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDSTVSREHVLLAFEILRSLSIDVGKIIASE

Query:  IFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQEARQGGLV----CGIHKILEQRALSASRQEFVERQSQ---------
        +  C  KK   L FPN IT L ++  V ENA D IL     I    L  L   +  +    V     G  +   +  L A  +   + Q+Q         
Subjt:  IFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQEARQGGLV----CGIHKILEQRALSASRQEFVERQSQ---------

Query:  TFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLL
         F+ YVK RD  +++  QE        FP FPD++L
Subjt:  TFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGGAAGAAAGGAAAGAGACAATGAAGAAGAGAGGGTGTCGATTACCTCCGAAGCATCGAAAACTAAGGCGAAGAAAAAGAAAACACTAGAAGAGAA
AGAAGCTAAAAGAAGGAAGAGACAACAGAGGGCTGAGAATCAAGAAATCGTACAGAAGGTAGTGGAGGATATTGCTGCCGGAGAAGTTGAAGAAGGCAATCCGAAGGAGC
CTGAAGGGCAAAACCCAGGGCAGACTGACCCGATAGTTGCAGTTATCAAGGAGGTTCAAGAAAAACAGACCGAGGATGCACGAGAAGGACAGACAGAGGATGCAATGGAA
AAAGGCAATGAGCAGGTGGAACAAGAGCAGGAGGCTCTAGTGGAGGTTATCATGCCAGAAAATGCAGAGAGAGTAGAACAGGAGAAAAAAGAGGCCGAAGACAAAGCAAG
AGAGGAAGCAGAGAAAAAAGCAGAAGAAGAAATTTTGCTCAAACAAAGGGAAGACAAGGGCAACGGTATTGTTGAAGCATTGCCGGAATCTGAAGATAGTGAAACAGAGA
AGCCGCGGTTACCATACAATCGCTTCATCAACAATCTTGCCATAGCAAAGCATGCTGAGATGTTGAAGAGAGACTTACTGTTTGAGAGAGGATTTAGTGAGGATCTTCCA
CATTTTCTGCGGGCCGGCATTACGAACCACGGTTGGGAATTATTCTGCTCCAAGCATGAATCTGTGAACACGCATGTAGTGTGCGAGTTTTATGCAAACATTGATGAGGA
AGAGGGTTTCCAAGTTATCGTTCGAGGAGTAGCAGTTGACTGGAGTCCTAGTGCCATTAACTCCCTCGATGCTGTTCTGGAAGTTGGTGTTGAAGGGGCGCAGTGGAGAC
TTTCGAAAATAGAGAAAAGGACATTTCAGGCAGCTTATCTAAAGAAGGAAACAAATACATGGATGGGATTCATTAAACAAAGGTTGCTTCCAATGACTCATGACTCGACG
GTTTCTAGGGAACATGTTCTTCTGGCTTTTGAGATTTTAAGGTCTCTCAGTATTGATGTGGGCAAGATTATTGCGAGTGAGATATTTGGATGCTGGCGGAAGAAAGTTGG
GAAGTTGTTTTTCCCGAATACAATTACAATGCTTTCCAAGCGAGCAGGGGTTTCGGAGAATGCAGAAGACATCATATTATTCGACAAGGGAATCATTGATACGTCTAACT
TGGCACGACTTCAGCGTACGCAAGAAGCACGTCAGGGTGGGCTTGTATGTGGCATTCACAAGATTTTAGAACAACGTGCACTGTCGGCCAGCAGGCAAGAGTTTGTCGAG
AGGCAATCTCAAACTTTCTGGAGCTATGTTAAACGTCGTGATGCCAACCTTAAAAATGCTCTACAGGAAAGTTTTTCCAAACCATATCCAGCCTTTCCAGTATTCCCTGA
TGATTTATTGAACCCCTGGATTCCGCCCCCACCAATGGAAAGAGAAGAAGAGGATGATGAAGAGCAGGGCCAGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGGAAGAAAGGAAAGAGACAATGAAGAAGAGAGGGTGTCGATTACCTCCGAAGCATCGAAAACTAAGGCGAAGAAAAAGAAAACACTAGAAGAGAA
AGAAGCTAAAAGAAGGAAGAGACAACAGAGGGCTGAGAATCAAGAAATCGTACAGAAGGTAGTGGAGGATATTGCTGCCGGAGAAGTTGAAGAAGGCAATCCGAAGGAGC
CTGAAGGGCAAAACCCAGGGCAGACTGACCCGATAGTTGCAGTTATCAAGGAGGTTCAAGAAAAACAGACCGAGGATGCACGAGAAGGACAGACAGAGGATGCAATGGAA
AAAGGCAATGAGCAGGTGGAACAAGAGCAGGAGGCTCTAGTGGAGGTTATCATGCCAGAAAATGCAGAGAGAGTAGAACAGGAGAAAAAAGAGGCCGAAGACAAAGCAAG
AGAGGAAGCAGAGAAAAAAGCAGAAGAAGAAATTTTGCTCAAACAAAGGGAAGACAAGGGCAACGGTATTGTTGAAGCATTGCCGGAATCTGAAGATAGTGAAACAGAGA
AGCCGCGGTTACCATACAATCGCTTCATCAACAATCTTGCCATAGCAAAGCATGCTGAGATGTTGAAGAGAGACTTACTGTTTGAGAGAGGATTTAGTGAGGATCTTCCA
CATTTTCTGCGGGCCGGCATTACGAACCACGGTTGGGAATTATTCTGCTCCAAGCATGAATCTGTGAACACGCATGTAGTGTGCGAGTTTTATGCAAACATTGATGAGGA
AGAGGGTTTCCAAGTTATCGTTCGAGGAGTAGCAGTTGACTGGAGTCCTAGTGCCATTAACTCCCTCGATGCTGTTCTGGAAGTTGGTGTTGAAGGGGCGCAGTGGAGAC
TTTCGAAAATAGAGAAAAGGACATTTCAGGCAGCTTATCTAAAGAAGGAAACAAATACATGGATGGGATTCATTAAACAAAGGTTGCTTCCAATGACTCATGACTCGACG
GTTTCTAGGGAACATGTTCTTCTGGCTTTTGAGATTTTAAGGTCTCTCAGTATTGATGTGGGCAAGATTATTGCGAGTGAGATATTTGGATGCTGGCGGAAGAAAGTTGG
GAAGTTGTTTTTCCCGAATACAATTACAATGCTTTCCAAGCGAGCAGGGGTTTCGGAGAATGCAGAAGACATCATATTATTCGACAAGGGAATCATTGATACGTCTAACT
TGGCACGACTTCAGCGTACGCAAGAAGCACGTCAGGGTGGGCTTGTATGTGGCATTCACAAGATTTTAGAACAACGTGCACTGTCGGCCAGCAGGCAAGAGTTTGTCGAG
AGGCAATCTCAAACTTTCTGGAGCTATGTTAAACGTCGTGATGCCAACCTTAAAAATGCTCTACAGGAAAGTTTTTCCAAACCATATCCAGCCTTTCCAGTATTCCCTGA
TGATTTATTGAACCCCTGGATTCCGCCCCCACCAATGGAAAGAGAAGAAGAGGATGATGAAGAGCAGGGCCAGGAAGATTGA
Protein sequenceShow/hide protein sequence
MAKTRGRKERDNEEERVSITSEASKTKAKKKKTLEEKEAKRRKRQQRAENQEIVQKVVEDIAAGEVEEGNPKEPEGQNPGQTDPIVAVIKEVQEKQTEDAREGQTEDAME
KGNEQVEQEQEALVEVIMPENAERVEQEKKEAEDKAREEAEKKAEEEILLKQREDKGNGIVEALPESEDSETEKPRLPYNRFINNLAIAKHAEMLKRDLLFERGFSEDLP
HFLRAGITNHGWELFCSKHESVNTHVVCEFYANIDEEEGFQVIVRGVAVDWSPSAINSLDAVLEVGVEGAQWRLSKIEKRTFQAAYLKKETNTWMGFIKQRLLPMTHDST
VSREHVLLAFEILRSLSIDVGKIIASEIFGCWRKKVGKLFFPNTITMLSKRAGVSENAEDIILFDKGIIDTSNLARLQRTQEARQGGLVCGIHKILEQRALSASRQEFVE
RQSQTFWSYVKRRDANLKNALQESFSKPYPAFPVFPDDLLNPWIPPPPMEREEEDDEEQGQED