; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg009055 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg009055
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase
Genome locationscaffold8:17912394..17929869
RNA-Seq ExpressionSpg009055
SyntenySpg009055
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149380.1 uncharacterized protein LOC111017810 [Momordica charantia]9.3e-1935.68Show/hide
Query:  VVNGGSGRSGPPDVPEVNSHPEANPPIPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC-------
        VV GG    GP   P+          +P + PQV L AEALQV+L NA      Q     +A    ++ QF R F +  PP F+G  E   A        
Subjt:  VVNGGSGRSGPPDVPEVNSHPEANPPIPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC-------

Query:  --------------VQGAASMLRGHALTWWNVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGLVMLGRPATFAAALA
                      V+GA  MLRG A+  WN    EF+ L QG +TVAQY  +F E S    + +  E  + + FI+GLR EI+GL++L  P T+AAA+ 
Subjt:  --------------VQGAASMLRGHALTWWNVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGLVMLGRPATFAAALA

Query:  SAPMLDNDIPSAEQSQEVGTSSSTKRK
         A ++D  +   +  Q +G+SS  KRK
Subjt:  SAPMLDNDIPSAEQSQEVGTSSSTKRK

XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]5.1e-1733.06Show/hide
Query:  HPEANPPIPPLVPQVVLTAEALQVMLSNAVHNNLQHA-GANQAPAHGKDAQFFRSFMKAKPPSFDGHPE---------------------SSQACVQGAA
        H  A+P    L  +V L   ALQ ++ N++      A    QA A   +AQF R F +  PP+F+G  E                     S Q  V+GA 
Subjt:  HPEANPPIPPLVPQVVLTAEALQVMLSNAVHNNLQHA-GANQAPAHGKDAQFFRSFMKAKPPSFDGHPE---------------------SSQACVQGAA

Query:  SMLRGHALTWWNVV---------------------------------EAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGL
         MLRG AL WW+VV                                 E EF+ L Q T+ VAQY ++F EFS    +L+  EA +   F+ GL   I+G 
Subjt:  SMLRGHALTWWNVV---------------------------------EAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGL

Query:  VMLGRPATFAAALASAPMLDND-IPSAEQSQEVGTSSSTKRK
        + L RP T+A A+  A ++D D I  A+  Q+VG SS  KRK
Subjt:  VMLGRPATFAAALASAPMLDND-IPSAEQSQEVGTSSSTKRK

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]1.9e-1632Show/hide
Query:  PEVNSHPEANPPIPPLVP-QVVLTAEALQVMLSNAVHNNLQHAGANQAPAHG----KDAQFFRSFMKAKPPSFDGHPESSQAC-----------------
        P       A+  +PP+VP +VVL AEALQV+L NA   N       Q P+ G    ++ QF R F +  PP F+G  E   A                  
Subjt:  PEVNSHPEANPPIPPLVP-QVVLTAEALQVMLSNAVHNNLQHAGANQAPAHG----KDAQFFRSFMKAKPPSFDGHPESSQAC-----------------

Query:  ----VQGAASMLRGHALTWW---------------------------------NVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFIN
            V+GA  ML+G A+ WW                                 N   AEF+ L Q ++ VAQY R+F E S    + +  E  + + FI+
Subjt:  ----VQGAASMLRGHALTWW---------------------------------NVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFIN

Query:  GLRSEIRGLVMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRK
        GLR EI+GL++L  P T+AAA+  A ++D  +   +  Q +G+SS  KRK
Subjt:  GLRSEIRGLVMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRK

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.9e-1632.14Show/hide
Query:  PEANPP-IPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC---------------------VQGAA
        P+A P  +P + PQV L AEALQV+L NA      Q     +A     + QF R F    PP F+G  E   A                      V+GA 
Subjt:  PEANPP-IPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC---------------------VQGAA

Query:  SMLRGHALTWWNVVEA---------------------------------EFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGL
         MLRG A+ WW  V A                                 EF+ L QG++TVAQY R+F E S    + V  E  + + FI+GLR EI+GL
Subjt:  SMLRGHALTWWNVVEA---------------------------------EFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGL

Query:  VMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRKHEELVPAPSQK
        ++L  P T+AAA+  A ++D  +   +  Q +G++S  KRK      + S +
Subjt:  VMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRKHEELVPAPSQK

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]5.1e-1732.68Show/hide
Query:  GRSGPPDVPEVNSHPEANPPIPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC-------------
        GR  PP VP+      A   +P + PQV L AEALQV+L NA      Q     +A     + QF R F +  PP F+G  E   A              
Subjt:  GRSGPPDVPEVNSHPEANPPIPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC-------------

Query:  --------VQGAASMLRGHALTWW---------------------------------NVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTN
                V+GA  MLRG A+ WW                                 N   AEF+ L QG++TVAQY R+F E S    + +  E  + +
Subjt:  --------VQGAASMLRGHALTWW---------------------------------NVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTN

Query:  WFINGLRSEIRGLVMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRK
         FI+GLR EI+GL+++  P T+AAA+  A ++D  +   +  Q +G+SS  KRK
Subjt:  WFINGLRSEIRGLVMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRK

TrEMBL top hitse value%identityAlignment
A0A6J1D5J7 uncharacterized protein LOC1110178104.5e-1935.68Show/hide
Query:  VVNGGSGRSGPPDVPEVNSHPEANPPIPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC-------
        VV GG    GP   P+          +P + PQV L AEALQV+L NA      Q     +A    ++ QF R F +  PP F+G  E   A        
Subjt:  VVNGGSGRSGPPDVPEVNSHPEANPPIPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC-------

Query:  --------------VQGAASMLRGHALTWWNVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGLVMLGRPATFAAALA
                      V+GA  MLRG A+  WN    EF+ L QG +TVAQY  +F E S    + +  E  + + FI+GLR EI+GL++L  P T+AAA+ 
Subjt:  --------------VQGAASMLRGHALTWWNVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGLVMLGRPATFAAALA

Query:  SAPMLDNDIPSAEQSQEVGTSSSTKRK
         A ++D  +   +  Q +G+SS  KRK
Subjt:  SAPMLDNDIPSAEQSQEVGTSSSTKRK

A0A6J1DCW8 uncharacterized protein LOC1110196032.5e-1733.06Show/hide
Query:  HPEANPPIPPLVPQVVLTAEALQVMLSNAVHNNLQHA-GANQAPAHGKDAQFFRSFMKAKPPSFDGHPE---------------------SSQACVQGAA
        H  A+P    L  +V L   ALQ ++ N++      A    QA A   +AQF R F +  PP+F+G  E                     S Q  V+GA 
Subjt:  HPEANPPIPPLVPQVVLTAEALQVMLSNAVHNNLQHA-GANQAPAHGKDAQFFRSFMKAKPPSFDGHPE---------------------SSQACVQGAA

Query:  SMLRGHALTWWNVV---------------------------------EAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGL
         MLRG AL WW+VV                                 E EF+ L Q T+ VAQY ++F EFS    +L+  EA +   F+ GL   I+G 
Subjt:  SMLRGHALTWWNVV---------------------------------EAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGL

Query:  VMLGRPATFAAALASAPMLDND-IPSAEQSQEVGTSSSTKRK
        + L RP T+A A+  A ++D D I  A+  Q+VG SS  KRK
Subjt:  VMLGRPATFAAALASAPMLDND-IPSAEQSQEVGTSSSTKRK

A0A6J1DNV8 uncharacterized protein LOC1110229259.4e-1732Show/hide
Query:  PEVNSHPEANPPIPPLVP-QVVLTAEALQVMLSNAVHNNLQHAGANQAPAHG----KDAQFFRSFMKAKPPSFDGHPESSQAC-----------------
        P       A+  +PP+VP +VVL AEALQV+L NA   N       Q P+ G    ++ QF R F +  PP F+G  E   A                  
Subjt:  PEVNSHPEANPPIPPLVP-QVVLTAEALQVMLSNAVHNNLQHAGANQAPAHG----KDAQFFRSFMKAKPPSFDGHPESSQAC-----------------

Query:  ----VQGAASMLRGHALTWW---------------------------------NVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFIN
            V+GA  ML+G A+ WW                                 N   AEF+ L Q ++ VAQY R+F E S    + +  E  + + FI+
Subjt:  ----VQGAASMLRGHALTWW---------------------------------NVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFIN

Query:  GLRSEIRGLVMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRK
        GLR EI+GL++L  P T+AAA+  A ++D  +   +  Q +G+SS  KRK
Subjt:  GLRSEIRGLVMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRK

A0A6J1DQB9 Reverse transcriptase9.4e-1732.14Show/hide
Query:  PEANPP-IPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC---------------------VQGAA
        P+A P  +P + PQV L AEALQV+L NA      Q     +A     + QF R F    PP F+G  E   A                      V+GA 
Subjt:  PEANPP-IPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC---------------------VQGAA

Query:  SMLRGHALTWWNVVEA---------------------------------EFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGL
         MLRG A+ WW  V A                                 EF+ L QG++TVAQY R+F E S    + V  E  + + FI+GLR EI+GL
Subjt:  SMLRGHALTWWNVVEA---------------------------------EFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGL

Query:  VMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRKHEELVPAPSQK
        ++L  P T+AAA+  A ++D  +   +  Q +G++S  KRK      + S +
Subjt:  VMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRKHEELVPAPSQK

A0A6J1DTA8 uncharacterized protein LOC1110241142.5e-1732.68Show/hide
Query:  GRSGPPDVPEVNSHPEANPPIPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC-------------
        GR  PP VP+      A   +P + PQV L AEALQV+L NA      Q     +A     + QF R F +  PP F+G  E   A              
Subjt:  GRSGPPDVPEVNSHPEANPPIPPLVPQVVLTAEALQVMLSNA-VHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQAC-------------

Query:  --------VQGAASMLRGHALTWW---------------------------------NVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTN
                V+GA  MLRG A+ WW                                 N   AEF+ L QG++TVAQY R+F E S    + +  E  + +
Subjt:  --------VQGAASMLRGHALTWW---------------------------------NVVEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTN

Query:  WFINGLRSEIRGLVMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRK
         FI+GLR EI+GL+++  P T+AAA+  A ++D  +   +  Q +G+SS  KRK
Subjt:  WFINGLRSEIRGLVMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRK

SwissProt top hitse value%identityAlignment
Q8W586 GPN-loop GTPase QQT26.7e-0460Show/hide
Query:  MEKLRKDMESSKGQTVVLNTSLKD-DNSKTKMVDDEDEEINEEDEDDDDN
        MEKLRKDMESS+G TVVLNT LKD D ++  M++++DE+   EDE+D D+
Subjt:  MEKLRKDMESSKGQTVVLNTSLKD-DNSKTKMVDDEDEEINEEDEDDDDN

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.0e-0430.51Show/hide
Query:  FWKNFWKIKAIPKAKICSWRIIHNSLPTNMNIAKKGIPINTLCAFCRSHNEDSSHIFWE
        ++K  W    +PK     W +  N L T   +   G+ I  +C  C SH+E  +H+F+E
Subjt:  FWKNFWKIKAIPKAKICSWRIIHNSLPTNMNIAKKGIPINTLCAFCRSHNEDSSHIFWE

AT4G21800.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.8e-0560Show/hide
Query:  MEKLRKDMESSKGQTVVLNTSLKD-DNSKTKMVDDEDEEINEEDEDDDDN
        MEKLRKDMESS+G TVVLNT LKD D ++  M++++DE+   EDE+D D+
Subjt:  MEKLRKDMESSKGQTVVLNTSLKD-DNSKTKMVDDEDEEINEEDEDDDDN

AT4G21800.2 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.8e-0560Show/hide
Query:  MEKLRKDMESSKGQTVVLNTSLKD-DNSKTKMVDDEDEEINEEDEDDDDN
        MEKLRKDMESS+G TVVLNT LKD D ++  M++++DE+   EDE+D D+
Subjt:  MEKLRKDMESSKGQTVVLNTSLKD-DNSKTKMVDDEDEEINEEDEDDDDN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGTTGAGGAAAGATATGGAGAGCTCCAAGGGACAAACTGTGGTTTTGAACACCAGTTTGAAGGACGACAATAGTAAAACTAAGATGGTCGATGACGAAGATGA
AGAGATTAATGAGGAAGATGAAGACGACGATGATAATGACAGATTTACCGAAGCGAGGATGAGGATGAAGATGAAGAGGTTGCCAGATTCTCCTTTTGGTAGTGAAACAA
ACATGATTGCATATAGGAGAACAATGATTAAACATCATTTAGCACATACTAGAAAAAATGTTGCTCCATGTAGTAAAATTTCGGATGAGGTTCGAGATATTTTCAAGAAC
TTATTGAATGGAAATAAAAGAAATAAAGATGGAGATGATTTTAATGATGAAGATGTTGTTGAGAGCAAGAAGAAACAGATGGAAAGTTGCAGAGACGATCCTCAATACCC
CAACTGGGGGCAAAGACTCAAGAGATTAGATCATCTGGGACAGAGACAAAAAAGACCTTTTCTCAGTCAAGAACGCTATCACCTTGCAGCTTCTCTAGAGTCCAGCAAAG
AAGCCTCGTCCTCTAGGGGCTCCTCTTGTGATAGATTTTGGAAAAACTTTTGGAAGATCAAAGCAATCCCAAAAGCCAAAATCTGCTCTTGGAGGATTATCCACAATTCC
TTGCCCACTAATATGAATATTGCTAAGAAAGGCATTCCTATTAACACCCTTTGTGCTTTTTGCAGGTCCCACAACGAAGATTCTAGCCATATATTCTGGGAATGGTCTTC
GCTTTGCTTGGGCTATAAACGAATTCGAATCCGCTGGTCAGTAAAGTTGTTGGAATTGAAAGCCATCATCGAAGGATTGAAATGCCTTTCTTCCTCCGGTGTGCTTGAAG
ACTCTCCGATCGCCACCCCCGTTGTGGTTGAATCCGACGCCATCTCTGTGGTGAAGCTGCTAAATGCGGAAGAAAACGACATTTCAAAAATTCCCTTTTTGATCGAGGAA
ATTAACGAGCTAAAGAAATCCTTCAGGGATCTTACCTTTGTTTTTTGCCCGAGGAGTTGCAATGTCGCTGCTGATCGCCTGGCTCGCATGGCTTGCTCTCTGTCGCTGGG
TTCTTTTTTGGATTCCCCTCCTCTCGTCGAAGAGGATGAGGGATTTTGGTGGGGACCTCCCCTTCTTGTATTAAAAACCTCCTTTATGAGGAGTCCAGAATTGCATTCGG
AATGCGCCCACATCTGGAAAAGGAAGAATTTAGCAGAGCATGGAAATGTAGATTTGCACCCACATCTGGAAAAAAAGATATCTAGCAGACTCCTCTATTCGAAAGAAGTA
GAGGGGAAAGACTTGTCTCTCCTTGGAAAGGTTAAGAGCAACCCCAGCCGCCCCCCCCTTCGCGTGTGTGTAACCGCTGCACAACCCTGCCCCCGCCGCCCAGTCGATCG
CCTCGTCGTCGCCGGTGTTGAAGTTTCGCCACCCCGCCAGCCCAGTCGTCGCGTCGTCGCCACCCAGTCCGCGCGATTCTTTTTCTCCACGGGTTCTCTCTCCCGCGAAC
CCCTCTCTTACCGTGGGTTTGTTCGTCCATGGCTCACTCCCTCTCTACAGTCTCTCTCGTGTGAGGTCGTGACCGTCCAGTCGTCCAGCCGCCTCGACGCGCCGCCACCG
CTTGCTCGTGCCGCCACCGCGTTTCCTCTCTGTTCCGTGGGTTTCAAGATATTGCGCAGACAGCAGCTCGAGCCTCATTTCCTCGCGTTTTCGCCTCTATTCAGCAGCGT
CATTGGGCGTTTTTGGCATCACTTAGCGATTCTGGTAAGATTTAACTCTTATCCATGTTCGGTTGGTGTTAGAATCAAAATGCCCAATGGTTTAGGTGTTGGAATTTTAA
TGTTAGCGTTGATTTTCTATTCTGTAGCGCCGCCCAAGTGTTCGATTGAGTTCGATACACTTCAACGTGGTCACCCACTGCCCAAGGAGTGTTCTAACACACTGTTCGAG
CATGTTCGGTGTTTATCTGCTTCTCTTATATTTGTGCTAAGTTGTAGTAGCGTCTCTAGGCTAGAGGTTGGAAAACCTGGGGCGTTACAGTTGGTATCAGAGCGATCATG
GGTGAGAGTGGCCAATACGCCGACTCAATATGCCTTCCTTTTTTGGGACAAGACCGAGTGGGAGGCTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACTTC
TAGGAGAAGTAGATGAGTGTTGTTCATTAGAGGAGCGCTGCGGGAAAGATCAAATGGTGGTGTTCTTCGTTGTTGCAGAGCAATCAATAATCATTCTTGAAGTATTCAGA
GTCGTCAACGGAGGAAGTGGACGTTCTGGCCCTCCAGACGTTCCTGAAGTCAATTCGCATCCCGAGGCGAATCCCCCTATTCCTCCTCTAGTGCCTCAAGTGGTGCTGAC
AGCGGAGGCATTGCAAGTTATGCTTAGCAATGCAGTCCATAATAACTTGCAGCACGCCGGTGCAAATCAAGCCCCTGCTCATGGCAAAGATGCGCAATTTTTCCGGAGCT
TCATGAAGGCTAAGCCTCCTTCGTTCGATGGTCATCCTGAAAGTTCACAAGCGTGTGTCCAAGGAGCCGCCTCTATGCTCAGAGGCCATGCTCTCACCTGGTGGAACGTA
GTGGAAGCGGAGTTCGTCTCGCTCGTGCAAGGGACCATGACCGTGGCGCAGTATGTTAGAAGGTTTGAGGAATTTTCTTGTCGAGTCCCTGAGTTGGTCGCCATCGAGGC
AAGCAGGACCAATTGGTTCATCAATGGTTTGCGTTCTGAGATTAGAGGGTTGGTCATGCTAGGACGACCAGCCACTTTCGCAGCGGCTCTTGCAAGCGCTCCAATGTTGG
ATAATGACATCCCTAGTGCGGAGCAGTCCCAGGAGGTGGGCACGTCGTCTAGTACCAAGAGGAAGCATGAAGAGTTAGTGCCTGCACCAAGTCAGAAGGTCAGAAGGTCT
CTGCCAGGGACCAGTGAGTGCATTGTGGAGAAAGTCCAGCAAGAGGAGTTCTTGCCCTGGGTCATCGATGAAGAGCTCGGATACCAACTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGTTGAGGAAAGATATGGAGAGCTCCAAGGGACAAACTGTGGTTTTGAACACCAGTTTGAAGGACGACAATAGTAAAACTAAGATGGTCGATGACGAAGATGA
AGAGATTAATGAGGAAGATGAAGACGACGATGATAATGACAGATTTACCGAAGCGAGGATGAGGATGAAGATGAAGAGGTTGCCAGATTCTCCTTTTGGTAGTGAAACAA
ACATGATTGCATATAGGAGAACAATGATTAAACATCATTTAGCACATACTAGAAAAAATGTTGCTCCATGTAGTAAAATTTCGGATGAGGTTCGAGATATTTTCAAGAAC
TTATTGAATGGAAATAAAAGAAATAAAGATGGAGATGATTTTAATGATGAAGATGTTGTTGAGAGCAAGAAGAAACAGATGGAAAGTTGCAGAGACGATCCTCAATACCC
CAACTGGGGGCAAAGACTCAAGAGATTAGATCATCTGGGACAGAGACAAAAAAGACCTTTTCTCAGTCAAGAACGCTATCACCTTGCAGCTTCTCTAGAGTCCAGCAAAG
AAGCCTCGTCCTCTAGGGGCTCCTCTTGTGATAGATTTTGGAAAAACTTTTGGAAGATCAAAGCAATCCCAAAAGCCAAAATCTGCTCTTGGAGGATTATCCACAATTCC
TTGCCCACTAATATGAATATTGCTAAGAAAGGCATTCCTATTAACACCCTTTGTGCTTTTTGCAGGTCCCACAACGAAGATTCTAGCCATATATTCTGGGAATGGTCTTC
GCTTTGCTTGGGCTATAAACGAATTCGAATCCGCTGGTCAGTAAAGTTGTTGGAATTGAAAGCCATCATCGAAGGATTGAAATGCCTTTCTTCCTCCGGTGTGCTTGAAG
ACTCTCCGATCGCCACCCCCGTTGTGGTTGAATCCGACGCCATCTCTGTGGTGAAGCTGCTAAATGCGGAAGAAAACGACATTTCAAAAATTCCCTTTTTGATCGAGGAA
ATTAACGAGCTAAAGAAATCCTTCAGGGATCTTACCTTTGTTTTTTGCCCGAGGAGTTGCAATGTCGCTGCTGATCGCCTGGCTCGCATGGCTTGCTCTCTGTCGCTGGG
TTCTTTTTTGGATTCCCCTCCTCTCGTCGAAGAGGATGAGGGATTTTGGTGGGGACCTCCCCTTCTTGTATTAAAAACCTCCTTTATGAGGAGTCCAGAATTGCATTCGG
AATGCGCCCACATCTGGAAAAGGAAGAATTTAGCAGAGCATGGAAATGTAGATTTGCACCCACATCTGGAAAAAAAGATATCTAGCAGACTCCTCTATTCGAAAGAAGTA
GAGGGGAAAGACTTGTCTCTCCTTGGAAAGGTTAAGAGCAACCCCAGCCGCCCCCCCCTTCGCGTGTGTGTAACCGCTGCACAACCCTGCCCCCGCCGCCCAGTCGATCG
CCTCGTCGTCGCCGGTGTTGAAGTTTCGCCACCCCGCCAGCCCAGTCGTCGCGTCGTCGCCACCCAGTCCGCGCGATTCTTTTTCTCCACGGGTTCTCTCTCCCGCGAAC
CCCTCTCTTACCGTGGGTTTGTTCGTCCATGGCTCACTCCCTCTCTACAGTCTCTCTCGTGTGAGGTCGTGACCGTCCAGTCGTCCAGCCGCCTCGACGCGCCGCCACCG
CTTGCTCGTGCCGCCACCGCGTTTCCTCTCTGTTCCGTGGGTTTCAAGATATTGCGCAGACAGCAGCTCGAGCCTCATTTCCTCGCGTTTTCGCCTCTATTCAGCAGCGT
CATTGGGCGTTTTTGGCATCACTTAGCGATTCTGGTAAGATTTAACTCTTATCCATGTTCGGTTGGTGTTAGAATCAAAATGCCCAATGGTTTAGGTGTTGGAATTTTAA
TGTTAGCGTTGATTTTCTATTCTGTAGCGCCGCCCAAGTGTTCGATTGAGTTCGATACACTTCAACGTGGTCACCCACTGCCCAAGGAGTGTTCTAACACACTGTTCGAG
CATGTTCGGTGTTTATCTGCTTCTCTTATATTTGTGCTAAGTTGTAGTAGCGTCTCTAGGCTAGAGGTTGGAAAACCTGGGGCGTTACAGTTGGTATCAGAGCGATCATG
GGTGAGAGTGGCCAATACGCCGACTCAATATGCCTTCCTTTTTTGGGACAAGACCGAGTGGGAGGCTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACTTC
TAGGAGAAGTAGATGAGTGTTGTTCATTAGAGGAGCGCTGCGGGAAAGATCAAATGGTGGTGTTCTTCGTTGTTGCAGAGCAATCAATAATCATTCTTGAAGTATTCAGA
GTCGTCAACGGAGGAAGTGGACGTTCTGGCCCTCCAGACGTTCCTGAAGTCAATTCGCATCCCGAGGCGAATCCCCCTATTCCTCCTCTAGTGCCTCAAGTGGTGCTGAC
AGCGGAGGCATTGCAAGTTATGCTTAGCAATGCAGTCCATAATAACTTGCAGCACGCCGGTGCAAATCAAGCCCCTGCTCATGGCAAAGATGCGCAATTTTTCCGGAGCT
TCATGAAGGCTAAGCCTCCTTCGTTCGATGGTCATCCTGAAAGTTCACAAGCGTGTGTCCAAGGAGCCGCCTCTATGCTCAGAGGCCATGCTCTCACCTGGTGGAACGTA
GTGGAAGCGGAGTTCGTCTCGCTCGTGCAAGGGACCATGACCGTGGCGCAGTATGTTAGAAGGTTTGAGGAATTTTCTTGTCGAGTCCCTGAGTTGGTCGCCATCGAGGC
AAGCAGGACCAATTGGTTCATCAATGGTTTGCGTTCTGAGATTAGAGGGTTGGTCATGCTAGGACGACCAGCCACTTTCGCAGCGGCTCTTGCAAGCGCTCCAATGTTGG
ATAATGACATCCCTAGTGCGGAGCAGTCCCAGGAGGTGGGCACGTCGTCTAGTACCAAGAGGAAGCATGAAGAGTTAGTGCCTGCACCAAGTCAGAAGGTCAGAAGGTCT
CTGCCAGGGACCAGTGAGTGCATTGTGGAGAAAGTCCAGCAAGAGGAGTTCTTGCCCTGGGTCATCGATGAAGAGCTCGGATACCAACTGTAA
Protein sequenceShow/hide protein sequence
MEKLRKDMESSKGQTVVLNTSLKDDNSKTKMVDDEDEEINEEDEDDDDNDRFTEARMRMKMKRLPDSPFGSETNMIAYRRTMIKHHLAHTRKNVAPCSKISDEVRDIFKN
LLNGNKRNKDGDDFNDEDVVESKKKQMESCRDDPQYPNWGQRLKRLDHLGQRQKRPFLSQERYHLAASLESSKEASSSRGSSCDRFWKNFWKIKAIPKAKICSWRIIHNS
LPTNMNIAKKGIPINTLCAFCRSHNEDSSHIFWEWSSLCLGYKRIRIRWSVKLLELKAIIEGLKCLSSSGVLEDSPIATPVVVESDAISVVKLLNAEENDISKIPFLIEE
INELKKSFRDLTFVFCPRSCNVAADRLARMACSLSLGSFLDSPPLVEEDEGFWWGPPLLVLKTSFMRSPELHSECAHIWKRKNLAEHGNVDLHPHLEKKISSRLLYSKEV
EGKDLSLLGKVKSNPSRPPLRVCVTAAQPCPRRPVDRLVVAGVEVSPPRQPSRRVVATQSARFFFSTGSLSREPLSYRGFVRPWLTPSLQSLSCEVVTVQSSSRLDAPPP
LARAATAFPLCSVGFKILRRQQLEPHFLAFSPLFSSVIGRFWHHLAILVRFNSYPCSVGVRIKMPNGLGVGILMLALIFYSVAPPKCSIEFDTLQRGHPLPKECSNTLFE
HVRCLSASLIFVLSCSSVSRLEVGKPGALQLVSERSWVRVANTPTQYAFLFWDKTEWEAGDMTTQEGIHSFPLLGEVDECCSLEERCGKDQMVVFFVVAEQSIIILEVFR
VVNGGSGRSGPPDVPEVNSHPEANPPIPPLVPQVVLTAEALQVMLSNAVHNNLQHAGANQAPAHGKDAQFFRSFMKAKPPSFDGHPESSQACVQGAASMLRGHALTWWNV
VEAEFVSLVQGTMTVAQYVRRFEEFSCRVPELVAIEASRTNWFINGLRSEIRGLVMLGRPATFAAALASAPMLDNDIPSAEQSQEVGTSSSTKRKHEELVPAPSQKVRRS
LPGTSECIVEKVQQEEFLPWVIDEELGYQL