; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005549 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005549
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:21567263..21577775
RNA-Seq ExpressionLag0005549
SyntenyLag0005549
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEU96560.1 reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]2.4e-3949.72Show/hide
Query:  RFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG---------------VYYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRA
        +F  D K ++WD+P+++K C D +IRRCVSG EA EIL+ CH  P G               V Y+SKWVEA A   NDA+ V +FL+ ++FARFGT RA
Subjt:  RFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG---------------VYYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRA

Query:  LVSDE---------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRGSVAT
        ++SD                D+ALWA+RTAYKTP+G +PYRLVYGK+CHL +ELEHK  WALK  NFD    G + T
Subjt:  LVSDE---------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRGSVAT

GEW27269.1 reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]1.3e-3746.2Show/hide
Query:  HSKAFIPS--DRRHVQRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYGVYYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPR
        H++ FI      +  ++F  D K ++WD+PF++K   D +IRRCVSG +A EIL+ CH  P GV Y+SKWVEA A   ND++ V +FL +++FARFGTPR
Subjt:  HSKAFIPS--DRRHVQRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYGVYYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPR

Query:  ALVSDE-------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRG
        A++SD                          D+ALWA+R AYKT +G + Y+LVYGK+CHL +ELEHK  WALK  NFD    G
Subjt:  ALVSDE-------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRG

GFA68532.1 reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]1.0e-3743Show/hide
Query:  QRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYGVY---------------YVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPR
        Q+F  D + ++WD+P+++K C D IIRRCV+G EA +IL+ CH+ P   Y               Y+SKWVEA A   NDA+ V +FL+S +F+RFGTP+
Subjt:  QRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYGVY---------------YVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPR

Query:  ALVSDE--------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRGSVATLLRTRLGQQER
        A++SD                           ++ALWA+RTA+KTP+G +PYRLVY KSCHL LELEHK  WALK +NFD    G    L    L +   
Subjt:  ALVSDE--------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRGSVATLLRTRLGQQER

Query:  AASRRCL
         A   CL
Subjt:  AASRRCL

XP_016461948.1 PREDICTED: uncharacterized protein LOC107785214, partial [Nicotiana tabacum]1.0e-3740.45Show/hide
Query:  SKAFIPSDRRHVQ--RFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG-------------------------------VYYVSKW
        +   +P D   VQ  RF  +++L+YWDEP++++ C D +IRRC+S  E   IL+ CH+S YG                                YYVSKW
Subjt:  SKAFIPSDRRHVQ--RFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG-------------------------------VYYVSKW

Query:  VEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDE------------------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSC
        VEA+A   NDAK V  FL+ +IF  FGTPRA++SD                                     D+ALWAYRTA+KTP+GMSPY+LV+GK+C
Subjt:  VEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDE------------------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSC

Query:  HLSLELEHKTLWALKKLNFD
        HL +ELEHK  W LK+LN D
Subjt:  HLSLELEHKTLWALKKLNFD

XP_019241380.1 PREDICTED: uncharacterized protein LOC109221357 [Nicotiana attenuata]1.7e-3736.3Show/hide
Query:  ILLLLEFYAN-SEGKGERNQ--------ERHSKAFIPSD-RRHVQRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG--------
        +LLL EF     + KG  NQ        E H         ++  +RF HD   +YWDEP+++KQC D ++RRC+   E + +L  CH+SPYG        
Subjt:  ILLLLEFYAN-SEGKGERNQ--------ERHSKAFIPSD-RRHVQRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG--------

Query:  -------------------------VYYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDE--------------------------------
                                 V YVSKWVEAIA   NDA  V+ F++ +IF+RFGTPRAL+SDE                                
Subjt:  -------------------------VYYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDE--------------------------------

Query:  ----------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRG
                                    D+ALWAYRTAYKTP+G SPY+LVYGK+CHL +ELEHK  WA+KKLN D    G
Subjt:  ----------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRG

TrEMBL top hitse value%identityAlignment
A0A1S3ZC27 uncharacterized protein LOC1077852144.9e-3840.45Show/hide
Query:  SKAFIPSDRRHVQ--RFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG-------------------------------VYYVSKW
        +   +P D   VQ  RF  +++L+YWDEP++++ C D +IRRC+S  E   IL+ CH+S YG                                YYVSKW
Subjt:  SKAFIPSDRRHVQ--RFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG-------------------------------VYYVSKW

Query:  VEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDE------------------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSC
        VEA+A   NDAK V  FL+ +IF  FGTPRA++SD                                     D+ALWAYRTA+KTP+GMSPY+LV+GK+C
Subjt:  VEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDE------------------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSC

Query:  HLSLELEHKTLWALKKLNFD
        HL +ELEHK  W LK+LN D
Subjt:  HLSLELEHKTLWALKKLNFD

A0A1U7Z0W9 uncharacterized protein LOC1045898889.2e-3729.55Show/hide
Query:  MGQVANELKARPQRKLPVDTKHPRTEGKDKVQAVTLRSGKPREEKRKPNNIQDVEKNIDKNVVVEKNLE-SGKSDGGINNNVGASSSVLDVEPPYVPPRL
        +GQ+AN + AR Q  LP + +   T  +++++A++LRSGK  E K+   ++Q      D  +  E+ ++ S  + G   N+  A  S      P+ P RL
Subjt:  MGQVANELKARPQRKLPVDTKHPRTEGKDKVQAVTLRSGKPREEKRKPNNIQDVEKNIDKNVVVEKNLE-SGKSDGGINNNVGASSSVLDVEPPYVPPRL

Query:  MHNIVD-PFTKALSAKVFEGHLESLGLLGPTGNSIRVMRNRAPISTFID-------YPKRIPVQLTSWCSCRIPSKIKILL------LLEFYANSEGKGE
             +  F+K L         + L +  P   ++  M   A    F +       +     V LT  CS  I SK+   L       +     S    +
Subjt:  MHNIVD-PFTKALSAKVFEGHLESLGLLGPTGNSIRVMRNRAPISTFID-------YPKRIPVQLTSWCSCRIPSKIKILL------LLEFYANSEGKGE

Query:  RNQERHSKAFIPSDRRHVQR--FKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPY-------------------------------GV
         +    +   +P D  + Q+  F  + K + W++P++YK C D IIRRCV  +E  +IL  CH S Y                               GV
Subjt:  RNQERHSKAFIPSDRRHVQR--FKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPY-------------------------------GV

Query:  YYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDE----------------------------------------------------------
         YVSKWVEA+A   NDA+ V +FL+  +F+RFG PRA++SD                                                           
Subjt:  YYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDE----------------------------------------------------------

Query:  --DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRGSVATLLRTRLGQ
          D+ALWAYRTAYKTP+GMSPYRL+YGK+CHL +ELEH+  WA+K LNFD    G    L    L +
Subjt:  --DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRGSVATLLRTRLGQ

A0A699GTX9 Reverse transcriptase domain-containing protein6.4e-3846.2Show/hide
Query:  HSKAFIPS--DRRHVQRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYGVYYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPR
        H++ FI      +  ++F  D K ++WD+PF++K   D +IRRCVSG +A EIL+ CH  P GV Y+SKWVEA A   ND++ V +FL +++FARFGTPR
Subjt:  HSKAFIPS--DRRHVQRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYGVYYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPR

Query:  ALVSDE-------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRG
        A++SD                          D+ALWA+R AYKT +G + Y+LVYGK+CHL +ELEHK  WALK  NFD    G
Subjt:  ALVSDE-------------------------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFDAWNRG

A0A6L2K816 Reverse transcriptase1.9e-3747.67Show/hide
Query:  QRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG---------------------------------VYYVSKWVEAIACHQNDAK
        + F  D K ++WDEP++++ C + +IRRCV G EA  IL  CH+ P G                                 V Y+SKWVEA A   NDA+
Subjt:  QRFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG---------------------------------VYYVSKWVEAIACHQNDAK

Query:  TVSRFLQSHIFARFGTPRALVSDEDEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFD
         V +FL+S +FARFGTPRA++ D D+ALWA+RTA+KTP G SPY+LVYGK+CHL ++LEHK  WALK  NFD
Subjt:  TVSRFLQSHIFARFGTPRALVSDEDEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFD

A0A6L2NA98 Integrase catalytic domain-containing protein5.4e-3745.65Show/hide
Query:  RFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG-----------------------------------VYYVSKWVEAIACHQNDA
        +F  D K ++WD+PF++K C D +IRRCV G EA +ILE CH+   G                                   + Y SKWVEA A   NDA
Subjt:  RFKHDAKLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYG-----------------------------------VYYVSKWVEAIACHQNDA

Query:  KTVSRFLQSHIFARFGTPRALVSDE-----------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFD
        + V +FL+S +F RFG PRA++SD            D+ALWA+RTAYKTP+G +PY LVYGK+CHLS+ELEHK  WALK+ NFD
Subjt:  KTVSRFLQSHIFARFGTPRALVSDE-----------DEALWAYRTAYKTPLGMSPYRLVYGKSCHLSLELEHKTLWALKKLNFD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCAGGTAGCTAATGAGCTGAAGGCACGACCTCAAAGGAAACTTCCTGTGGATACTAAACACCCTAGAACGGAAGGTAAGGATAAGGTGCAGGCAGTGACT
CTAAGAAGTGGTAAGCCACGAGAAGAGAAAAGAAAGCCTAATAACATCCAGGATGTAGAGAAGAATATTGACAAAAATGTTGTTGTTGAGAAAAATTTGGAGTCT
GGTAAAAGTGATGGAGGCATCAATAATAATGTTGGAGCATCTAGTTCTGTTCTAGATGTAGAACCACCTTATGTACCGCCCCGCCTTATGCACAACATTGTTGAT
CCGTTTACGAAGGCCCTCTCGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTATTAGGTCCCACTGGTAACTCTATTAGGGTGATGAGAAATCGTGCT
CCCATAAGCACCTTCATCGATTACCCCAAGAGGATACCGGTGCAACTTACGTCGTGGTGTTCTTGTCGAATTCCCAGCAAGATCAAGATCCTTTTGCTGCTGGAA
TTTTATGCAAATAGTGAAGGAAAAGGCGAAAGAAATCAAGAACGTCATTCAAAGGCTTTTATCCCTTCAGATAGGAGGCATGTGCAAAGGTTTAAGCATGATGCA
AAATTGTTTTATTGGGATGAGCCATTTATGTATAAGCAATGTTTTGATGGTATTATTCGAAGATGTGTTTCAGGTGATGAAGCAAAGGAAATCCTCGAGCAATGT
CACTCTTCCCCATATGGAGTTTACTATGTGTCCAAGTGGGTGGAGGCCATTGCATGTCATCAGAATGATGCTAAGACGGTGTCGAGGTTTCTTCAATCGCACATT
TTTGCGCGGTTTGGGACACCTAGGGCTCTTGTGAGCGATGAGGATGAGGCTCTATGGGCTTATAGGACAGCCTATAAGACTCCTTTAGGTATGTCTCCTTATAGG
CTAGTATATGGAAAGTCTTGCCATTTATCGTTAGAACTTGAGCATAAAACATTGTGGGCTCTAAAGAAATTGAATTTTGATGCCTGGAATAGGGGTAGCGTCGCG
ACGCTGCTGCGCACGCGGCTTGGGCAACAAGAAAGGGCAGCGTCGCGACGCTGCCTTGCTAAGCGTCTCGACGCTGTCCAGATTTTCCAGAAAAATCAGCTTCCT
TTTGGGCTGATTTTTGGGTCTTCTTCTTCATTTCTCTTGATTGTGCCGGAGCCGCCACGTCGTCGCTGCCGCAAGCAAGAGGCGGGACGAATCAAGGTGGTTAGG
ACAGACACTCCATCTCCGTCAACAACGGAATCTGAGAAGGAAAATGCAGAGAAAGAGGATCAAGAGAAAGAGAAAACTAAAAAGAAAACTAAAGAGGAGGCCTTG
ATGAAGCAACAAGTAGACAAGGGCAAAGGAGTTGCTGAAGCAACAGTCGAAGCAGAGGAGGCTGAGACTGAAGAACCAAGACTGTCGTATGAGCACTTCGTCAAC
AACCTTGCCAGAGCAAAATACTTGGCAATGCTAAGTGGGACTTCCTATTTGAGAGAGGAGTCAGTAAACTCCAACATAGTTCTCGAGTTCTACGCGAACATTGTT
GAAGAAGAAGACTCCCAAGCGGTTGTCTGTGGGACAACAGTAGACTGGAGCCCAGCATTGGTGAGAGTGGCAAATACGCAAATTCAATATGTCTTCCTTTTTGGA
GACAAGACCGAGTGGGAGGCTGGGGACATTTCGAAATCAAGCCATTGCCGCATCTCCCTCCCTTTGTATGTCTCTTTGCACCACCTGCTCCTTGCCATCGTCGCC
GCTCGTGCTTCAGCGCCGTCGTCGCCACCTCCCCTCGCAAGCCCTCTTCGTGAATCTCTCTCTCTCTCTCTCTCTCTCAATTTTTGGTTCGTGTGGAAGTCGCCG
CCTGTCCAGCCGCTGCCTCCTCGATGCGTCGTCACCACTACAGCAAGGTTGGATCTAAGTTCCTCTAGGTTTGTTTTGGTTAGATCTCCCGTGCCTAGCAATTCA
GAGTCCCGTCGACCTCGGTCTAGCCGATTCCGCCTCTGTCCAGCGACATTCTTGGGCGTTTCCAGCAACGATTTGCAGTTTTGGGAATTTGGAGTTGCTGTTCAA
CAAGGAAATAGTTTTTCTCGATTGAATTTGGCCTTTCAAGCATGTTTAAAGAATCTTCATCGGGTGCCTTGGGTAATATGGCCAAGAGGCGATGCACAGTTCGAG
GCCTTGGGTAATATGGTCAAGGGTCGAACACTGAGCTCCATAAAGAGCATTGTGGTCCTGGGTACAAATGGTCAGGGGACAGTGCGGCTCGAAGGGTCAGTCTTG
GAGAGTTTGGTGAGGGCTACGAGAATTAGAAGTCATCGGGTGCCTTGGGTAATATGGCCAAGGGGCGATGCAACAGTTCGAGGCCTTGGGGGTCAGTGTGACTCG
AAGGGTTTGTCTTGGAGAGCTTTGAAAGGACCCAGAGGAGTCGAGTTGAGAGGACTCAGAGGAGTCGAATTGAGAGAACTCCGTGGAGTTGAGAGGACCCAGAGG
AGTCAGATAGAGCTTGTGCGAGCCACTTACTCAGTACCGTGGTTTTGTACTAACCCACCACCAGGTTTTGCAGGTGCTGCAATCATATTCGAGCTTGGTGATGTA
GAGGAAACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCAGGTAGCTAATGAGCTGAAGGCACGACCTCAAAGGAAACTTCCTGTGGATACTAAACACCCTAGAACGGAAGGTAAGGATAAGGTGCAGGCAGTGACT
CTAAGAAGTGGTAAGCCACGAGAAGAGAAAAGAAAGCCTAATAACATCCAGGATGTAGAGAAGAATATTGACAAAAATGTTGTTGTTGAGAAAAATTTGGAGTCT
GGTAAAAGTGATGGAGGCATCAATAATAATGTTGGAGCATCTAGTTCTGTTCTAGATGTAGAACCACCTTATGTACCGCCCCGCCTTATGCACAACATTGTTGAT
CCGTTTACGAAGGCCCTCTCGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTATTAGGTCCCACTGGTAACTCTATTAGGGTGATGAGAAATCGTGCT
CCCATAAGCACCTTCATCGATTACCCCAAGAGGATACCGGTGCAACTTACGTCGTGGTGTTCTTGTCGAATTCCCAGCAAGATCAAGATCCTTTTGCTGCTGGAA
TTTTATGCAAATAGTGAAGGAAAAGGCGAAAGAAATCAAGAACGTCATTCAAAGGCTTTTATCCCTTCAGATAGGAGGCATGTGCAAAGGTTTAAGCATGATGCA
AAATTGTTTTATTGGGATGAGCCATTTATGTATAAGCAATGTTTTGATGGTATTATTCGAAGATGTGTTTCAGGTGATGAAGCAAAGGAAATCCTCGAGCAATGT
CACTCTTCCCCATATGGAGTTTACTATGTGTCCAAGTGGGTGGAGGCCATTGCATGTCATCAGAATGATGCTAAGACGGTGTCGAGGTTTCTTCAATCGCACATT
TTTGCGCGGTTTGGGACACCTAGGGCTCTTGTGAGCGATGAGGATGAGGCTCTATGGGCTTATAGGACAGCCTATAAGACTCCTTTAGGTATGTCTCCTTATAGG
CTAGTATATGGAAAGTCTTGCCATTTATCGTTAGAACTTGAGCATAAAACATTGTGGGCTCTAAAGAAATTGAATTTTGATGCCTGGAATAGGGGTAGCGTCGCG
ACGCTGCTGCGCACGCGGCTTGGGCAACAAGAAAGGGCAGCGTCGCGACGCTGCCTTGCTAAGCGTCTCGACGCTGTCCAGATTTTCCAGAAAAATCAGCTTCCT
TTTGGGCTGATTTTTGGGTCTTCTTCTTCATTTCTCTTGATTGTGCCGGAGCCGCCACGTCGTCGCTGCCGCAAGCAAGAGGCGGGACGAATCAAGGTGGTTAGG
ACAGACACTCCATCTCCGTCAACAACGGAATCTGAGAAGGAAAATGCAGAGAAAGAGGATCAAGAGAAAGAGAAAACTAAAAAGAAAACTAAAGAGGAGGCCTTG
ATGAAGCAACAAGTAGACAAGGGCAAAGGAGTTGCTGAAGCAACAGTCGAAGCAGAGGAGGCTGAGACTGAAGAACCAAGACTGTCGTATGAGCACTTCGTCAAC
AACCTTGCCAGAGCAAAATACTTGGCAATGCTAAGTGGGACTTCCTATTTGAGAGAGGAGTCAGTAAACTCCAACATAGTTCTCGAGTTCTACGCGAACATTGTT
GAAGAAGAAGACTCCCAAGCGGTTGTCTGTGGGACAACAGTAGACTGGAGCCCAGCATTGGTGAGAGTGGCAAATACGCAAATTCAATATGTCTTCCTTTTTGGA
GACAAGACCGAGTGGGAGGCTGGGGACATTTCGAAATCAAGCCATTGCCGCATCTCCCTCCCTTTGTATGTCTCTTTGCACCACCTGCTCCTTGCCATCGTCGCC
GCTCGTGCTTCAGCGCCGTCGTCGCCACCTCCCCTCGCAAGCCCTCTTCGTGAATCTCTCTCTCTCTCTCTCTCTCTCAATTTTTGGTTCGTGTGGAAGTCGCCG
CCTGTCCAGCCGCTGCCTCCTCGATGCGTCGTCACCACTACAGCAAGGTTGGATCTAAGTTCCTCTAGGTTTGTTTTGGTTAGATCTCCCGTGCCTAGCAATTCA
GAGTCCCGTCGACCTCGGTCTAGCCGATTCCGCCTCTGTCCAGCGACATTCTTGGGCGTTTCCAGCAACGATTTGCAGTTTTGGGAATTTGGAGTTGCTGTTCAA
CAAGGAAATAGTTTTTCTCGATTGAATTTGGCCTTTCAAGCATGTTTAAAGAATCTTCATCGGGTGCCTTGGGTAATATGGCCAAGAGGCGATGCACAGTTCGAG
GCCTTGGGTAATATGGTCAAGGGTCGAACACTGAGCTCCATAAAGAGCATTGTGGTCCTGGGTACAAATGGTCAGGGGACAGTGCGGCTCGAAGGGTCAGTCTTG
GAGAGTTTGGTGAGGGCTACGAGAATTAGAAGTCATCGGGTGCCTTGGGTAATATGGCCAAGGGGCGATGCAACAGTTCGAGGCCTTGGGGGTCAGTGTGACTCG
AAGGGTTTGTCTTGGAGAGCTTTGAAAGGACCCAGAGGAGTCGAGTTGAGAGGACTCAGAGGAGTCGAATTGAGAGAACTCCGTGGAGTTGAGAGGACCCAGAGG
AGTCAGATAGAGCTTGTGCGAGCCACTTACTCAGTACCGTGGTTTTGTACTAACCCACCACCAGGTTTTGCAGGTGCTGCAATCATATTCGAGCTTGGTGATGTA
GAGGAAACGTGA
Protein sequenceShow/hide protein sequence
MGQVANELKARPQRKLPVDTKHPRTEGKDKVQAVTLRSGKPREEKRKPNNIQDVEKNIDKNVVVEKNLESGKSDGGINNNVGASSSVLDVEPPYVPPRLMHNIVD
PFTKALSAKVFEGHLESLGLLGPTGNSIRVMRNRAPISTFIDYPKRIPVQLTSWCSCRIPSKIKILLLLEFYANSEGKGERNQERHSKAFIPSDRRHVQRFKHDA
KLFYWDEPFMYKQCFDGIIRRCVSGDEAKEILEQCHSSPYGVYYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEDEALWAYRTAYKTPLGMSPYR
LVYGKSCHLSLELEHKTLWALKKLNFDAWNRGSVATLLRTRLGQQERAASRRCLAKRLDAVQIFQKNQLPFGLIFGSSSSFLLIVPEPPRRRCRKQEAGRIKVVR
TDTPSPSTTESEKENAEKEDQEKEKTKKKTKEEALMKQQVDKGKGVAEATVEAEEAETEEPRLSYEHFVNNLARAKYLAMLSGTSYLREESVNSNIVLEFYANIV
EEEDSQAVVCGTTVDWSPALVRVANTQIQYVFLFGDKTEWEAGDISKSSHCRISLPLYVSLHHLLLAIVAARASAPSSPPPLASPLRESLSLSLSLNFWFVWKSP
PVQPLPPRCVVTTTARLDLSSSRFVLVRSPVPSNSESRRPRSSRFRLCPATFLGVSSNDLQFWEFGVAVQQGNSFSRLNLAFQACLKNLHRVPWVIWPRGDAQFE
ALGNMVKGRTLSSIKSIVVLGTNGQGTVRLEGSVLESLVRATRIRSHRVPWVIWPRGDATVRGLGGQCDSKGLSWRALKGPRGVELRGLRGVELRELRGVERTQR
SQIELVRATYSVPWFCTNPPPGFAGAAIIFELGDVEET