CuGenDBv2

Lag0022465 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022465
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr7:29732375..29738944
RNA-Seq Expression: Lag0022465
Synteny: Lag0022465
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia] (e-value: 8.6e-52; identity: 44.76%)
Query:  APMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQ
        A + +   ALQ + DN     A     P        E+QFIRDF+RYGPP+F+G+SE     E WI +LEAL+  + C+D LK++GAVFML+ +A  WW 
Subjt:  APMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQ

Query:  SVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAED
         VA  EDH N PI+W   KDLLYDYYFP+T                     YE+KFT  SRFA DL+ T  RKIKRF++GL + I+G + L RP T+AE 
Subjt:  SVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAED

Query:  SLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRR
          GAL+MDK+V +K QP  + G +SG KRK+ P+ +   +P++  P++
Subjt:  SLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRR

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia] (e-value: 8.0e-50; identity: 52.07%)
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET------
        E+ FI+DFKRYGPP+FDG+SE   AAE WI +LEA +  + C D  K++GAVFML+ +A  WW S+AAAEDHAN  I W RFKDLLYDYY+ ET      
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET------

Query:  ---------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRN
                       YERKFT LSRFA +L+     KIKRF+KGL + IRG V L RPA++AE   GALIMDK+VS K     E GS+SG KRK  P   
Subjt:  ---------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRN

Query:  PPI--EPTQQQPRRQVP
         P    P  Q   R +P
Subjt:  PPI--EPTQQQPRRQVP

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia] (e-value: 2.6e-48; identity: 45.7%)
Query:  PPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDD
        PP  P   +L+  EALQ + DN         + P+    + EE QFIRDFKR+GPP F+G SE P AAE W+ +LEAL+  + C+D  K+RGAVFML+ +
Subjt:  PPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDD

Query:  ARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRP
        A  WW+SVAAAEDHAN P++W RFKDLLY+YYFP T                     YERKFT LSRF    + T + KI +FI GLR EI+G + L  P
Subjt:  ARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRP

Query:  ATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPL-RNPPIEPTQQQPRRQ
         T+A     AL+MDK + ++PQ     GS+SG KRK +    + P    Q   +RQ
Subjt:  ATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPL-RNPPIEPTQQQPRRQ

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] (e-value: 6.6e-52; identity: 51.54%)
Query:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLY
        PP      P++  E++FI+DFKRYGPP+FDG+SE   A E WI +LEAL+  + C D  K++GAVFML+ +A  WW SVAAAED+AN PI W RFK+LLY
Subjt:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLY

Query:  DYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGS
        DYY+PET                     YERKFT LSRFA +L+ T   KIKRF+KGLR+ IRG V L RP T+AE   GAL+MDK+VS K  P  E GS
Subjt:  DYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGS

Query:  TSGDKRKL-SPLRNPPIEPTQQQPRRQ
        +SG KRK  S   +  +   Q+Q + Q
Subjt:  TSGDKRKL-SPLRNPPIEPTQQQPRRQ

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia] (e-value: 3.3e-51; identity: 46.38%)
Query:  PPGQRRVDPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALF
        P G+   DPPPPP        PP PPAA   +   AL      +     +PPR+ +  P++  E+QFI+DFKRYGPP+F G SE    AE W+ +LEAL+
Subjt:  PPGQRRVDPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALF

Query:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERK
          + C D  K++GAVFML+ +A  WW SVAA EDHAN P+ W RFK+LLYD+Y+ ET                     YERKFT LS FA +L+ T   K
Subjt:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERK

Query:  IKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQ
        IKRF+KGL + IRGSV L RP T+AE   G LIMDK+VS + QP +E GS+ G KRK+ P          Q+P +Q
Subjt:  IKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQ

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1DCW8 uncharacterized protein LOC111019603 (e-value: 4.2e-52; identity: 44.76%)
Query:  APMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQ
        A + +   ALQ + DN     A     P        E+QFIRDF+RYGPP+F+G+SE     E WI +LEAL+  + C+D LK++GAVFML+ +A  WW 
Subjt:  APMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQ

Query:  SVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAED
         VA  EDH N PI+W   KDLLYDYYFP+T                     YE+KFT  SRFA DL+ T  RKIKRF++GL + I+G + L RP T+AE 
Subjt:  SVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAED

Query:  SLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRR
          GAL+MDK+V +K QP  + G +SG KRK+ P+ +   +P++  P++
Subjt:  SLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRR

A0A6J1DL73 uncharacterized protein LOC111022144 (e-value: 3.9e-50; identity: 52.07%)
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET------
        E+ FI+DFKRYGPP+FDG+SE   AAE WI +LEA +  + C D  K++GAVFML+ +A  WW S+AAAEDHAN  I W RFKDLLYDYY+ ET      
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET------

Query:  ---------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRN
                       YERKFT LSRFA +L+     KIKRF+KGL + IRG V L RPA++AE   GALIMDK+VS K     E GS+SG KRK  P   
Subjt:  ---------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRN

Query:  PPI--EPTQQQPRRQVP
         P    P  Q   R +P
Subjt:  PPI--EPTQQQPRRQVP

A0A6J1DNV8 uncharacterized protein LOC111022925 (e-value: 1.3e-48; identity: 45.7%)
Query:  PPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDD
        PP  P   +L+  EALQ + DN         + P+    + EE QFIRDFKR+GPP F+G SE P AAE W+ +LEAL+  + C+D  K+RGAVFML+ +
Subjt:  PPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDD

Query:  ARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRP
        A  WW+SVAAAEDHAN P++W RFKDLLY+YYFP T                     YERKFT LSRF    + T + KI +FI GLR EI+G + L  P
Subjt:  ARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRP

Query:  ATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPL-RNPPIEPTQQQPRRQ
         T+A     AL+MDK + ++PQ     GS+SG KRK +    + P    Q   +RQ
Subjt:  ATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPL-RNPPIEPTQQQPRRQ

A0A6J1DUM2 uncharacterized protein LOC111023247 (e-value: 3.2e-52; identity: 51.54%)
Query:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLY
        PP      P++  E++FI+DFKRYGPP+FDG+SE   A E WI +LEAL+  + C D  K++GAVFML+ +A  WW SVAAAED+AN PI W RFK+LLY
Subjt:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLY

Query:  DYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGS
        DYY+PET                     YERKFT LSRFA +L+ T   KIKRF+KGLR+ IRG V L RP T+AE   GAL+MDK+VS K  P  E GS
Subjt:  DYYFPET---------------------YERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGS

Query:  TSGDKRKL-SPLRNPPIEPTQQQPRRQ
        +SG KRK  S   +  +   Q+Q + Q
Subjt:  TSGDKRKL-SPLRNPPIEPTQQQPRRQ

A0A6J1DVA0 uncharacterized protein LOC111023424 (e-value: 1.6e-51; identity: 46.38%)
Query:  PPGQRRVDPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALF
        P G+   DPPPPP        PP PPAA   +   AL      +     +PPR+ +  P++  E+QFI+DFKRYGPP+F G SE    AE W+ +LEAL+
Subjt:  PPGQRRVDPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALF

Query:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERK
          + C D  K++GAVFML+ +A  WW SVAA EDHAN P+ W RFK+LLYD+Y+ ET                     YERKFT LS FA +L+ T   K
Subjt:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPET---------------------YERKFTTLSRFAPDLVSTPERK

Query:  IKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQ
        IKRF+KGL + IRGSV L RP T+AE   G LIMDK+VS + QP +E GS+ G KRK+ P          Q+P +Q
Subjt:  IKRFIKGLREEIRGSVALSRPATFAEDSLGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQ

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAATTTTGGGTTCCCACAAGCCATATCTCAGCGACCCAAGAGAATAGTGGGTGATTCGAGTGGTGGTGTCCGTTGGAAGTCCCAAGAACGAGATCCTTGGAGCATTGT
GCAGTGTTCTTCGGAATCCGTTGGAATTGTTTTAGAAGTGTTAATCTCATTCTGGCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGTTGTT
TAGGGTTATGGTCTTCCTCGTTCTCCTCTTCATCACCAGGATCAGGTTGCTGCTTAGACGTTCCGATTGTCGATCAAGATGATCAAGTAGAGGAAGTTACTACTCAGCAG
GGGGTCGATCCTCTGGCTCCCCCTATGCAGGAGGCTAATCCCCTGATTCCTCCCGGTCAGCGCAGGGTTGATCCTCCTCCTCCCCCGCCTCCTCCGGCCCCTCCTGCAGC
TCCTATGCTGATCACTCCGGAAGCCCTCCAGACCATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACCGCGAAACCCTAATTGGGTACCTGAGAACGCGGAGGAAT
CCCAGTTCATTAGGGACTTCAAGCGCTACGGGCCTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCGGCAGAGCGATGGATCACTGATTTGGAGGCACTGTTTGAC
CTCATGAACTGTAATGATTCCTTGAAAATCAGAGGGGCAGTTTTCATGCTCAAGGATGACGCTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAGACCATGCTAATCG
GCCGATCTCGTGGGAAAGGTTCAAGGATCTGTTGTATGATTATTACTTCCCGGAGACGTATGAGAGGAAGTTCACTACACTATCACGCTTTGCTCCTGACCTGGTCAGCA
CGCCAGAGCGGAAGATTAAGAGGTTCATTAAAGGTCTTCGTGAGGAAATTCGAGGCTCTGTAGCCTTAAGCAGGCCCGCGACCTTTGCTGAAGACTCACTGGGTGCATTG
ATCATGGATAAGAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAGGGATCAACCTCTGGAGATAAAAGAAAGTTGTCTCCCCTGAGGAACCCACCTATTGAGCCTAC
TCAGCAACAGCCCAGACGCCAAGTGCCCAAGAAGGTTAGCCAAGCAAATATCAATGGAGTCCTTAAAGGTGGGAAGTCGATTTCACCTCTCATTTCCAGACCAAGAGTTA
GAGCAATTCCCTCATGCAGTGAGACCTCAATCCAAAGCAGAGATAGTGAGAATCAATATCGAGTCATGACTGCAGCGACAGTTGTCGTATCGACAAAGTTACCTTGGTGT
GCAGTTATAAGGAACCAAGGGTTGCAAGATCCAGTTCGGATTGGGTTTTGTCAGACCTATGGTGTTCTTAAAGCACTTGGTCAAGCTGCCTGTAAGCGAGCTTTGCTTCA
AGTGCTAGGAAAGTGGCTAGATGAGGAAAAGTCGCTGCCGGCAGCAGACCCATCGTCGATCTCTCTCCCTCTCCCCCGCGCGCATCTCTCTCTCCTGTGGTTGGTCTGCG
CCGTCTCTTCTCTCCTTCGTGCGTGGAGTTGCAGCCGTGGATCTCTCTCTCTCCCGTGTTCTGACGCCAACAGCAGCTGGTCCGTGACCCCTGCATCCGCGAGCGCCGGC
CAGCACCCACATCTTCGGTTCTTCTCCTTCAACTCGCGTGTAAGGCGTTTTGGACGTGATTGGGTTGATTTAAGGTTTGTGTCGGCTTGTTCGGGCTCTTTTGCGCCGCC
TAGGTGTTCGATTAGATTCGAAACACATCGACTTGAATACCCACTGCCCAAGGATCGTTCTAGTGCGTTGTTCGAGGATTGTTTAGGAGCGTTTGAGCTTCCGCGTGTTG
GTTCGGTTTGCATTCGAGAGTCAACCATGAGTTGTGAGATGTGGTCATGTAAATGCTTAGTGGGCATGAACATGAGTCTAGAAGCATGTTGCAGGGCATATGCATTCTGG
AGATATAAAGAGGAGCACCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCTGGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTTAGGTCG
AATGCCGAGCTCTGTAGAGAAGTGTCGAGGCCCTGAGTATAAATGGTCAGGGGTCGGTACAACTCGAAGGGTCAGTCTTGGAGAGTTTTTGCGGAGCGTGACGCAGAAGT
CACGTGTGAAGAGCCATGTGAGGGCTAAATATGTGTTGAGCGAGCTCGTGGAGCTTGAATGGTTAGCGAGTTGCGTAGGAGGGGCTACTTACCAGTACCATGGTTGTACT
GACCCCCTCCCTTCTCTTCCCCCCAACTTTCAGATGATGCAGGTTACGAGGACCGTCTTGACCATGGTGATGCAGAGGAGATGCATGAAGAGGGACCATAGCTAG
mRNA sequence
ATGAATTTTGGGTTCCCACAAGCCATATCTCAGCGACCCAAGAGAATAGTGGGTGATTCGAGTGGTGGTGTCCGTTGGAAGTCCCAAGAACGAGATCCTTGGAGCATTGT
GCAGTGTTCTTCGGAATCCGTTGGAATTGTTTTAGAAGTGTTAATCTCATTCTGGCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGTTGTT
TAGGGTTATGGTCTTCCTCGTTCTCCTCTTCATCACCAGGATCAGGTTGCTGCTTAGACGTTCCGATTGTCGATCAAGATGATCAAGTAGAGGAAGTTACTACTCAGCAG
GGGGTCGATCCTCTGGCTCCCCCTATGCAGGAGGCTAATCCCCTGATTCCTCCCGGTCAGCGCAGGGTTGATCCTCCTCCTCCCCCGCCTCCTCCGGCCCCTCCTGCAGC
TCCTATGCTGATCACTCCGGAAGCCCTCCAGACCATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACCGCGAAACCCTAATTGGGTACCTGAGAACGCGGAGGAAT
CCCAGTTCATTAGGGACTTCAAGCGCTACGGGCCTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCGGCAGAGCGATGGATCACTGATTTGGAGGCACTGTTTGAC
CTCATGAACTGTAATGATTCCTTGAAAATCAGAGGGGCAGTTTTCATGCTCAAGGATGACGCTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAGACCATGCTAATCG
GCCGATCTCGTGGGAAAGGTTCAAGGATCTGTTGTATGATTATTACTTCCCGGAGACGTATGAGAGGAAGTTCACTACACTATCACGCTTTGCTCCTGACCTGGTCAGCA
CGCCAGAGCGGAAGATTAAGAGGTTCATTAAAGGTCTTCGTGAGGAAATTCGAGGCTCTGTAGCCTTAAGCAGGCCCGCGACCTTTGCTGAAGACTCACTGGGTGCATTG
ATCATGGATAAGAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAGGGATCAACCTCTGGAGATAAAAGAAAGTTGTCTCCCCTGAGGAACCCACCTATTGAGCCTAC
TCAGCAACAGCCCAGACGCCAAGTGCCCAAGAAGGTTAGCCAAGCAAATATCAATGGAGTCCTTAAAGGTGGGAAGTCGATTTCACCTCTCATTTCCAGACCAAGAGTTA
GAGCAATTCCCTCATGCAGTGAGACCTCAATCCAAAGCAGAGATAGTGAGAATCAATATCGAGTCATGACTGCAGCGACAGTTGTCGTATCGACAAAGTTACCTTGGTGT
GCAGTTATAAGGAACCAAGGGTTGCAAGATCCAGTTCGGATTGGGTTTTGTCAGACCTATGGTGTTCTTAAAGCACTTGGTCAAGCTGCCTGTAAGCGAGCTTTGCTTCA
AGTGCTAGGAAAGTGGCTAGATGAGGAAAAGTCGCTGCCGGCAGCAGACCCATCGTCGATCTCTCTCCCTCTCCCCCGCGCGCATCTCTCTCTCCTGTGGTTGGTCTGCG
CCGTCTCTTCTCTCCTTCGTGCGTGGAGTTGCAGCCGTGGATCTCTCTCTCTCCCGTGTTCTGACGCCAACAGCAGCTGGTCCGTGACCCCTGCATCCGCGAGCGCCGGC
CAGCACCCACATCTTCGGTTCTTCTCCTTCAACTCGCGTGTAAGGCGTTTTGGACGTGATTGGGTTGATTTAAGGTTTGTGTCGGCTTGTTCGGGCTCTTTTGCGCCGCC
TAGGTGTTCGATTAGATTCGAAACACATCGACTTGAATACCCACTGCCCAAGGATCGTTCTAGTGCGTTGTTCGAGGATTGTTTAGGAGCGTTTGAGCTTCCGCGTGTTG
GTTCGGTTTGCATTCGAGAGTCAACCATGAGTTGTGAGATGTGGTCATGTAAATGCTTAGTGGGCATGAACATGAGTCTAGAAGCATGTTGCAGGGCATATGCATTCTGG
AGATATAAAGAGGAGCACCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCTGGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTTAGGTCG
AATGCCGAGCTCTGTAGAGAAGTGTCGAGGCCCTGAGTATAAATGGTCAGGGGTCGGTACAACTCGAAGGGTCAGTCTTGGAGAGTTTTTGCGGAGCGTGACGCAGAAGT
CACGTGTGAAGAGCCATGTGAGGGCTAAATATGTGTTGAGCGAGCTCGTGGAGCTTGAATGGTTAGCGAGTTGCGTAGGAGGGGCTACTTACCAGTACCATGGTTGTACT
GACCCCCTCCCTTCTCTTCCCCCCAACTTTCAGATGATGCAGGTTACGAGGACCGTCTTGACCATGGTGATGCAGAGGAGATGCATGAAGAGGGACCATAGCTAG
Protein sequence
MNFGFPQAISQRPKRIVGDSSGGVRWKSQERDPWSIVQCSSESVGIVLEVLISFWQSRVVPVDWPRKSRLFGCLGLWSSSFSSSSPGSGCCLDVPIVDQDDQVEEVTTQQ
GVDPLAPPMQEANPLIPPGQRRVDPPPPPPPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFD
LMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETYERKFTTLSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEDSLGAL
IMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQVPKKVSQANINGVLKGGKSISPLISRPRVRAIPSCSETSIQSRDSENQYRVMTAATVVVSTKLPWC
AVIRNQGLQDPVRIGFCQTYGVLKALGQAACKRALLQVLGKWLDEEKSLPAADPSSISLPLPRAHLSLLWLVCAVSSLLRAWSCSRGSLSLPCSDANSSWSVTPASASAG
QHPHLRFFSFNSRVRRFGRDWVDLRFVSACSGSFAPPRCSIRFETHRLEYPLPKDRSSALFEDCLGAFELPRVGSVCIRESTMSCEMWSCKCLVGMNMSLEACCRAYAFW
RYKEEHRGLGYKWSRVGVMSLEASIEALGINGQGLGRMPSSVEKCRGPEYKWSGVGTTRRVSLGEFLRSVTQKSRVKSHVRAKYVLSELVELEWLASCVGGATYQYHGCT
DPLPSLPPNFQMMQVTRTVLTMVMQRRCMKRDHS