CuGenDBv2

Lag0005611 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005611
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr6:23786320..23794655
RNA-Seq Expression: Lag0005611
Synteny: Lag0005611
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
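
The genome location string can be split into chromosome, start, and end to get the genomic span of the gene. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into its parts and a span length.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record itself does not say which convention it uses.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

print(parse_location("chr6:23786320..23794655"))
# ('chr6', 23786320, 23794655, 8336)
```

Under that assumption the gene spans 8336 bp on chr6.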


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]; e value: 7.4e-46; % identity: 46.86
Query:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH
        RDF+RYGPP+F+G+SE     E WI +LEAL+  + C+D LK++GAVFML+ +A  WW  VA  EDH N PI+W   KDLLYDYYFP+T+KD+KE EFLH
Subjt:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH

Query:  LAQGSMSVVQYERKFTALSRLLL----------------TW-----------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEP
        L Q ++ V QYE+KFT  SR  L                 W           P T+AEA+ GAL+MDK+V +K QP  + G +SG KRK+ P+ +   +P
Subjt:  LAQGSMSVVQYERKFTALSRLLL----------------TW-----------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEP

Query:  TQQQPRR
        ++  P++
Subjt:  TQQQPRR

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]; e value: 1.2e-48; % identity: 52.36
Query:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH
        +DFKRYGPP+FDG+SE   AAE WI +LEA +  + C D  K++GAVFML+ +A  WW S+AAAEDHAN  I W RFKDLLYDYY+ ETVKD KEAEFLH
Subjt:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH

Query:  LAQGSMSVVQYERKFTALSRLLLTW---------------------------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPI--
        L QG++SV QYERKFT LSR  L                             PA++AEA+ GALIMDK+VS K     E GS+SG KRK  P    P   
Subjt:  LAQGSMSVVQYERKFTALSRLLLTW---------------------------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPI--

Query:  EPTQQQPRRQVP
         P  Q   R +P
Subjt:  EPTQQQPRRQVP

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]; e value: 2.5e-49; % identity: 52.63
Query:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH
        +DFKRYGPP+FDG+SE   A E WI +LEAL+  + C D  K++GAVFML+ +A  WW SVAAAED+AN PI W RFK+LLYDYY+PETVKD KEAEFLH
Subjt:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH

Query:  LAQGSMSVVQYERKFTALSRLLLTW---------------------------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKL-SPLRNPPIE
        L QG++SV QYERKFT LSR  L                             P T+AEA+ GAL+MDK+VS K  P  E GS+SG KRK  S   +  + 
Subjt:  LAQGSMSVVQYERKFTALSRLLLTW---------------------------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKL-SPLRNPPIE

Query:  PTQQQPRRQ
          Q+Q + Q
Subjt:  PTQQQPRRQ

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]; e value: 4.8e-45; % identity: 42.8
Query:  PVPPSQAMSRGHDPEVPIVDQDDQVEEVTTQQGSILWLPLCRRLIPYSSRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLK
        P PP+ A       E+ +++    V    TQ    L  P          +DFKRYGPP+F G SE    AE W+ +LEAL+  + C D  K++GAVFML+
Subjt:  PVPPSQAMSRGHDPEVPIVDQDDQVEEVTTQQGSILWLPLCRRLIPYSSRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLK

Query:  DDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRLLLTW--------------------------
         +A  WW SVAA EDHAN P+ W RFK+LLYD+Y+ ETV+D KE EFLHL QG+++V QYERKFT LS   L                            
Subjt:  DDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRLLLTW--------------------------

Query:  -PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQ
         P T+AEA+ G LIMDK+VS + QP +E GS+ G KRK+ P          Q+P +Q
Subjt:  -PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQ

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]; e value: 4.0e-44; % identity: 50.26
Query:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH
        RDFKR+GPP F+G SE P A E W+ +LEAL+  + C+D  K+RGAVFML+ +A  WW+SVAAAEDH N P++W RFKDLLY+YYFP TV+++K AEFL 
Subjt:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH

Query:  LAQGSMSVVQYERKFTALSR---------------------------LLLTWPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLS
        L QGS++V QYERKFT LSR                           L++  P T+A A+  AL+MDK + ++PQ     GS+SG KRK +
Subjt:  LAQGSMSVVQYERKFTALSR---------------------------LLLTWPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DCW8 uncharacterized protein LOC111019603; e value: 3.6e-46; % identity: 46.86
Query:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH
        RDF+RYGPP+F+G+SE     E WI +LEAL+  + C+D LK++GAVFML+ +A  WW  VA  EDH N PI+W   KDLLYDYYFP+T+KD+KE EFLH
Subjt:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH

Query:  LAQGSMSVVQYERKFTALSRLLL----------------TW-----------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEP
        L Q ++ V QYE+KFT  SR  L                 W           P T+AEA+ GAL+MDK+V +K QP  + G +SG KRK+ P+ +   +P
Subjt:  LAQGSMSVVQYERKFTALSRLLL----------------TW-----------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEP

Query:  TQQQPRR
        ++  P++
Subjt:  TQQQPRR

A0A6J1DL73 uncharacterized protein LOC111022144; e value: 5.9e-49; % identity: 52.36
Query:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH
        +DFKRYGPP+FDG+SE   AAE WI +LEA +  + C D  K++GAVFML+ +A  WW S+AAAEDHAN  I W RFKDLLYDYY+ ETVKD KEAEFLH
Subjt:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH

Query:  LAQGSMSVVQYERKFTALSRLLLTW---------------------------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPI--
        L QG++SV QYERKFT LSR  L                             PA++AEA+ GALIMDK+VS K     E GS+SG KRK  P    P   
Subjt:  LAQGSMSVVQYERKFTALSRLLLTW---------------------------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPI--

Query:  EPTQQQPRRQVP
         P  Q   R +P
Subjt:  EPTQQQPRRQVP

A0A6J1DTA8 uncharacterized protein LOC111024114; e value: 2.0e-44; % identity: 50.26
Query:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH
        RDFKR+GPP F+G SE P A E W+ +LEAL+  + C+D  K+RGAVFML+ +A  WW+SVAAAEDH N P++W RFKDLLY+YYFP TV+++K AEFL 
Subjt:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH

Query:  LAQGSMSVVQYERKFTALSR---------------------------LLLTWPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLS
        L QGS++V QYERKFT LSR                           L++  P T+A A+  AL+MDK + ++PQ     GS+SG KRK +
Subjt:  LAQGSMSVVQYERKFTALSR---------------------------LLLTWPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLS

A0A6J1DUM2 uncharacterized protein LOC111023247; e value: 1.2e-49; % identity: 52.63
Query:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH
        +DFKRYGPP+FDG+SE   A E WI +LEAL+  + C D  K++GAVFML+ +A  WW SVAAAED+AN PI W RFK+LLYDYY+PETVKD KEAEFLH
Subjt:  RDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLH

Query:  LAQGSMSVVQYERKFTALSRLLLTW---------------------------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKL-SPLRNPPIE
        L QG++SV QYERKFT LSR  L                             P T+AEA+ GAL+MDK+VS K  P  E GS+SG KRK  S   +  + 
Subjt:  LAQGSMSVVQYERKFTALSRLLLTW---------------------------PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKL-SPLRNPPIE

Query:  PTQQQPRRQ
          Q+Q + Q
Subjt:  PTQQQPRRQ

A0A6J1DVA0 uncharacterized protein LOC111023424; e value: 2.3e-45; % identity: 42.8
Query:  PVPPSQAMSRGHDPEVPIVDQDDQVEEVTTQQGSILWLPLCRRLIPYSSRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLK
        P PP+ A       E+ +++    V    TQ    L  P          +DFKRYGPP+F G SE    AE W+ +LEAL+  + C D  K++GAVFML+
Subjt:  PVPPSQAMSRGHDPEVPIVDQDDQVEEVTTQQGSILWLPLCRRLIPYSSRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLK

Query:  DDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRLLLTW--------------------------
         +A  WW SVAA EDHAN P+ W RFK+LLYD+Y+ ETV+D KE EFLHL QG+++V QYERKFT LS   L                            
Subjt:  DDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRLLLTW--------------------------

Query:  -PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQ
         P T+AEA+ G LIMDK+VS + QP +E GS+ G KRK+ P          Q+P +Q
Subjt:  -PATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQ

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found
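
The middle line of each alignment block above follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, one segment can be tallied as below. The helper `midline_counts` is hypothetical and not part of CuGenDB. The reported % identity covers the whole alignment, so a single segment will not reproduce it.

```python
def midline_counts(midline):
    """Count identities and positives in one BLAST midline segment.

    Letters are identical residues, '+' is a conservative substitution,
    anything else (spaces) is a mismatch or gap column.
    Returns (identical, positives, segment_length).
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, len(midline)

# Midline of the final segment of the first GenBank hit,
# the row between Query "TQQQPRR" and its Subjt line.
identical, positives, length = midline_counts("++  P++")
print(identical, positives, length)
# 1 5 7
```

To approximate the reported percentage, sum the identical counts and segment lengths over every block of a hit, keeping trailing spaces in each midline, and then divide.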

Sequences
CDS sequence
ATGGAGCTTTTATATTCTTGGCAGCCGCCAGACCCCCCCCTGCGTGTTCCTCTTCAGCCGCCAGCCTTCCGTGAAGCTCCAGCGCCGGCCACCTCCTTCAGCCACTGCTC
GAGTACGCCGCCTGTGCACGTCGCGTTGCTGCCGTCAACCTCTCTCTCCCTCTCGCGTGTTCGTCCGCCGGCAGCAAGGCTCGCGAAGCCTTCGTCTTCCTCTCTTTCTT
CATCCGCGTGCGTGTGGGTAGGTTCGTGGTCGTGGGTCTTCACCAGAGCTCCTCCTCTGGCGTCGGTTCGTCTCCTTTGCAGCACTCGCTTCTGTCCTCTCGAGCGTCGA
CCAACACCCGCGATTTCGAGCGTTGTTCCTCCAAACTCGTGGCAAGCTGGAATCTTCGTTGTTCTGGTGTTTAGGCGCGTTCGGGCTGATTTAAGGATCGTTTCGGCGTG
CTTAGGCTCTTTCGGTAAGCTCTCAGCATTACCCATGCTTTATAAACCTCAATTTGATTACCCATTAGCTTTAGGTGCTGAAAAGTTTAATGTTGTTCGAGTTTCTGATC
TGCAGCGCCGTCTAGGTGTTCGATTAGGTTCGAAACACATCAACTTGCATACCCACTGCCCAAGGACGTTTTGGTGCGTTGTTCGAGGTTGTTGCAACCCGTCTTGCGTG
ATATCGGGTCCGTTAGGTAAGGTGTATTCGGTGGCTGTTCATCGAGCGTTTGATCTCAAATATCGTGTTCAGCGAATACCCACAACTCGAAGGATCTTGATTTTGGTTAC
CCATAACCCGGTGACTTGGGATCTTGGTTGTTGGGTCGTTTCGAACACAAGTCGGCTTGTCTCAAGGGCCTCGGGTATAAAAGGTCGGGGACTGATATATCACTATTGGT
GTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAG
GCAAGTATTGAGGCCTTGGGTATAAACGGTCAAGGGTCAATACGTTCCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCTTCGGT
AGGGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCCCCTCTCTCTCCCCCCAACTACCAGATTTTGCAGGTTATGAGGACTGCGTGGACCGTGTTGCTTACTGTGG
CTGTTGGTTCTGTGAATGTTCTGTCGGGTTGTAGTTGGTGGAGTAATTTGATAATGTGTATATGGAAGTGGTTTTTTTCTTGTTCAGCAGGTGTAGGAAAATTTTGGGTT
TGTCCAATGTGCGTCGTTATGCTGCCGAAATTTTCGGTGCTCACGGTTTGTTTTGGTCTAGGAGTTAGTAATGTCGCTGGGTTAGCTTTTAAAATCTCGGGGCGTTACAG
TTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGTTGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCACCAGTACCACCTT
CTCAGGCAATGTCTCGCGGTCATGACCCTGAAGTTCCAATTGTCGATCAAGATGATCAAGTAGAGGAAGTTACTACTCAGCAGGGGTCGATCCTCTGGCTCCCCCTATGC
AGGAGGCTAATCCCCTATTCCTCCCGGGACTTCAAGCGCTACGGGCCTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCAGCAGAGCGATGGATCACTGATTTGGA
GGCACTGTTTGACCTCATGAACTGTAATGATTCCTTGAAAATCAGAGGGGCAGTTTTCATGCTCAAGGATGACGCTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAG
ACCATGCTAATCGACCGATCTCGTGGGAAAGGTTCAAGGATCTATTGTATGATTATTACTTCCCGGAGACAGTCAAGGACGACAAAGAAGCGGAATTTCTTCATTTGGCC
CAGGGGAGTATGTCTGTAGTGCAGTATGAGAGGAAGTTCACTGCACTATCACGTTTGCTCCTGACCTGGCCCGCGACCTTTGCTGAAGCACTCACGGGTGCATTGATCAT
GGATAAGAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAGGGATCAACCTCTGGAGATAAAAGAAAGTTGTCTCCCCTGAGGAACCCACCTATTGAGCCTACTCAGC
AACAGCCCAGACGCCAAGTGCCCAAGGAGGTTAGCCAAGCAAACATCAATGGAGTCCTTAAAGGTGGGAAGTCGGTTTCACCTCTCATTTCCAGGCTTAAAGTTGGTGAA
GTTGTTGATGAAGCCTTTCGTAGGACCGGGATGAGCATTCTTGAGGTCATTGAAGATGGAGTCAAGGCCAGTCCACCAGTGTGGTTAGGCAGCTCTTTCGAGTTCTGGAG
TTCAGTTTTGGGTGATCGTGTGTTCTGGCACATGGTGTCGTCGAAAGGTTCTTTGCTTGTCGATAAGAAGCTTAGCTTGGGTTCATTGCTCCTTTTGAGGTCTTGGAGCG
CATCGGGCTTGTTGCTTACAGGTTGGCGTTATCACCGACCATGTCAACTGAGCGTGATGTGTCTATGTGTCCACGTTGTGGAAATGCGTGCGTGTTCCTTCGCATGTGTG
AATTCTGAACCCCTGTCGTCGATCGAGGACTTGACATTTGTGGTGGAGAAGTCTGTCAAGGTCCTATCCAAAGAAGCGAGCAAGGTCGACCTTCGCCGTCCTTCTCTCTC
TGCGCGTCTCTCCCGCGAACTCCTCCTCTCTCGGTTCGTTCTTGAGCCTCGTCTGCCCGAGCGTCGACCACCCGCATTCAGCGTTGTCTCCACTGGCGTAGTCTTTTCCT
CTGTCGTCTTTCATCGCTGCAGTTCGATCCTCTTTCCTTGTTTGCTGGCTGTGAGTAGAGCGGATCTTGTGCTGGTTTTGGCCGTTCGGCGATTAGGTCGTTTCGGTGTT
AGCTCTTTCGCGCCGTCTAGGTGTTCGATTAGGTTCGAAACACATCGACTTGCATACCCACTGCCAAGGATCGTTCTAGTGCGTTGTTCGAGGTTGTTGCAACCCGTCTT
GCGTGATATCGGGTCCGTTGGGGCCTCGGGTATAAAAGGTCGGGGACTGATATATCACTATTGGTGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGT
TAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCATACGTT
GCTTAG
mRNA sequence
ATGGAGCTTTTATATTCTTGGCAGCCGCCAGACCCCCCCCTGCGTGTTCCTCTTCAGCCGCCAGCCTTCCGTGAAGCTCCAGCGCCGGCCACCTCCTTCAGCCACTGCTC
GAGTACGCCGCCTGTGCACGTCGCGTTGCTGCCGTCAACCTCTCTCTCCCTCTCGCGTGTTCGTCCGCCGGCAGCAAGGCTCGCGAAGCCTTCGTCTTCCTCTCTTTCTT
CATCCGCGTGCGTGTGGGTAGGTTCGTGGTCGTGGGTCTTCACCAGAGCTCCTCCTCTGGCGTCGGTTCGTCTCCTTTGCAGCACTCGCTTCTGTCCTCTCGAGCGTCGA
CCAACACCCGCGATTTCGAGCGTTGTTCCTCCAAACTCGTGGCAAGCTGGAATCTTCGTTGTTCTGGTGTTTAGGCGCGTTCGGGCTGATTTAAGGATCGTTTCGGCGTG
CTTAGGCTCTTTCGGTAAGCTCTCAGCATTACCCATGCTTTATAAACCTCAATTTGATTACCCATTAGCTTTAGGTGCTGAAAAGTTTAATGTTGTTCGAGTTTCTGATC
TGCAGCGCCGTCTAGGTGTTCGATTAGGTTCGAAACACATCAACTTGCATACCCACTGCCCAAGGACGTTTTGGTGCGTTGTTCGAGGTTGTTGCAACCCGTCTTGCGTG
ATATCGGGTCCGTTAGGTAAGGTGTATTCGGTGGCTGTTCATCGAGCGTTTGATCTCAAATATCGTGTTCAGCGAATACCCACAACTCGAAGGATCTTGATTTTGGTTAC
CCATAACCCGGTGACTTGGGATCTTGGTTGTTGGGTCGTTTCGAACACAAGTCGGCTTGTCTCAAGGGCCTCGGGTATAAAAGGTCGGGGACTGATATATCACTATTGGT
GTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAG
GCAAGTATTGAGGCCTTGGGTATAAACGGTCAAGGGTCAATACGTTCCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCTTCGGT
AGGGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCCCCTCTCTCTCCCCCCAACTACCAGATTTTGCAGGTTATGAGGACTGCGTGGACCGTGTTGCTTACTGTGG
CTGTTGGTTCTGTGAATGTTCTGTCGGGTTGTAGTTGGTGGAGTAATTTGATAATGTGTATATGGAAGTGGTTTTTTTCTTGTTCAGCAGGTGTAGGAAAATTTTGGGTT
TGTCCAATGTGCGTCGTTATGCTGCCGAAATTTTCGGTGCTCACGGTTTGTTTTGGTCTAGGAGTTAGTAATGTCGCTGGGTTAGCTTTTAAAATCTCGGGGCGTTACAG
TTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGTTGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCACCAGTACCACCTT
CTCAGGCAATGTCTCGCGGTCATGACCCTGAAGTTCCAATTGTCGATCAAGATGATCAAGTAGAGGAAGTTACTACTCAGCAGGGGTCGATCCTCTGGCTCCCCCTATGC
AGGAGGCTAATCCCCTATTCCTCCCGGGACTTCAAGCGCTACGGGCCTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCAGCAGAGCGATGGATCACTGATTTGGA
GGCACTGTTTGACCTCATGAACTGTAATGATTCCTTGAAAATCAGAGGGGCAGTTTTCATGCTCAAGGATGACGCTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAG
ACCATGCTAATCGACCGATCTCGTGGGAAAGGTTCAAGGATCTATTGTATGATTATTACTTCCCGGAGACAGTCAAGGACGACAAAGAAGCGGAATTTCTTCATTTGGCC
CAGGGGAGTATGTCTGTAGTGCAGTATGAGAGGAAGTTCACTGCACTATCACGTTTGCTCCTGACCTGGCCCGCGACCTTTGCTGAAGCACTCACGGGTGCATTGATCAT
GGATAAGAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAGGGATCAACCTCTGGAGATAAAAGAAAGTTGTCTCCCCTGAGGAACCCACCTATTGAGCCTACTCAGC
AACAGCCCAGACGCCAAGTGCCCAAGGAGGTTAGCCAAGCAAACATCAATGGAGTCCTTAAAGGTGGGAAGTCGGTTTCACCTCTCATTTCCAGGCTTAAAGTTGGTGAA
GTTGTTGATGAAGCCTTTCGTAGGACCGGGATGAGCATTCTTGAGGTCATTGAAGATGGAGTCAAGGCCAGTCCACCAGTGTGGTTAGGCAGCTCTTTCGAGTTCTGGAG
TTCAGTTTTGGGTGATCGTGTGTTCTGGCACATGGTGTCGTCGAAAGGTTCTTTGCTTGTCGATAAGAAGCTTAGCTTGGGTTCATTGCTCCTTTTGAGGTCTTGGAGCG
CATCGGGCTTGTTGCTTACAGGTTGGCGTTATCACCGACCATGTCAACTGAGCGTGATGTGTCTATGTGTCCACGTTGTGGAAATGCGTGCGTGTTCCTTCGCATGTGTG
AATTCTGAACCCCTGTCGTCGATCGAGGACTTGACATTTGTGGTGGAGAAGTCTGTCAAGGTCCTATCCAAAGAAGCGAGCAAGGTCGACCTTCGCCGTCCTTCTCTCTC
TGCGCGTCTCTCCCGCGAACTCCTCCTCTCTCGGTTCGTTCTTGAGCCTCGTCTGCCCGAGCGTCGACCACCCGCATTCAGCGTTGTCTCCACTGGCGTAGTCTTTTCCT
CTGTCGTCTTTCATCGCTGCAGTTCGATCCTCTTTCCTTGTTTGCTGGCTGTGAGTAGAGCGGATCTTGTGCTGGTTTTGGCCGTTCGGCGATTAGGTCGTTTCGGTGTT
AGCTCTTTCGCGCCGTCTAGGTGTTCGATTAGGTTCGAAACACATCGACTTGCATACCCACTGCCAAGGATCGTTCTAGTGCGTTGTTCGAGGTTGTTGCAACCCGTCTT
GCGTGATATCGGGTCCGTTGGGGCCTCGGGTATAAAAGGTCGGGGACTGATATATCACTATTGGTGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGT
TAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCATACGTT
GCTTAG
Protein sequence
MELLYSWQPPDPPLRVPLQPPAFREAPAPATSFSHCSSTPPVHVALLPSTSLSLSRVRPPAARLAKPSSSSLSSSACVWVGSWSWVFTRAPPLASVRLLCSTRFCPLERR
PTPAISSVVPPNSWQAGIFVVLVFRRVRADLRIVSACLGSFGKLSALPMLYKPQFDYPLALGAEKFNVVRVSDLQRRLGVRLGSKHINLHTHCPRTFWCVVRGCCNPSCV
ISGPLGKVYSVAVHRAFDLKYRVQRIPTTRRILILVTHNPVTWDLGCWVVSNTSRLVSRASGIKGRGLIYHYWCRCLGYKWSRVDMPMLDKEEHRGLGYKWSRVGVMSPE
ASIEALGINGQGSIRSLVIEALGINGQGSMHSSRPSVGAAYQYLSVLTPSPLSPPNYQILQVMRTAWTVLLTVAVGSVNVLSGCSWWSNLIMCIWKWFFSCSAGVGKFWV
CPMCVVMLPKFSVLTVCFGLGVSNVAGLAFKISGRYSWYQSRVVPVDWPRKSRLFGCLGLWSSSFSSPSPVPPSQAMSRGHDPEVPIVDQDDQVEEVTTQQGSILWLPLC
RRLIPYSSRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLA
QGSMSVVQYERKFTALSRLLLTWPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGDKRKLSPLRNPPIEPTQQQPRRQVPKEVSQANINGVLKGGKSVSPLISRLKVGE
VVDEAFRRTGMSILEVIEDGVKASPPVWLGSSFEFWSSVLGDRVFWHMVSSKGSLLVDKKLSLGSLLLLRSWSASGLLLTGWRYHRPCQLSVMCLCVHVVEMRACSFACV
NSEPLSSIEDLTFVVEKSVKVLSKEASKVDLRRPSLSARLSRELLLSRFVLEPRLPERRPPAFSVVSTGVVFSSVVFHRCSSILFPCLLAVSRADLVLVLAVRRLGRFGV
SSFAPSRCSIRFETHRLAYPLPRIVLVRCSRLLQPVLRDIGSVGASGIKGRGLIYHYWCRCLGYKWSRVDMPMLDKEEHRGLGYKWSRVGVMSPEASIEALGINGQGSYV
A
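
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a minimal illustration, not the pipeline CuGenDB used. The names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard nuclear code (NCBI translation table 1) is an assumption.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    Trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 bases of the CDS listed above.
print(translate("ATGGAGCTTTTATATTCTTGGCAGCCGCCAGAC"))
# MELLYSWQPPD
```

The translated prefix matches the start of the listed protein sequence (`MELLYSWQPPD`). The final six bases of the CDS, `GCTTAG`, translate to `A` followed by a stop codon, which agrees with the last residue of the protein.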