; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001250 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001250
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr4:27782769..27791147
RNA-Seq ExpressionLag0001250
SyntenyLag0001250
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]4.8e-5046.15Show/hide
Query:  APMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------D
        A + +   ALQ + DN     A     P        E+QFIRDF+RYGPP+F+G+SE     E WI +LEAL                           D
Subjt:  APMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------D

Query:  LMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEA
        ++ + EDH N PI+W   KDLLYDYYFP+T+KD+KE EFLHL Q ++ V QYE+KFT  SRFA DL+    RKIKRF++GL + I+G + L RP T+AEA
Subjt:  LMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEA

Query:  LTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL
        + GAL+MDK+V +K QP  + G +SGVKRK+ P+
Subjt:  LTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]2.4e-4951.79Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKE
        E+ FI+DFKRYGPP+FDG+SE   AAE WI +LEA                            D + +AEDHAN  I W RFKDLLYDYY+ ETVKD KE
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKE

Query:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT
        AEFLHL QG++SV QYERKFT LSRFA +L+     KIKRF+KGL + IRG V L RPA++AEA+ GALIMDK+VS K     E GS+SGVKRK  P   
Subjt:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT

Query:  HLLSLLSISPDAKCPRRLAKQTSM
                 P  + P+  A+   M
Subjt:  HLLSLLSISPDAKCPRRLAKQTSM

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]3.0e-4445.38Show/hide
Query:  PPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL---------------------
        PP  P   +L+  EALQ + DN         + P+    + EE QFIRDFKR+GPP F+G SE P AAE W+ +LEAL                      
Subjt:  PPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL---------------------

Query:  -----DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRP
             + + +AEDHAN P++W RFKDLLY+YYFP TV+++K AEFL L Q S+ V QYERKFT LSRF    +   + KI +FI GLR EI+G + L  P
Subjt:  -----DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRP

Query:  ATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS
         T+A A+  AL+MDK + ++PQ     GS+SGVKRK +
Subjt:  ATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]2.0e-5150.63Show/hide
Query:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------DLMNSAEDHANRPISWERFKDLLY
        PP      P++  E++FI+DFKRYGPP+FDG+SE   A E WI +LEAL                           D + +AED+AN PI W RFK+LLY
Subjt:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------DLMNSAEDHANRPISWERFKDLLY

Query:  DYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGS
        DYY+PETVKD KEAEFLHL QG++SV QYERKFT LSRFA +L+     KIKRF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+VS K  P  E GS
Subjt:  DYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGS

Query:  TSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSM
        +SGVKRK       L+         + P+R A+   M
Subjt:  TSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSM

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]1.8e-4948.22Show/hide
Query:  DPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL-------
        DPPPPP        PP PPAA   +   AL      +     +PPR+ +  P++  E+QFI+DFKRYGPP+F G SE    AE W+ +LEAL        
Subjt:  DPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL-------

Query:  -------------------DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKG
                           D + + EDHAN P+ W RFK+LLYD+Y+ ETV+D KE EFLHL QG+++V QYERKFT LS FA +L+     KIKRF+KG
Subjt:  -------------------DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKG

Query:  LREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP
        L + IRGSV L RP T+AEA+ G LIMDK+VS + QP +E GS+ GVKRK+ P
Subjt:  LREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196032.3e-5046.15Show/hide
Query:  APMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------D
        A + +   ALQ + DN     A     P        E+QFIRDF+RYGPP+F+G+SE     E WI +LEAL                           D
Subjt:  APMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------D

Query:  LMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEA
        ++ + EDH N PI+W   KDLLYDYYFP+T+KD+KE EFLHL Q ++ V QYE+KFT  SRFA DL+    RKIKRF++GL + I+G + L RP T+AEA
Subjt:  LMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEA

Query:  LTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL
        + GAL+MDK+V +K QP  + G +SGVKRK+ P+
Subjt:  LTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL

A0A6J1DL73 uncharacterized protein LOC1110221441.2e-4951.79Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKE
        E+ FI+DFKRYGPP+FDG+SE   AAE WI +LEA                            D + +AEDHAN  I W RFKDLLYDYY+ ETVKD KE
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKE

Query:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT
        AEFLHL QG++SV QYERKFT LSRFA +L+     KIKRF+KGL + IRG V L RPA++AEA+ GALIMDK+VS K     E GS+SGVKRK  P   
Subjt:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT

Query:  HLLSLLSISPDAKCPRRLAKQTSM
                 P  + P+  A+   M
Subjt:  HLLSLLSISPDAKCPRRLAKQTSM

A0A6J1DNV8 uncharacterized protein LOC1110229251.5e-4445.38Show/hide
Query:  PPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL---------------------
        PP  P   +L+  EALQ + DN         + P+    + EE QFIRDFKR+GPP F+G SE P AAE W+ +LEAL                      
Subjt:  PPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL---------------------

Query:  -----DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRP
             + + +AEDHAN P++W RFKDLLY+YYFP TV+++K AEFL L Q S+ V QYERKFT LSRF    +   + KI +FI GLR EI+G + L  P
Subjt:  -----DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRP

Query:  ATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS
         T+A A+  AL+MDK + ++PQ     GS+SGVKRK +
Subjt:  ATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS

A0A6J1DUM2 uncharacterized protein LOC1110232479.5e-5250.63Show/hide
Query:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------DLMNSAEDHANRPISWERFKDLLY
        PP      P++  E++FI+DFKRYGPP+FDG+SE   A E WI +LEAL                           D + +AED+AN PI W RFK+LLY
Subjt:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL--------------------------DLMNSAEDHANRPISWERFKDLLY

Query:  DYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGS
        DYY+PETVKD KEAEFLHL QG++SV QYERKFT LSRFA +L+     KIKRF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+VS K  P  E GS
Subjt:  DYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGS

Query:  TSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSM
        +SGVKRK       L+         + P+R A+   M
Subjt:  TSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSM

A0A6J1DVA0 uncharacterized protein LOC1110234248.9e-5048.22Show/hide
Query:  DPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL-------
        DPPPPP        PP PPAA   +   AL      +     +PPR+ +  P++  E+QFI+DFKRYGPP+F G SE    AE W+ +LEAL        
Subjt:  DPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL-------

Query:  -------------------DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKG
                           D + + EDHAN P+ W RFK+LLYD+Y+ ETV+D KE EFLHL QG+++V QYERKFT LS FA +L+     KIKRF+KG
Subjt:  -------------------DLMNSAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKG

Query:  LREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP
        L + IRGSV L RP T+AEA+ G LIMDK+VS + QP +E GS+ GVKRK+ P
Subjt:  LREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTTCCTCGTTCTCCTCTCCATCACCAGGGTTGATCCTCCTCCTCCCCCGCCTCCTCCGGCCCCTCCTGCGGCTCCTATGCTGATCACTCCGGAAGCCCTCCAGAC
CATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACCGCGAAACCCTAATTGGGTACCTGAGAACGCGGAGGAATCCCAGTTCATTAGGGACTTCAAGCGCTACGGGC
CTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCAGCAGAGCGATGGATCACTGATTTGGAGGCACTGTTGGACCTCATGAACTCAGCCGAAGACCATGCTAATCGA
CCGATCTCGTGGGAAAGGTTCAAGGATCTATTGTATGATTATTACTTCCCGGAGACAGTCAAGGACGACAAAGAAGCGGAATTTCTTCATTTGGCCCAGGGGAGTATGTC
TGTAGTGCAGTATGAGAGGAAGTTCACTGCACTATCACGCTTTGCTCCTGACCTGGTCAGCGCGCCAGAGCGGAAGATTAAGAGGTTCATTAAAGGTCTTCGTGAGGAAA
TTCGAGGCTCTGTGGCCCTAAGCAGGCCCGCGACCTTTGCTGAAGCACTCACGGGTGCATTGATCATGGATAAGAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAG
GGATCAACCTCTGGAGTTAAAAGAAAGTTGTCTCCCCTGAGGACCCACCTATTGAGCCTACTCAGCATCAGCCCAGACGCCAAGTGCCCAAGGAGGTTAGCCAAGCAAAC
ATCAATGGAGTCCTTAAAGGTGGGAAGTCGGTTTCACCTCTCATTTCCAGCGTTTTGCAACTATTATGGAATCCATGAACCAAGTCTCAAGGTCTTTTTGGACTTGCTCA
TGATAATGCTCTTGGTGTTAAGGAAGTGCCATCTCATAGCCATTCTTGAAGGATGGGTTGAAAAGGGAGTTGACGCAGCGGAAGCCGTTGCAGAGTTTGTTCCTTCCTTC
AGTGATGCCTCTTTGAACTCCAACTTTGCTAGTGAAGTCAGCTTTTTGATGCACATGGCTAAGTTTCCTGTAGATAGTGCAAGAACCCAGTTCGGATTGGTGTTTTTCAG
ACCTTTGGTGTTCTTAACGCACTTGGTCAAGTTGCCGTTAAGCTTTGGGATTTCAGTGGAAGGTGGCTTGTTGAGCAAGAAGCCACTGCCGCAAACCAGCCCCCTTTCTT
TACGTTTTTCTTCTCCCCTGCGCAGCCGGCCAGCCCCACCGCCACTCGAGCTCGCGCCACCGCCTCCCCCTGCTGCCGCCGCTCGAGTTCCAGCCGGTCAGTCACCGTCG
CAGCCGCCTTGTCGCGCCGTCGCCGCCAGTCTCCCTTCCGCACGACTCTATCTCTTTTGTCTCGCCGTCTCCGCGTCGTTTCCTCTCTGTCCGTGCGTTTTTGGTCGAGT
TAAGTATCGGCTCGGAGTCTCCTTTCCTCGCGTTTTCGCCTCTGTCCAGCAGCGTCGTTGGGCGTTTTCGGCGTGGGTTAGTGTTTCCGCGCCGTCTAGGTGTTCGATTA
AGTTCGAAACACTTCAACTTGGGTACCCACTGCTCGAAGAGCGTTCTAGCTCACTGGTTGTGGTTGGTATAACCCGTCTAGCGCAAAAGCGGGTGTATTCGGTTGCTGTT
CAGCGAGCGTTTGGTCTCAAATATCGTGTTCAGCGAATACCCACAACTCGAAAGACCTTGATTTTGGTTACCCATAACCCGGTGACTTGGGATCTTGGTTGTTGGGTCGT
TTCGAACACAAGTCGGCTCGTCTCAAGGGCCTCGGGTATAAAAGGTCGGGGACTGATATATCACTATTGGTGTCGATGCCACGGGTATAAATGGTCGAAGGTCGATATGC
CAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAGGCAAGTATTGAGGCCTTGGGTATAAACGGTCAAGGGTCA
ATACGTTGCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCTTGGGTCGCAGGGAAATGTCGGGGCCTTAAGTGCAGAATCCGGAT
TCTGAATCCTGGGCCTGGGGCGTTACAGATGACCTTATTAGACTTGATGCTCCAAAGAAAATTTCACGGATCTACTAATGGTGAAGACAAGTTTCAAATTATAAGCTTTT
CTGAGAACAATGATCTTGCCTGGATGTTCACTGGGGTCCTGAAAGACGAGGTTTTCAGAGCATTTTACATTGTCAAGGAAGATGAGCCTTCAACTGTTTTGAGAAACAAA
GAAAAGCTCTATGGAACCAATGAAGCAAACATAAAGGCATTACGGAGAAGCTTCAATGCTGTTAAACCCTTGCATCTTAGCAATACAATGGTTAGAGGCCAACTATCACC
TCAAAGTTTGCTTACGAGCATGGTGTCGAAAATCAAGATTACACTGCCTAGGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCTTCCTCGTTCTCCTCTCCATCACCAGGGTTGATCCTCCTCCTCCCCCGCCTCCTCCGGCCCCTCCTGCGGCTCCTATGCTGATCACTCCGGAAGCCCTCCAGAC
CATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACCGCGAAACCCTAATTGGGTACCTGAGAACGCGGAGGAATCCCAGTTCATTAGGGACTTCAAGCGCTACGGGC
CTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCAGCAGAGCGATGGATCACTGATTTGGAGGCACTGTTGGACCTCATGAACTCAGCCGAAGACCATGCTAATCGA
CCGATCTCGTGGGAAAGGTTCAAGGATCTATTGTATGATTATTACTTCCCGGAGACAGTCAAGGACGACAAAGAAGCGGAATTTCTTCATTTGGCCCAGGGGAGTATGTC
TGTAGTGCAGTATGAGAGGAAGTTCACTGCACTATCACGCTTTGCTCCTGACCTGGTCAGCGCGCCAGAGCGGAAGATTAAGAGGTTCATTAAAGGTCTTCGTGAGGAAA
TTCGAGGCTCTGTGGCCCTAAGCAGGCCCGCGACCTTTGCTGAAGCACTCACGGGTGCATTGATCATGGATAAGAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAG
GGATCAACCTCTGGAGTTAAAAGAAAGTTGTCTCCCCTGAGGACCCACCTATTGAGCCTACTCAGCATCAGCCCAGACGCCAAGTGCCCAAGGAGGTTAGCCAAGCAAAC
ATCAATGGAGTCCTTAAAGGTGGGAAGTCGGTTTCACCTCTCATTTCCAGCGTTTTGCAACTATTATGGAATCCATGAACCAAGTCTCAAGGTCTTTTTGGACTTGCTCA
TGATAATGCTCTTGGTGTTAAGGAAGTGCCATCTCATAGCCATTCTTGAAGGATGGGTTGAAAAGGGAGTTGACGCAGCGGAAGCCGTTGCAGAGTTTGTTCCTTCCTTC
AGTGATGCCTCTTTGAACTCCAACTTTGCTAGTGAAGTCAGCTTTTTGATGCACATGGCTAAGTTTCCTGTAGATAGTGCAAGAACCCAGTTCGGATTGGTGTTTTTCAG
ACCTTTGGTGTTCTTAACGCACTTGGTCAAGTTGCCGTTAAGCTTTGGGATTTCAGTGGAAGGTGGCTTGTTGAGCAAGAAGCCACTGCCGCAAACCAGCCCCCTTTCTT
TACGTTTTTCTTCTCCCCTGCGCAGCCGGCCAGCCCCACCGCCACTCGAGCTCGCGCCACCGCCTCCCCCTGCTGCCGCCGCTCGAGTTCCAGCCGGTCAGTCACCGTCG
CAGCCGCCTTGTCGCGCCGTCGCCGCCAGTCTCCCTTCCGCACGACTCTATCTCTTTTGTCTCGCCGTCTCCGCGTCGTTTCCTCTCTGTCCGTGCGTTTTTGGTCGAGT
TAAGTATCGGCTCGGAGTCTCCTTTCCTCGCGTTTTCGCCTCTGTCCAGCAGCGTCGTTGGGCGTTTTCGGCGTGGGTTAGTGTTTCCGCGCCGTCTAGGTGTTCGATTA
AGTTCGAAACACTTCAACTTGGGTACCCACTGCTCGAAGAGCGTTCTAGCTCACTGGTTGTGGTTGGTATAACCCGTCTAGCGCAAAAGCGGGTGTATTCGGTTGCTGTT
CAGCGAGCGTTTGGTCTCAAATATCGTGTTCAGCGAATACCCACAACTCGAAAGACCTTGATTTTGGTTACCCATAACCCGGTGACTTGGGATCTTGGTTGTTGGGTCGT
TTCGAACACAAGTCGGCTCGTCTCAAGGGCCTCGGGTATAAAAGGTCGGGGACTGATATATCACTATTGGTGTCGATGCCACGGGTATAAATGGTCGAAGGTCGATATGC
CAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAGGCAAGTATTGAGGCCTTGGGTATAAACGGTCAAGGGTCA
ATACGTTGCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCTTGGGTCGCAGGGAAATGTCGGGGCCTTAAGTGCAGAATCCGGAT
TCTGAATCCTGGGCCTGGGGCGTTACAGATGACCTTATTAGACTTGATGCTCCAAAGAAAATTTCACGGATCTACTAATGGTGAAGACAAGTTTCAAATTATAAGCTTTT
CTGAGAACAATGATCTTGCCTGGATGTTCACTGGGGTCCTGAAAGACGAGGTTTTCAGAGCATTTTACATTGTCAAGGAAGATGAGCCTTCAACTGTTTTGAGAAACAAA
GAAAAGCTCTATGGAACCAATGAAGCAAACATAAAGGCATTACGGAGAAGCTTCAATGCTGTTAAACCCTTGCATCTTAGCAATACAATGGTTAGAGGCCAACTATCACC
TCAAAGTTTGCTTACGAGCATGGTGTCGAAAATCAAGATTACACTGCCTAGGATTTGA
Protein sequenceShow/hide protein sequence
MVFLVLLSITRVDPPPPPPPPAPPAAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNSAEDHANR
PISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSAPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEK
GSTSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSMESLKVGSRFHLSFPAFCNYYGIHEPSLKVFLDLLMIMLLVLRKCHLIAILEGWVEKGVDAAEAVAEFVPSF
SDASLNSNFASEVSFLMHMAKFPVDSARTQFGLVFFRPLVFLTHLVKLPLSFGISVEGGLLSKKPLPQTSPLSLRFSSPLRSRPAPPPLELAPPPPPAAAARVPAGQSPS
QPPCRAVAASLPSARLYLFCLAVSASFPLCPCVFGRVKYRLGVSFPRVFASVQQRRWAFSAWVSVSAPSRCSIKFETLQLGYPLLEERSSSLVVVGITRLAQKRVYSVAV
QRAFGLKYRVQRIPTTRKTLILVTHNPVTWDLGCWVVSNTSRLVSRASGIKGRGLIYHYWCRCHGYKWSKVDMPMLDKEEHRGLGYKWSRVGVMSPEASIEALGINGQGS
IRCLVIEALGINGQGSMHSSRPWVAGKCRGLKCRIRILNPGPGALQMTLLDLMLQRKFHGSTNGEDKFQIISFSENNDLAWMFTGVLKDEVFRAFYIVKEDEPSTVLRNK
EKLYGTNEANIKALRRSFNAVKPLHLSNTMVRGQLSPQSLLTSMVSKIKITLPRI